Commit 8935518
fix: register endpoint before jax.distributed.initialize to avoid deadlock
jax.distributed.initialize() blocks until all processes connect, so
registering the endpoint after calling it would deadlock — tasks 1..N-1
would never discover the coordinator address. JAX's internal gRPC retry
handles the brief window between registration and the coordinator
starting to listen.
Co-authored-by: Russell Power <rjpower@users.noreply.github.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent 3ccc7d0 commit 8935518
1 file changed
+6
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
96 | 96 | | |
97 | 97 | | |
98 | 98 | | |
99 | | - | |
100 | | - | |
101 | | - | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
102 | 104 | | |
103 | 105 | | |
104 | 106 | | |
105 | 107 | | |
| 108 | + | |
106 | 109 | | |
107 | 110 | | |
108 | 111 | | |
0 commit comments