chore: make DistributedRunner use random ports by default #919

Open
Gothos wants to merge 2 commits into main from chore/make-distributedrunner-use-random-ports

@Gothos Gothos commented Mar 11, 2026

Makes DistributedRunner use random ports by default (a random port is chosen when the master port or a worker port is set to 0). This suffers from a minor TOCTOU race: another process can claim the port between the moment it is picked and the moment it is actually bound. We retry up to three times, so this should not be a problem in practice. The current default port of 29500 conflicts with the defaults of other packages, such as PyTorch distributed and accelerate launch.
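For readers unfamiliar with the pattern, here is a minimal sketch of the general technique, not the code in this PR. It asks the OS for a free port by binding to port 0, then retries up to three times if the port is taken before the real service binds it. The names `pick_free_port`, `start_with_random_port`, and the `start` callback are hypothetical and only illustrate the idea.

```python
import socket
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def pick_free_port(host: str = "127.0.0.1") -> int:
    """Ask the OS for a currently free TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind((host, 0))
        # The socket is closed on exit, so the port is only *likely* free
        # afterwards. This gap is the TOCTOU window.
        return sock.getsockname()[1]


def start_with_random_port(
    start: Callable[[int], T],
    retries: int = 3,
    host: str = "127.0.0.1",
) -> T:
    """Call start(port) with a fresh random port, retrying on OSError.

    `start` is a hypothetical callback that binds the port and raises
    OSError (for example "address already in use") if it lost the race.
    """
    last_error: Optional[OSError] = None
    for _ in range(retries):
        port = pick_free_port(host)
        try:
            return start(port)
        except OSError as exc:
            last_error = exc  # port was taken in the meantime; pick another
    raise RuntimeError(f"no usable port after {retries} attempts") from last_error
```

With three attempts and a fresh OS-assigned port each time, losing the race on every attempt is unlikely unless the host is under heavy port churn.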
