[BUG] Using Client.wait_for_workers Does Not Properly Wait for Workers #4082

Open
@alexbarghi-nv

Description

While running benchmarks for the GNN packages in a multi-node environment, @jnke2016 and I found that calling Client.wait_for_workers does not work properly: the Dask workflow that follows either hangs or crashes. We currently work around this with a separate script (wait_for_workers.py) that waits for all workers before launching the workflow. This workaround should be removed once the bug is fixed, so that Client.wait_for_workers can be called as the Dask API intends.
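For illustration only, here is a minimal sketch of the kind of polling loop a standalone wait script might use. It is not the actual wait_for_workers.py, whose contents are not shown in this issue. The function name `wait_for_worker_count` and its parameters are hypothetical, and the worker count comes from a caller-supplied callable so the sketch runs without a cluster.

```python
import time
from typing import Callable


def wait_for_worker_count(
    get_count: Callable[[], int],
    expected: int,
    timeout: float = 300.0,
    poll_interval: float = 1.0,
) -> int:
    """Poll get_count() until it reports at least `expected` workers.

    Hypothetical helper, not part of Dask. Returns the observed count,
    or raises TimeoutError if the deadline passes first.
    """
    deadline = time.monotonic() + timeout
    while True:
        count = get_count()
        if count >= expected:
            return count
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"only {count} of {expected} workers connected after {timeout} s"
            )
        time.sleep(poll_interval)


if __name__ == "__main__":
    # Stand-in for a real cluster: worker counts observed on successive polls.
    observed = iter([0, 2, 4])
    print(wait_for_worker_count(lambda: next(observed), expected=4, poll_interval=0.0))
```

With a real cluster, `get_count` could plausibly wrap something like `len(client.scheduler_info()["workers"])` on a `distributed.Client`, though the exact call and its defaults may vary between distributed versions, so treat that as an assumption to verify. Once the bug is fixed, the intended replacement is a single call to `client.wait_for_workers(n_workers, timeout=...)`.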

Metadata

Labels

bug (Something isn't working), python
