Scale testing

@bbockelm reports possible issues observed by the Coffea team when scaling past ~50 workers with TLS, as well as issues where auto-scaled-down workers are killed while still holding useful results in memory. We should investigate both issues on our setup and see if we can reproduce them.

I'm also interested in testing overall stability during large/long calculations by manually killing workers and seeing if Dask can dynamically recover in a reasonable way (as it claims it can).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scale testing #23

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Scale testing #23

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions