Description
What's the use case?
Submitting large backfills to QueuedRunCoordinator
can be very slow and painful. How can we speed up the submission of large backfills to the Queue?
Internal context:
Self hosting Dagster and on 1.3.11. Using the K8sRunLauncher
+ Multiprocess Executor with the QueuedRunCoordinator
We’re submitting multiple large backfills (in the 1000s of runs) and have noticed it takes quite awhile for jobs to be enqueued. It appears that all backfills use the same Queue, and if you submit sequential backfills all jobs in earlier backfills must be enqueued before later backfills can be enqueued. This is somewhat problematic if we want to start different backfills in parallel.
Here’s our config for QueuedRunCoordinator
:
class: QueuedRunCoordinator
config:
dequeue_interval_seconds: 5
dequeue_num_workers: 4
dequeue_use_threads: true
max_concurrent_runs: 100
Ideas of implementation
Enqueue runs in parallel, similar to #8642 which was for run submission
Additional information
No response
Message from the maintainers
Impacted by this issue? Give it a 👍! We factor engagement into prioritization.