Skip to content

[Feature] Enable Thinker CUDA Graph by Default#404

Merged
zhaochenyang20 merged 1 commit intomainfrom
enable-thinker-cuda-graph
May 6, 2026
Merged

[Feature] Enable Thinker CUDA Graph by Default#404
zhaochenyang20 merged 1 commit intomainfrom
enable-thinker-cuda-graph

Conversation

@zhaochenyang20
Copy link
Copy Markdown
Collaborator

As titled. Will calibrate on #403 and benchmark on #400

Default `disable_cuda_graph` to `False` when creating the SGLang Thinker
executor, and propagate `can_run_cuda_graph` from the model runner output
through `OmniScheduler.run_batch` so the upstream batching path sees the
real value instead of a hardcoded `False`.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@zhaochenyang20 zhaochenyang20 merged commit c3198a9 into main May 6, 2026
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants