Skip to content

[Kernel] Hook batch_memcpy: replace triton with xspeedgate_ops#351

Open
Marshall-Ge wants to merge 1 commit into
baidu:mainfrom
Marshall-Ge:feature/hook-batch-memcpy-xspeedgate
Open

[Kernel] Hook batch_memcpy: replace triton with xspeedgate_ops#351
Marshall-Ge wants to merge 1 commit into
baidu:mainfrom
Marshall-Ge:feature/hook-batch-memcpy-xspeedgate

Conversation

@Marshall-Ge
Copy link
Copy Markdown
Contributor

Kunlun XPU does not support triton. Add vllm_kunlun/v1/worker/mamba_utils.py that replaces the upstream triton-based batch_memcpy_kernel with xspeedgate_ops.batch_memcpy for mamba prefix caching support.

@Marshall-Ge Marshall-Ge changed the title [Feature] Hook batch_memcpy: replace triton with xspeedgate_ops for K… [Feature] Hook batch_memcpy: replace triton with xspeedgate_ops May 7, 2026
@Marshall-Ge Marshall-Ge force-pushed the feature/hook-batch-memcpy-xspeedgate branch from 8b23c1f to 39064b0 Compare May 7, 2026 06:16
…unlun XPU

Kunlun XPU does not support triton. Add vllm_kunlun/v1/worker/mamba_utils.py
that replaces the upstream triton-based batch_memcpy_kernel with
xspeedgate_ops.batch_memcpy for mamba prefix caching support.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Marshall <marshall@MarshalldeMacBook-Pro-2.local>
@Marshall-Ge Marshall-Ge force-pushed the feature/hook-batch-memcpy-xspeedgate branch from 39064b0 to 2ed6ace Compare May 7, 2026 06:52
@Marshall-Ge Marshall-Ge changed the title [Feature] Hook batch_memcpy: replace triton with xspeedgate_ops [Kernel] Hook batch_memcpy: replace triton with xspeedgate_ops May 7, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant