Pull requests: jd-opensource/xllm
feat: support the fusion of topk and add operators in the router module.
#322 opened Nov 5, 2025 by DongheJin
refactor: optimize the 'set_device' function calling to avoid set device on each step.
#321 opened Nov 5, 2025 by yq33victor
feat: remove redundant input parameters by add batch forward type.
#314 opened Nov 3, 2025 by RobbieLeung
feat: support PreFetchWeight and IntraAddNorm for qwen3-dense model.
#304 opened Oct 30, 2025 by edison240121
feat: add flashinfer as kernel backend for cuda device.
#246 opened Oct 17, 2025 by XuZhang99