Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][fix] Always sync local ranks after prefetch in HfWeightLoader
#13556 opened Apr 28, 2026 by lancelly Collaborator Loading…
[https://nvbugs/6087632][fix] fix test def to use local model
#13555 opened Apr 28, 2026 by bo-nv Collaborator Loading…
1 task
[None][test] Test coverage and repro for #13320
#13553 opened Apr 28, 2026 by eopXD Collaborator Loading…
1 task done
Pearl
#13551 opened Apr 28, 2026 by zhaoyangwang-nvidia Collaborator Draft
1 task
[None][test] Unwaive DSR1 V32 Agg TEP tests
#13550 opened Apr 28, 2026 by chenfeiz0326 Collaborator Loading…
1 task done
[None][doc] Blogpost for Helix Parallelism
#13547 opened Apr 28, 2026 by brb-nv Collaborator Loading…
1 task done
disagg support of cpp/KVCacheManager+LinearAttention
#13546 opened Apr 28, 2026 by VALLIS-NERIA Collaborator Draft
1 task
[TRTLLM-11228][feat] Update quickstart for DFlash
#13545 opened Apr 28, 2026 by ziyixiong-nv Collaborator Loading…
1 task
[https://nvbugs/6029882][fix] Clamp tokens_info writes in computeSeqAndPaddingOffsets
#13544 opened Apr 28, 2026 by bobboli Collaborator Loading…
2 of 3 tasks
Fix beam-search requests not terminating at large beam_width Community want to contribute PRs initiated from Community
#13543 opened Apr 28, 2026 by Doloxetine Loading…
[None][chore] Convert cubins in repository to compressed archives
#13542 opened Apr 28, 2026 by tongyuantongyu Member Loading…
1 task done
[None][test] Waive 9 failed cases for main in QA CI
#13540 opened Apr 28, 2026 by xinhe-nv Collaborator Loading…
[None][test] rename test case and add fallback for multinode cases
#13537 opened Apr 28, 2026 by ruodil Collaborator Loading…
1 task done
[None][fix] write per-rank torch profile traces
#13536 opened Apr 28, 2026 by GavinZhu-GMI Loading…
3 of 4 tasks
[None][chore] AutoDeploy: Remove Two Model Speculative Decoding Support
#13532 opened Apr 28, 2026 by govind-ramnarayan Collaborator Loading…
1 task done
[TRTLLMINF-43][feat] Extend infrastructure-failure retry to K8s test stages
#13530 opened Apr 27, 2026 by dpitman-nvda Collaborator Loading…
1 task done
[None][fix] fix PEFT page accumulation in MaxUtilizationPolicy scheduler
#13528 opened Apr 27, 2026 by achartier Collaborator Loading…
1 task done
[https://nvbugs/5996024][fix] Enforce trust_remote_code flag
#13527 opened Apr 27, 2026 by yibinl-nvidia Collaborator Draft
1 task
[TRTLLMINF-45][infra] Upload rendered HTML failure analysis
#13526 opened Apr 27, 2026 by dpitman-nvda Collaborator Loading…
1 task done
ProTip! Type g i on any issue or pull request to go back to the issue listing page.