Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

[None][chore] Remove closed bugs
#12766 opened Apr 5, 2026 by xinhe-nv Draft
feat: add Prometheus metrics collection for gRPC server mode (label: Community want to contribute)
#12760 opened Apr 4, 2026 by ConnorLi96
1 task
[None][fix] Draft KV cache should not allocate host memory (label: Community want to contribute)
#12756 opened Apr 3, 2026 by Shang-Pin
1 task
[None][infra] Bump version to 1.2.1
#12755 opened Apr 3, 2026 by yuanjingx87
1 task done
feat: add standard gRPC health service for Kubernetes native probes (label: Community want to contribute)
#12752 opened Apr 3, 2026 by ConnorLi96
1 task
Respect AutoDeploy trust_remote_code (label: Community want to contribute)
#12751 opened Apr 3, 2026 by jmecom
feat: support multiple model names in --served_model_name (label: Community want to contribute)
#12746 opened Apr 3, 2026 by nvyutwu
5 tasks
[https://nvbugs/5969216][fix] Ministral3 loading fix
#12743 opened Apr 3, 2026 by evezhier Draft
1 task done
[None][feat] Optimize qwen3.5 decode delta kernel
#12740 opened Apr 3, 2026 by nv-guomingz
1 task done