-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][infra] PLC nightly source code scanning
#12124
opened Mar 11, 2026 by
yuanjingx87
Loading…
1 task done
[None][refactor] parallel vae refactor
VisualGen
#12123
opened Mar 11, 2026 by
NVShreyas
Loading…
1 task done
[#11083][feat] Add hardware-aware MLA defaults to get_model_defaults()
Community want to contribute
PRs initiated from Community
#12122
opened Mar 11, 2026 by
wojciech-wais
Loading…
1 task
[TRTLLM-11362][feat] Add batch generation support to visual gen pipelines
#12121
opened Mar 11, 2026 by
karljang
Loading…
7 tasks done
[None][fix] kvcache storeContextBlocks toctou
#12120
opened Mar 11, 2026 by
thorjohnsen
Loading…
1 task done
[#12116][fix] prevent KVBM hang when requests abort during KV cache transfer
Community want to contribute
PRs initiated from Community
#12117
opened Mar 11, 2026 by
zyang-Modular
Loading…
1 task done
[None][feat] Add support for phi4 and phi4-mini-flash
#12113
opened Mar 11, 2026 by
bmarimuthu-nv
•
Draft
1 task
[None][Chore] Fix KVCacheManagerV2 shrink for last level and improve init_ratio
#12112
opened Mar 11, 2026 by
lowsfer
Loading…
1 task
[#12071][fix] Replace cudaMemcpy2DAsync with flat copy in copyKvBlockOffsets
Community want to contribute
PRs initiated from Community
#12111
opened Mar 11, 2026 by
wojciech-wais
Loading…
1 task
[None][chore] Add multinode e2e and accuracy cases on DGX-Spark
#12110
opened Mar 11, 2026 by
JennyLiu-nv
Loading…
1 task done
[TRTLLM-11288][feat] Configurable warmup shapes for VisualGen
#12107
opened Mar 11, 2026 by
luyiyun1021
Loading…
1 task done
[TRTLLM-10303][feat] Deprecate trtllm-serve CLI options
#12106
opened Mar 11, 2026 by
JunyiXu-nv
Loading…
1 task done
[TRTLLM-10076][feat] Serve CLI improvements: renames, new flags, and mm_embedding_serve enhancements
#12105
opened Mar 11, 2026 by
JunyiXu-nv
Loading…
1 task done
[TRTLLM-10077][feat] Add 'auto' option for tool and reasoning parsers
#12104
opened Mar 11, 2026 by
JunyiXu-nv
Loading…
1 task done
[None][chore] Add explicit error for intermediate size misalignment with fp8 block size
#12101
opened Mar 11, 2026 by
leslie-fang25
Loading…
1 task done
[None][fix] Enforce minimum NVSHMEM_QP_DEPTH of 128 for DeepEP low latency
#12100
opened Mar 11, 2026 by
Tabrizian
Loading…
1 task done
[https://nvbugs/5963423][fix] Fix kv token estimation when ADP is on.
#12099
opened Mar 11, 2026 by
dominicshanshan
Loading…
1 task done
[TRTLLM-9911] [doc] Update Perf-Overview.md for Release 1.2
Doc
<NV>TRTLLM's textual/illustrative materials: API refs, guides, tutorials. Improvement & clarity.
Release Blocker
PRs that blocking the final release build or branching out the release branch
#12098
opened Mar 11, 2026 by
zbpatel
Loading…
1 task done
[None][test] fix perf test cases issue of incorrect match
#12096
opened Mar 11, 2026 by
ruodil
Loading…
1 task done
[TRTLLM-9523][feat] Additional adaptation to manager v2 (step 6)
#12095
opened Mar 11, 2026 by
Shixiaowei02
•
Draft
1 task
[https://nvbugs/5826604][test] Remove test waive for Llama3.1 8B bfloat16 4gpu timeout …
#12092
opened Mar 11, 2026 by
syuoni
Loading…
1 task done
Draft - Don't Review - AD Deepseek-V3-Lite and mla enablement
#12089
opened Mar 10, 2026 by
MrGeva
Loading…
1 task
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.