Skip to content

[pull] main from NVIDIA:main#397

Merged
pull[bot] merged 8 commits intoyingguo-trt:mainfrom
NVIDIA:main
Apr 17, 2026
Merged

[pull] main from NVIDIA:main#397
pull[bot] merged 8 commits intoyingguo-trt:mainfrom
NVIDIA:main

Conversation

@pull
Copy link
Copy Markdown

@pull pull Bot commented Apr 17, 2026

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

ZhanruiSunCh and others added 8 commits April 16, 2026 23:33
Signed-off-by: ZhanruiSunCh <184402041+ZhanruiSunCh@users.noreply.github.com>
…devs (#13110)

Signed-off-by: venkywonka <23023424+venkywonka@users.noreply.github.com>
)

Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
Signed-off-by: Chenfei Zhang <chenfeiz@nvidia.com>
Co-authored-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
…ility (#12968)

Signed-off-by: yuhangh <58161490+heyuhhh@users.noreply.github.com>
…AutoDeploy's perf test: deepseek_r1_distill_qwen_32b (#12965)

Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
…3025)

* Why?

In order to opt into the caching functionality for chunked prefix, there
are certain assumptions on the return type of the encoder's forward
function. These assumptions did not hold for nemotron nano VL prior to
this commit.

* What?

This commit fixes this issue, and adds tests to catch regressions.

Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com>
@pull pull Bot locked and limited conversation to collaborators Apr 17, 2026
@pull pull Bot added the ⤵️ pull label Apr 17, 2026
@pull pull Bot merged commit 2a0bcb1 into yingguo-trt:main Apr 17, 2026
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants