TensorRT Edge-LLM 0.4.0 Release #4
nvluxiaoz
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
TensorRT Edge-LLM 0.4.0 Release 2026-01-06
We are very excited to announce the first release of TensorRT Edge-LLM! TensorRT Edge-LLM is NVIDIA's high-performance C++ inference runtime for Large Language Models (LLMs) and Vision-Language Models (VLMs) on embedded platforms. Please follow our Quick Start Guide for the usage.
Key Components
Model Support
Please check the model support page for more details.
Key Features
Model Export
Runtime
This discussion was created from the release TensorRT Edge-LLM 0.4.0 Release.
Beta Was this translation helpful? Give feedback.
All reactions