Releases · NVIDIA-NeMo/Export-Deploy

22 Oct 23:36

chtruong814

v0.2.1

950000c

NVIDIA NeMo-Export-Deploy 0.2.1 Latest

Latest

Bug fixes for HuggingFace model deployment (#459)
- Fixed HuggingFace deployable implementations for both Triton and Ray Serve backends
- Improved tokenizer handling in HuggingFace deployment scripts
Minor fixes for Ray deployment (#464)
- Additional bug fixes in Ray deployment utilities

Assets 2

09 Oct 20:01

chtruong814

v0.2.0

726695b

NVIDIA NeMo-Export-Deploy 0.2.0

MegatronLM and Megatron-Bridge model deployment support with Triton Inference Server and Ray Serve
Multi-node multi-instance Ray Serve based deployment for NeMo 2, Megatron-Bridge, and Megatron-LM models.
Update vLLM export to use NeMo->HF->vLLM export path
Multi-Modal deployment for NeMo 2 models with Triton Inference Server
NeMo Retriever Text Reranking ONNX and TensorRT export support

Assets 2

18 Aug 06:32

chtruong814

v0.2.0rc2

7867110

NVIDIA NeMo-Export-Deploy 0.2.0rc2 Pre-release

Pre-release

Prerelease: NVIDIA NeMo-Export-Deploy 0.2.0rc2 (2025-08-18)

Assets 2

15 Aug 08:24

chtruong814

v0.1.1

ca72da9

NVIDIA NeMo-Export-Deploy 0.1.1

ci: Mock DCO check

Signed-off-by: oliver könig <[email protected]>

Assets 2

14 Aug 15:54

chtruong814

v0.2.0rc1

62485cc

NVIDIA NeMo-Export-Deploy 0.2.0rc1 Pre-release

Pre-release

Prerelease: NVIDIA NeMo-Export-Deploy 0.2.0rc1 (2025-08-14)

Assets 2

03 Aug 16:48

chtruong814

v0.2.0rc0

657c525

NVIDIA NeMo-Export-Deploy 0.2.0rc0 Pre-release

Pre-release

Prerelease: NVIDIA NeMo-Export-Deploy 0.2.0rc0 (2025-08-03)

Assets 2

30 Jul 16:01

ko3n1g

v0.1.0

b6cf209

NVIDIA NeMo-Export-Deploy 0.1.0

NeMo Export-Deploy Release
Pip installers for export and deploy
RayServe support for multi-instance deployment
TensorRT-LLM PyTorch backend
mcore inference optimizations

Assets 2

Releases: NVIDIA-NeMo/Export-Deploy

NVIDIA NeMo-Export-Deploy 0.2.1

Uh oh!

NVIDIA NeMo-Export-Deploy 0.2.0

Uh oh!

NVIDIA NeMo-Export-Deploy 0.2.0rc2

Uh oh!

NVIDIA NeMo-Export-Deploy 0.1.1

Uh oh!

NVIDIA NeMo-Export-Deploy 0.2.0rc1

Uh oh!

NVIDIA NeMo-Export-Deploy 0.2.0rc0

Uh oh!

NVIDIA NeMo-Export-Deploy 0.1.0

Uh oh!