DeepSeek-R1-Distill-Qwen-1.5B optimization

This folder contains examples of Olive recipes for DeepSeek-R1-Distill-Qwen-1.5B optimization.

FP16 Model Building

The olive recipe DeepSeek-R1-Distill-Qwen-1.5B_model_builder_fp16.json uses ModelBuilder pass to generate the FP16 model for NvTensorRTRTXExecutionProvider (aka NvTensorRtRtx EP).

Setup

Install Olive
Install onnxruntime-genai package that has support for NvTensorRTRTXExecutionProvider.

Steps to run

Use the following command to export the model using Olive with NvTensorRTRTXExecutionProvider:

olive run --config DeepSeek-R1-Distill-Qwen-1.5B_model_builder_fp16.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

DeepSeek-R1-Distill-Qwen-1.5B optimization

FP16 Model Building

Setup

Steps to run

Uh oh!

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

DeepSeek-R1-Distill-Qwen-1.5B optimization

FP16 Model Building

Setup

Steps to run