Name	Name	Last commit message	Last commit date
parent directory ..
Qwen2.5-7B-Instruct_model_builder_int4.json	Qwen2.5-7B-Instruct_model_builder_int4.json
README.md	README.md
info.yml	info.yml

Name

Last commit message

Last commit date

Qwen2.5-7B-Instruct_model_builder_int4.json

Qwen2.5-7B-Instruct optimization

This folder contains examples of Olive recipes for Qwen2.5-7B-Instruct optimization.

INT4 Model Building

The olive recipe Qwen2.5-7B-Instruct_model_builder_int4.json uses ModelBuilder and MatMulNBitsToQDQ passes to generate the INT4 model for NvTensorRTRTXExecutionProvider (aka NvTensorRtRtx EP).

Setup

Install Olive
Install onnxruntime-genai package that has support for NvTensorRTRTXExecutionProvider.

Steps to run

Use the following command to export the model using Olive with NvTensorRTRTXExecutionProvider:

olive run --config Qwen2.5-7B-Instruct_model_builder_int4.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

README.md

Qwen2.5-7B-Instruct optimization

INT4 Model Building

Setup

Steps to run

Uh oh!

FilesExpand file tree

NvTensorRtRtx

Directory actions

More options

Directory actions

More options

Latest commit

History

NvTensorRtRtx

Folders and files

parent directory

README.md

Qwen2.5-7B-Instruct optimization

INT4 Model Building

Setup

Steps to run