Mistral-7B-Instruct-v0.2 optimization

This folder contains example Olive recipes for optimizing Mistral-7B-Instruct-v0.2.

INT4 Model Building

The Olive recipe Mistral-7B-Instruct-v0.2_model_builder_int4.json uses the ModelBuilder and MatMulNBitsToQDQ passes to generate an INT4 model for NvTensorRTRTXExecutionProvider (also known as the NvTensorRtRtx EP).
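To confirm which passes a recipe declares before running it, the JSON can be read and the `type` of each pass listed. The following is a minimal sketch, assuming the recipe has a top-level `passes` object whose entries each carry a `type` field. The sample dictionary and its pass names (`builder`, `to_qdq`) are hypothetical stand-ins, not the contents of the actual file.

```python
import json
from pathlib import Path


def list_pass_types(config: dict) -> list[str]:
    """Return the `type` of each entry under the top-level `passes` key, in file order."""
    passes = config.get("passes", {})
    return [entry.get("type", "<unknown>") for entry in passes.values()]


def load_recipe(path: str) -> dict:
    """Read an Olive recipe (plain JSON) from disk."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


# Hypothetical stand-in for the recipe; the pass names are illustrative only.
sample_recipe = {
    "passes": {
        "builder": {"type": "ModelBuilder"},
        "to_qdq": {"type": "MatMulNBitsToQDQ"},
    }
}
print(list_pass_types(sample_recipe))  # ['ModelBuilder', 'MatMulNBitsToQDQ']
```

For the real recipe, `list_pass_types(load_recipe("Mistral-7B-Instruct-v0.2_model_builder_int4.json"))` would report whatever passes the file actually declares.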

Setup

  1. Install Olive

  2. Install an onnxruntime-genai package that supports NvTensorRTRTXExecutionProvider.
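A quick way to check that both setup steps took effect is to ask Python whether the modules can be found. This is a minimal sketch, assuming the import names are `olive` and `onnxruntime_genai`. It only checks importability and does not verify that the installed build supports NvTensorRTRTXExecutionProvider.

```python
import importlib.util


def missing_modules(module_names: list[str]) -> list[str]:
    """Return the names that cannot be found on the current Python path."""
    return [name for name in module_names if importlib.util.find_spec(name) is None]


# Assumed import names for the two packages from the setup steps.
required = ["olive", "onnxruntime_genai"]
missing = missing_modules(required)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("Both packages are importable.")
```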

Steps to run

Use the following command to export the model using Olive with NvTensorRTRTXExecutionProvider:

olive run --config Mistral-7B-Instruct-v0.2_model_builder_int4.json
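The same command can be driven from a Python script, for example to fail early when the recipe file is not in the working directory. This is a hedged sketch that wraps the CLI with `subprocess`. The helper names `build_olive_command` and `run_recipe` are hypothetical and not part of Olive.

```python
import subprocess
from pathlib import Path

RECIPE = "Mistral-7B-Instruct-v0.2_model_builder_int4.json"


def build_olive_command(config_path: str) -> list[str]:
    """Build the argument list for `olive run --config <config_path>`."""
    return ["olive", "run", "--config", config_path]


def run_recipe(config_path: str = RECIPE) -> int:
    """Run the recipe and return the exit code; fail early if the file is absent."""
    if not Path(config_path).is_file():
        raise FileNotFoundError(f"Recipe not found: {config_path}")
    # check=False so the caller can inspect the exit code instead of catching an exception.
    completed = subprocess.run(build_olive_command(config_path), check=False)
    return completed.returncode
```

Calling `run_recipe()` from the folder that holds the recipe is intended to behave like the command above, returning 0 when the `olive` process exits successfully.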