Name	Name	Last commit message	Last commit date
parent directory ..
Mistral-7B-Instruct-v0.2_model_builder_int4.json	Mistral-7B-Instruct-v0.2_model_builder_int4.json
README.md	README.md
info.yml	info.yml

Name

Last commit message

Last commit date

Mistral-7B-Instruct-v0.2_model_builder_int4.json

Mistral-7B-Instruct-v0.2 optimization

This folder contains examples of Olive recipes for Mistral-7B-Instruct-v0.2 optimization.

INT4 Model Building

The olive recipe Mistral-7B-Instruct-v0.2_model_builder_int4.json uses ModelBuilder and MatMulNBitsToQDQ passes to generate the INT4 model for NvTensorRTRTXExecutionProvider (aka NvTensorRtRtx EP).

Setup

Install Olive
Install onnxruntime-genai package that has support for NvTensorRTRTXExecutionProvider.

Steps to run

Use the following command to export the model using Olive with NvTensorRTRTXExecutionProvider:

olive run --config Mistral-7B-Instruct-v0.2_model_builder_int4.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

README.md

Mistral-7B-Instruct-v0.2 optimization

INT4 Model Building

Setup

Steps to run

Uh oh!

FilesExpand file tree

NvTensorRtRtx

Directory actions

More options

Directory actions

More options

Latest commit

History

NvTensorRtRtx

Folders and files

parent directory

README.md

Mistral-7B-Instruct-v0.2 optimization

INT4 Model Building

Setup

Steps to run