Name	Name	Last commit message	Last commit date
parent directory ..
Llama-3.1-8B-Instruct_model_builder_int4.json	Llama-3.1-8B-Instruct_model_builder_int4.json
README.md	README.md
info.yml	info.yml

Name

Last commit message

Last commit date

Llama-3.1-8B-Instruct_model_builder_int4.json

Llama-3.1-8B-Instruct optimization

This folder contains examples of Olive recipes for Llama-3.1-8B-Instruct optimization.

INT4 Model Building

The olive recipe Llama-3.1-8B-Instruct_model_builder_int4.json uses ModelBuilder pass to generate the INT4 model for NvTensorRTRTXExecutionProvider (aka NvTensorRtRtx EP).

Setup

Install Olive
Install onnxruntime-genai package that has support for NvTensorRTRTXExecutionProvider.

Steps to run

Use the following command to export the model using Olive with NvTensorRTRTXExecutionProvider:

olive run --config Llama-3.1-8B-Instruct_model_builder_int4.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

README.md

Llama-3.1-8B-Instruct optimization

INT4 Model Building

Setup

Steps to run

Uh oh!

FilesExpand file tree

NvTensorRtRtx

Directory actions

More options

Directory actions

More options

Latest commit

History

NvTensorRtRtx

Folders and files

parent directory

README.md

Llama-3.1-8B-Instruct optimization

INT4 Model Building

Setup

Steps to run