Skip to content

add int8 quantization support for llm models#4086

Open
lanluo-nvidia wants to merge 3 commits intomainfrom
lluo/int8_non_prequantized
Open

add int8 quantization support for llm models#4086
lanluo-nvidia wants to merge 3 commits intomainfrom
lluo/int8_non_prequantized

Commits

Commits on Feb 19, 2026

Commits on Feb 20, 2026

Comments