Commit 7536f03

Update autoround RTN example to tuning (#2558)
SUMMARY: Update autoround RTN example to tuning

Signed-off-by: Wang, Mengni <mengni.wang@intel.com>

1 parent 8f08737 commit 7536f03

1 file changed: +1 -1 lines changed

examples/autoround/quantization_w8a8_fp8/llama3.1_block_quant_example.py

Lines changed: 1 addition & 1 deletion
@@ -25,7 +25,7 @@
 # Configure the quantization algorithm to run.
 # NOTE: AutoRoundModifier with iters=0 is equivalent to RTN
 recipe = AutoRoundModifier(
-    targets="Linear", scheme="FP8_BLOCK", ignore=["lm_head"], iters=0
+    targets="Linear", scheme="FP8_BLOCK", ignore=["lm_head"], iters=200
 )
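
The NOTE in the hunk above says that zero tuning iterations reduce to round-to-nearest (RTN). The sketch below is a toy illustration of why that can hold, not AutoRound's actual implementation: it assumes the tuner learns a per-weight rounding offset in [-0.5, 0.5] that starts at zero and is updated by signed gradient steps. All function names, the learning rate, and the sample data are hypothetical.

```python
def sign(x):
    """Return -1, 0, or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def quantize(weights, scale, offsets):
    """Quantize-dequantize: round(w / scale + v) * scale for each weight."""
    return [round(w / scale + v) * scale for w, v in zip(weights, offsets)]


def output_error(weights, quantized, inputs):
    """Absolute error of the layer output x . q compared with x . w."""
    return abs(sum(x * (q - w) for x, w, q in zip(inputs, weights, quantized)))


def tune_offsets(weights, inputs, scale, iters, lr=0.005):
    """Toy rounding tuner (hypothetical): signed steps on the rounding offsets.

    Offsets start at zero, so iters=0 returns all zeros and quantize()
    then performs plain round-to-nearest.
    """
    offsets = [0.0] * len(weights)
    best = list(offsets)
    best_err = output_error(weights, quantize(weights, scale, offsets), inputs)
    for _ in range(iters):
        q = quantize(weights, scale, offsets)
        err = sum(x * (qi - w) for x, w, qi in zip(inputs, weights, q))
        # Straight-through estimate: treat rounding as identity, so the
        # gradient of err**2 with respect to offset i has the sign of err * x_i.
        offsets = [
            max(-0.5, min(0.5, v - lr * sign(err * x)))
            for v, x in zip(offsets, inputs)
        ]
        cur = output_error(weights, quantize(weights, scale, offsets), inputs)
        if cur < best_err:  # keep the best offsets seen so far
            best, best_err = list(offsets), cur
    return best


# Hypothetical sample data for the comparison.
weights = [0.26, -0.74, 0.49, 1.01]
inputs = [1.0, 0.5, -2.0, 0.25]
scale = 0.5

rtn = quantize(weights, scale, [0.0] * len(weights))
zero_iter = quantize(weights, scale, tune_offsets(weights, inputs, scale, iters=0))
tuned = quantize(weights, scale, tune_offsets(weights, inputs, scale, iters=200))
```

Because the tuner keeps the best offsets it has seen, starting from the RTN solution, the tuned output error in this toy can never exceed the RTN error. The real algorithm works on calibration data rather than a single input vector, so this only conveys the relationship between `iters=0` and `iters=200`.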