Merged
4 changes: 3 additions & 1 deletion docs/source/tutorials/Qwen3-32B-W4A4.md
@@ -55,10 +55,12 @@ cd example/Qwen
  MODEL_PATH=/home/models/Qwen3-32B
  # Path to save converted weight, Replace with your local path
  SAVE_PATH=/home/models/Qwen3-32B-w4a4
+ # Set two idle NPU cards
+ export ASCEND_RT_VISIBLE_DEVICES=0,1

  python3 w4a4.py --model_path $MODEL_PATH \
  --save_directory $SAVE_PATH \
- --calib_file ../common/qwen_qwen3_cot_w4a4.json \
+ --calib_file ./calib_data/qwen3_cot_w4a4.json \
  --trust_remote_code True \
  --batch_size 1
  ```
2 changes: 2 additions & 0 deletions docs/source/tutorials/Qwen3-8B-W4A8.md
@@ -47,6 +47,8 @@ cd example/Qwen
  MODEL_PATH=/home/models/Qwen3-8B
  # Path to save converted weight, Replace with your local path
  SAVE_PATH=/home/models/Qwen3-8B-w4a8
+ # Set an idle NPU card
+ export ASCEND_RT_VISIBLE_DEVICES=0

  python quant_qwen.py \
  --model_path $MODEL_PATH \