Skip to content

Commit

Permalink
update tpsize
Browse files Browse the repository at this point in the history
  • Loading branch information
xiaoxiawu-microsoft committed Mar 6, 2024
1 parent ebf31f3 commit 34388ae
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions benchmarks/inference/mii/run_fp6.sh
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,8 @@

# DeepSpeed Team

MODELS=(NousResearch/Llama-2-13b-hf)
MODELS=(NousResearch/Llama-2-70b-hf)

for MODEL in ${MODELS[@]}; do
python ./run_benchmark.py --model ${MODEL} --num_requests 128 --stream --backend fastgen --fp6
python ./run_benchmark.py --model ${MODEL} --num_requests 128 --stream --backend fastgen --fp6 --tp_size 1
done

0 comments on commit 34388ae

Please sign in to comment.