Skip to content

Commit 274213b

Browse files
use float32
1 parent b709f53 commit 274213b

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

test/vllm/ec2/utils/run_vllm_on_arm64.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -63,7 +63,7 @@ docker run --rm \
6363
$DLC_IMAGE \
6464
-c "python3 /vllm/examples/offline_inference/basic/generate.py \
6565
--model ${MODEL_NAME} \
66-
--dtype half \
66+
--dtype float32 \
6767
--tensor-parallel-size 1 \
6868
--max-model-len 2048"
6969

@@ -79,7 +79,7 @@ docker run -d \
7979
--gpus=all \
8080
$DLC_IMAGE \
8181
-c "vllm serve ${MODEL_NAME} \
82-
--dtype half
82+
--dtype float32 \
8383
--tensor-parallel-size 1"
8484

8585
wait_for_api

0 commit comments

Comments
 (0)