Skip to content

Commit 742b968

Browse files
committed
removed bf16
1 parent c088dc9 commit 742b968

File tree

1 file changed

+1
-1
lines changed
  • src/routes/blogs/deepseek-r1-on-device

1 file changed

+1
-1
lines changed

src/routes/blogs/deepseek-r1-on-device/+page.svx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -61,7 +61,7 @@ ONNX enables you to run your models on-device across CPU, GPU, NPU. With ONNX yo
6161
| deepseek-ai_DeepSeek-R1-Distill-Qwen-1.5B | Int4 | CUDA | RTX 4090 | 313.32 | 6.3X |
6262
| deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | fp16 | CUDA | RTX 4090 | 57.316 | 1.3X |
6363
| deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | Int4 | CUDA | RTX 4090 | 161.00 | 3.7X |
64-
| deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | Int4/bfloat16 | CPU | 13th Gen Intel i9 | 3.184 | 20X |
64+
| deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | Int4 | CPU | 13th Gen Intel i9 | 3.184 | 20X |
6565
| deepseek-ai_DeepSeek-R1-Distill-Qwen-1.5B | Int4 | CPU | 13th Gen Intel i9 | 11.749 | 1.4X |
6666

6767
_CUDA BUILD SPECS: onnxruntime-genai-cuda==0.6.0, transformers==4.46.2, onnxruntime-gpu==1.20.1_ <br/>

0 commit comments

Comments
 (0)