1 parent c088dc9, commit 742b968
src/routes/blogs/deepseek-r1-on-device/+page.svx
@@ -61,7 +61,7 @@ ONNX enables you to run your models on-device across CPU, GPU, NPU. With ONNX yo
 | deepseek-ai_DeepSeek-R1-Distill-Qwen-1.5B | Int4 | CUDA | RTX 4090 | 313.32 | 6.3X |
 | deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | fp16 | CUDA | RTX 4090 | 57.316 | 1.3X |
 | deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | Int4 | CUDA | RTX 4090 | 161.00 | 3.7X |
-| deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | Int4/bfloat16 | CPU | 13th Gen Intel i9 | 3.184 | 20X |
+| deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | Int4 | CPU | 13th Gen Intel i9 | 3.184 | 20X |
 | deepseek-ai_DeepSeek-R1-Distill-Qwen-1.5B | Int4 | CPU | 13th Gen Intel i9 | 11.749 | 1.4X |
 
 _CUDA BUILD SPECS: onnxruntime-genai-cuda==0.6.0, transformers==4.46.2, onnxruntime-gpu==1.20.1_ <br/>