-
I have ktransformers (25/02/2025 update) installed on a machine with Linux Mint. DeepSeek-R1 works without issues with all the quants I have tried. Yet, DeepSeek-V3 doesn't work. When I try loading it with
I get the following error:
The file used, DeepSeek-V3.i1-IQ3_XL.gguf, works without issues on llama.cpp. According to https://docs.rs/llama_cpp_sys/latest/llama_cpp_sys/enum.ggml_type.html, ggml_type 21 is for IQ3_S, not for IQ3_XS. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
I have tried a different quant (Q4_K_S) and it works flawlessly. I guess the issue is the mismatch between ggml_type 21 and IQ3_XS. |
Beta Was this translation helpful? Give feedback.
I have tried a different quant (Q4_K_S) and it works flawlessly. I guess the issue is the mismatch between ggml_type 21 and IQ3_XS.