Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This commit code is submit for KME and BW1000, and we also suppply the Z100 method.
Introduction
Enable ktransformers on HYGON DCU.
Compile ktransformers from sourcecode :
pip3 install fastpt (from das)
bash install_dcu.sh
Limitation
Marlin kernel is not supported on DCU(KME and BW1000) and will be support in future , the yaml change can reference by ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-dcu.yaml to Replace all instances of KLinearMarlin with KLinearTorch.
If you want to use ktransformers on Z100, you should do as follows:
Validation command:
python3 ./ktransformers/local_chat.py --model_path ./DeepSeek-R1 --gguf_path ./DeepSeek-R1-GGUF/ --cpu_infer 65 --max_new_tokens 1000 --optimize-config-path ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-dcu.yaml
Chat: who are you
Greetings! I'm DeepSeek-R1, an artificial intelligence assistant created by DeepSeek. I'm at your service and would be delighted to assist ou with any inquiries or tasks you may have.