Pull requests: deepseek-ai/DeepSeek-V3
doc: Update deployment instructions using TensorRT-LLM in README.
#876
opened May 21, 2025 by
bobboli
gh repo clone deepseek-ai/DeepSeek-V3Create devcontainer.jsonj
#867
opened May 10, 2025 by
xxxyalaxx90xxx
Fix: safer and cleaner forward() in distributed embedding layer
#834
opened Apr 5, 2025 by
saro1993
Fix: Add metadata to bf16 safetensors for compatibility with transformers
#749
opened Mar 6, 2025 by
tflsxyy
Critical Improvements for Model Correctness, Efficiency, and Robustness
#717
opened Feb 25, 2025 by
abdurrahman482937
Optimize Multi-head Latent Attention (MLA) with Fast Path for Short Sequences
#684
opened Feb 19, 2025 by
XxAlonexX
7 tasks done
Fix incorrect comment in linear function regarding weight.element_size()
#662
opened Feb 14, 2025 by
iamvalenciia
Refactor checkpoint conversion script for improved readability and efficiency
#633
opened Feb 10, 2025 by
tdas3001
Improve convert.py with error handling and code optimization
#618
opened Feb 8, 2025 by
wowrakibul