-
Notifications
You must be signed in to change notification settings - Fork 4.3k
Issues: deepspeedai/DeepSpeed
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[REQUEST]Does the current version support distributed fine-tuning on mac devices (M2-M4)?
enhancement
New feature or request
#7148
opened Mar 18, 2025 by
hsoftxl
[REQUEST] Support for Nvidia 50 Series GPUs: Pytorch >=2.6 and CUDA 12.8 required
enhancement
New feature or request
#7144
opened Mar 17, 2025 by
elkay
[REQUEST]Does DeepSpeed support multi-node inference?
enhancement
New feature or request
#7137
opened Mar 14, 2025 by
zyyyyy5
[REQUEST] Is there any plan to support deepseek v3's MOE structure
enhancement
New feature or request
#7129
opened Mar 11, 2025 by
glowwormX
[REQUEST] An option for SUM gradient allreduce instead of MEAN
enhancement
New feature or request
#7107
opened Mar 4, 2025 by
sfc-gh-lmerrick
[REQUEST] Proposal for Enhancing ChatGPT's Response Quality During Training
enhancement
New feature or request
#7097
opened Mar 1, 2025 by
sandyotic
[REQUEST] Publish your Windows Wheels build workflow
enhancement
New feature or request
#7057
opened Feb 20, 2025 by
acidbubbles
[REQUEST] activation checkpoint API should have parity with Pytorch, keywords arguments not supported
enhancement
New feature or request
#7038
opened Feb 15, 2025 by
AndreasMadsen
[REQUEST] Why is the column linear layer with all-gather not implemented in DeepSpeed Inference?
enhancement
New feature or request
#7037
opened Feb 14, 2025 by
zhangvia
[REQUEST]Can the Mamba model be supported?
enhancement
New feature or request
#7022
opened Feb 11, 2025 by
fxnie
[REQUEST] Support Offload deepspeed engine in RLHF training
enhancement
New feature or request
#7013
opened Feb 7, 2025 by
hijkzzz
[REQUEST] Possiblity of integrating LongVU with DeepSpeed
enhancement
New feature or request
#7006
opened Feb 5, 2025 by
xiaoqian-shen
[REQUEST] adding type hints and New feature or request
py.typed
metadata
enhancement
#6988
opened Jan 31, 2025 by
jamesbraza
[REQUEST] FPDT backward test
enhancement
New feature or request
#6955
opened Jan 16, 2025 by
YizhouZ
[REQUEST] Pipeline Parallelism support multi optimizer to train
enhancement
New feature or request
#6951
opened Jan 15, 2025 by
whcjb
Multi node multi gpu distributed load
enhancement
New feature or request
#6927
opened Jan 6, 2025 by
rastinrastinii
[REQUEST] Deepspeed Inference Supports VL (vision language) model
enhancement
New feature or request
#6917
opened Dec 26, 2024 by
ethen8181
[REQUEST] Support for XLA/TPU
enhancement
New feature or request
#6901
opened Dec 21, 2024 by
radna0
Opinion on Refactoring Ulysses
enhancement
New feature or request
#6843
opened Dec 9, 2024 by
Eugene29
[REQUEST] domino integration to nanotron
enhancement
New feature or request
#6835
opened Dec 7, 2024 by
NouamaneTazi
zero-3 cpuadam is so slow
enhancement
New feature or request
#6834
opened Dec 7, 2024 by
SeunghyunSEO
[REQUEST] Let ZeRO-offload use CPU and GPU parallelly
enhancement
New feature or request
#6778
opened Nov 23, 2024 by
fzyzcjy
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-02-19.