Skip to content

Issues: deepspeedai/DeepSpeed

[Roadmap] DeepSpeed Roadmap Q1 2025
#6946 opened Jan 13, 2025 by loadams
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[BUG] error :past_key, past_value = layer_past,how to solve this ? bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#6522 opened Sep 11, 2024 by lovychen
[BUG] Running llama2-7b step3 with tensor parallel and HE fails due to incompatible shapes bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#5656 opened Jun 13, 2024 by ShellyNR
[BUG] DeepSpeed Hybrid Engine Does not Work for Mistral-7B bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4954 opened Jan 14, 2024 by liziniu
[BUG] bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4841 opened Dec 19, 2023 by ldeal3
[OOM] Stage 3 deepspeed-chat Related to DeepSpeed-Chat
#4629 opened Nov 6, 2023 by lljjgg
[BUG] container dose bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4469 opened Oct 7, 2023 by hxdtest
How to train inside multiple nodes' Docker containers? bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4387 opened Sep 22, 2023 by chenfengshijie
[BUG] RuntimeError(f"{param.ds_summary()} already in registry") bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4356 opened Sep 18, 2023 by omeruth
[BUG]not a bug, ask for solution:ppo zero3+offload generate too slow bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4354 opened Sep 18, 2023 by iamsile
[BUG] Actor model generates nothing in step3 bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4301 opened Sep 11, 2023 by xyxxxxx
[BUG] How to checkpoint optimiser states to resume fine-tuning at a later stage? bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4275 opened Sep 6, 2023 by vinod-sarvam
[BUG] Execution hangs after 1 epoch for LLaMa-2-7B SFT. bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4247 opened Sep 1, 2023 by vinod-sarvam
[BUG] Very strange error while running LLaMa-2 with DeepSpeed-Chat bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4229 opened Aug 29, 2023 by vinod-sarvam
RuntimeError: Error building extension 'transformer_inference' bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4219 opened Aug 25, 2023 by lylcst
[BUG] apply_tensor_parallelism() is not executed in Zero3 without self.mpu bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#4080 opened Aug 3, 2023 by devamanyu
[BUG] In step3, a runtime error will be thrown when inference_tp_size>1 bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat
#3998 opened Jul 20, 2023 by haolin-nju
ProTip! Exclude everything labeled bug with -label:bug.