Issues: deepspeedai/DeepSpeed
Issues list
[BUG] RuntimeError: The size of tensor a (2048) must match the size of tensor b (1024) at non-singleton dimension 2
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#6910
opened Dec 24, 2024 by
Lowlowlowlowlowlow
[BUG] Error: past_key, past_value = layer_past; how to solve this?
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#6522
opened Sep 11, 2024 by
lovychen
[BUG] Running llama2-7b step3 with tensor parallel and HE fails due to incompatible shapes
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#5656
opened Jun 13, 2024 by
ShellyNR
[BUG] RuntimeError encountered when generating tokens from a DeepSpeedHybridEngine initialized with 4-bit quantization.
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#5630
opened Jun 8, 2024 by
Atry
[BUG] I found that the model's parameters are fully transferred to the VRAM of each process. Is this abnormal, or is my understanding wrong?
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#5575
opened May 28, 2024 by
tiandazhao
[BUG] DeepSpeed Hybrid Engine Does not Work for Mistral-7B
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4954
opened Jan 14, 2024 by
liziniu
[BUG] Step 3 with ZeRO=3 see error: RuntimeError: CUDA error: an illegal memory access was encountered
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4945
opened Jan 12, 2024 by
N33MO
[BUG]
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4841
opened Dec 19, 2023 by
ldeal3
[BUG] Hybrid Engine with DeepSpeed Stage 3 results and Llama V2 results in gibberish outputs
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4788
opened Dec 8, 2023 by
pacman100
[BUG] assert param.ds_status == ZeroParamStatus.AVAILABLE, param.ds_summary() when training deepspeed-chat step3 with ZeRO3 and a larger generation_batches
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4533
opened Oct 18, 2023 by
GoSz
[BUG] container dose
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4469
opened Oct 7, 2023 by
hxdtest
How to train inside multiple nodes' Docker containers?
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4387
opened Sep 22, 2023 by
chenfengshijie
[BUG] RuntimeError(f"{param.ds_summary()} already in registry")
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4356
opened Sep 18, 2023 by
omeruth
[BUG] Not a bug, asking for a solution: PPO with ZeRO3 + offload generates too slowly
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4354
opened Sep 18, 2023 by
iamsile
[BUG] RuntimeError: The size of tensor a (3072) must match the size of tensor b (4096) at non-singleton dimension 0
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4302
opened Sep 11, 2023 by
4daJKong
[BUG] Actor model generates nothing in step3
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4301
opened Sep 11, 2023 by
xyxxxxx
[BUG] How to checkpoint optimiser states to resume fine-tuning at a later stage?
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4275
opened Sep 6, 2023 by
vinod-sarvam
[BUG] Execution hangs after 1 epoch for LLaMa-2-7B SFT.
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4247
opened Sep 1, 2023 by
vinod-sarvam
[BUG] Very strange error while running LLaMa-2 with DeepSpeed-Chat
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4229
opened Aug 29, 2023 by
vinod-sarvam
RuntimeError: Error building extension 'transformer_inference'
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4219
opened Aug 25, 2023 by
lylcst
[BUG] apply_tensor_parallelism() is not executed in Zero3 without self.mpu
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#4080
opened Aug 3, 2023 by
devamanyu
[BUG] In step3, a runtime error will be thrown when inference_tp_size>1
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#3998
opened Jul 20, 2023 by
haolin-nju
[BUG] I cannot use DeepSpeed Chat to train my model with "hybrid_engine" enabled in step3
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#3870
opened Jul 4, 2023 by
NostalgiaOfTime
[BUG] Is it right that per_device_train_batch_size = per_device_mini_train_batch_size * gradient_accumulation_steps
deepspeed-chat
Related to DeepSpeed-Chat
training
#3737
opened Jun 12, 2023 by
feiliya333