Skip to content

Issues: deepspeedai/DeepSpeedExamples

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

step3_rlhf_finetuning and two tokenizers deespeed chat DeepSpeed Chat modeling Related to modeling questions. new-config A modified config from the given example question Further information is requested
#577 opened Jun 6, 2023 by GenVr
DeepSpeed-Chat cannot load models from local file? deespeed chat DeepSpeed Chat new-config A modified config from the given example
#511 opened May 10, 2023 by MianWang123
Eextension of the issus #479, chatbot.py cannot load the bloom model deespeed chat DeepSpeed Chat new-config A modified config from the given example
#505 opened May 10, 2023 by korlin0110
deepspeed hybrid-engine support bloom model with zero3? deespeed chat DeepSpeed Chat new-config A modified config from the given example
#497 opened May 8, 2023 by null-test-7
how to use 'load_in_8bit=True' when train deespeed chat DeepSpeed Chat new-config A modified config from the given example
#494 opened May 8, 2023 by Haoran1234567
A100 40 GB: OOM on step-3 for opt-6.7B deespeed chat DeepSpeed Chat new-config A modified config from the given example system An issue with a environment/system setup.
#482 opened May 5, 2023 by akashsaravanan-georgian
unable to load 4 7b size model in step3 deespeed chat DeepSpeed Chat new-config A modified config from the given example system An issue with a environment/system setup.
#480 opened May 5, 2023 by Mr-lonely0
Can not use bloom-560m model in the step2_reward_model_finetuning deespeed chat DeepSpeed Chat new-config A modified config from the given example system An issue with a environment/system setup.
#479 opened May 5, 2023 by korlin0110
Adding two loss from actor will lead to an error " gradient computed twice for this partition" deespeed chat DeepSpeed Chat modeling Related to modeling questions. new-config A modified config from the given example
#458 opened Apr 28, 2023 by piekey1994
training 12b model seems to require more memory than expected deespeed chat DeepSpeed Chat new-config A modified config from the given example
#447 opened Apr 27, 2023 by ChaoChungWu-Johnson
gpt ppo training error deespeed chat DeepSpeed Chat new-config A modified config from the given example
#435 opened Apr 26, 2023 by lljjgg
[ERROR]In Step3,load reward Model failed which trainged with zero-stage 3 deespeed chat DeepSpeed Chat new-config A modified config from the given example
#429 opened Apr 25, 2023 by Clitost
Step 3 1.3b Running process stuck deespeed chat DeepSpeed Chat new-config A modified config from the given example system An issue with a environment/system setup.
#428 opened Apr 25, 2023 by awelldone
Error after changing the model from opt to gpt deespeed chat DeepSpeed Chat hybrid engine relating to the hybrid engine new-config A modified config from the given example
#403 opened Apr 23, 2023 by lljjgg
SFT training ,single gpu (V100 32G), how to adjust my parameters to avoid OOM, thx deespeed chat DeepSpeed Chat new-config A modified config from the given example
#389 opened Apr 21, 2023 by Modas-Li
map[i] = val_or_map.get(i, Std.NONE) AttributeError: 'NoneType' object has no attribute 'get' bug Something isn't working deespeed chat DeepSpeed Chat new-config A modified config from the given example
#359 opened Apr 19, 2023 by SeekPoint
use bloom-350m to train reward model in step2 deespeed chat DeepSpeed Chat new-config A modified config from the given example
#356 opened Apr 19, 2023 by 70557dzqc
Could this tool apply for encoder-decoder model, like Flan-T5? deespeed chat DeepSpeed Chat new-config A modified config from the given example
#340 opened Apr 18, 2023 by henryxiao1997
Error when using BLOOMZ for reward model training deespeed chat DeepSpeed Chat new-config A modified config from the given example
#338 opened Apr 18, 2023 by Luoyang144
If I use a self-improved transformer architecture, can it support? deespeed chat DeepSpeed Chat new-config A modified config from the given example question Further information is requested
#304 opened Apr 14, 2023 by liujuncn
ProTip! Updated in the last three days: updated:>2025-03-21.