-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Issues: deepspeedai/DeepSpeedExamples
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Do we have any plans on supporting pipeline parallel?
question
Further information is requested
#596
opened Jun 15, 2023 by
LSX-Sneakerprogrammer
enable critic_gradient_checkpointing, get error
deespeed chat
DeepSpeed Chat
question
Further information is requested
#578
opened Jun 7, 2023 by
BaiStone2017
step3_rlhf_finetuning and two tokenizers
deespeed chat
DeepSpeed Chat
modeling
Related to modeling questions.
new-config
A modified config from the given example
question
Further information is requested
#577
opened Jun 6, 2023 by
GenVr
what data should I use in step 3
deespeed chat
DeepSpeed Chat
modeling
Related to modeling questions.
question
Further information is requested
#555
opened May 29, 2023 by
scarydemon2
Model and data downloading
deespeed chat
DeepSpeed Chat
question
Further information is requested
#550
opened May 26, 2023 by
treya-lin
How to save the model after each epoch
deespeed chat
DeepSpeed Chat
question
Further information is requested
#510
opened May 10, 2023 by
nieallen
Is there any Deepspeed Inference PTQ Example?
deespeed chat
DeepSpeed Chat
question
Further information is requested
#508
opened May 10, 2023 by
tingshua-yts
is Deepspeed-Chat support tensor parallelism for Codegen
deespeed chat
DeepSpeed Chat
question
Further information is requested
#431
opened Apr 25, 2023 by
Emerald01
How can I train step3 in DeepSpeed-Chat by pipeline parallelism?
deespeed chat
DeepSpeed Chat
question
Further information is requested
#427
opened Apr 25, 2023 by
GongCQ
Exchange group; 交流群
deespeed chat
DeepSpeed Chat
question
Further information is requested
#367
opened Apr 20, 2023 by
yrqUni
single gpu 6.7b lora CUDA OOM with A6000 48G
deespeed chat
DeepSpeed Chat
question
Further information is requested
#330
opened Apr 17, 2023 by
HyeongminMoon
In instructGPT, during the RM training process, different <prompt, response> pairs of a prompt are put together to calculate the loss. Is this also implemented in DeepSpeed-chat?
deespeed chat
DeepSpeed Chat
question
Further information is requested
#320
opened Apr 17, 2023 by
BaiStone2017
Does the model in this framework have to be trained to have conversations?
deespeed chat
DeepSpeed Chat
modeling
Related to modeling questions.
question
Further information is requested
#306
opened Apr 15, 2023 by
code-isnot-cold
If I use a self-improved transformer architecture, can it support?
deespeed chat
DeepSpeed Chat
new-config
A modified config from the given example
question
Further information is requested
#304
opened Apr 14, 2023 by
liujuncn
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.