-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Issues: deepspeedai/DeepSpeedExamples
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Load model error in step3
bug
Something isn't working
deespeed chat
DeepSpeed Chat
#560
opened May 31, 2023 by
YingtongBu2
[bug]AttributeError: 'DeepSpeedHybridEngine' object has no attribute 'mp_group'
bug
Something isn't working
deespeed chat
DeepSpeed Chat
hybrid engine
relating to the hybrid engine
#525
opened May 15, 2023 by
qingchu123
Missing key(s) in state_dict for bias in attention blocks
bug
Something isn't working
deespeed chat
DeepSpeed Chat
#374
opened Apr 20, 2023 by
EikeKohl
map[i] = val_or_map.get(i, Std.NONE) AttributeError: 'NoneType' object has no attribute 'get'
bug
Something isn't working
deespeed chat
DeepSpeed Chat
new-config
A modified config from the given example
#359
opened Apr 19, 2023 by
SeekPoint
多机分布式训练,加载模型,报a leaf Variable that requires grad is being used in an in-place operation错误
bug
Something isn't working
deespeed chat
DeepSpeed Chat
#339
opened Apr 18, 2023 by
sc-lj
DeepSpeed-Chat: prefetch of layers during reward model forward pass leads to error during sample generation
bug
Something isn't working
deespeed chat
DeepSpeed Chat
#337
opened Apr 18, 2023 by
adammoody
Step1 training failed
bug
Something isn't working
deespeed chat
DeepSpeed Chat
system
An issue with a environment/system setup.
#328
opened Apr 17, 2023 by
omoiji
Running multinode training and received unclear error for stage 2 training
bug
Something isn't working
deespeed chat
DeepSpeed Chat
system
An issue with a environment/system setup.
#327
opened Apr 17, 2023 by
alibabadoufu
[BUG]Step1 RuntimeError: CUDA error: CUBLAS_STATUS_ALLOC_FAILED when calling Something isn't working
deespeed chat
DeepSpeed Chat
system
An issue with a environment/system setup.
cublasCreate(handle)
bug
#323
opened Apr 17, 2023 by
qinqinqaq
Step2: memory allocation of 2097152 bytes failed
bug
Something isn't working
deespeed chat
DeepSpeed Chat
#321
opened Apr 17, 2023 by
YukinoshitaKaren
run deepspeed_chat example code error
bug
Something isn't working
deespeed chat
DeepSpeed Chat
hybrid engine
relating to the hybrid engine
#313
opened Apr 15, 2023 by
bestpredicts
when I am running RLHF script, I encountered a error
bug
Something isn't working
deespeed chat
DeepSpeed Chat
hybrid engine
relating to the hybrid engine
#311
opened Apr 15, 2023 by
liuzhiyong01
Single node multi card training failed
bug
Something isn't working
deespeed chat
DeepSpeed Chat
system
An issue with a environment/system setup.
#310
opened Apr 15, 2023 by
menkeyi
ProTip!
Add no:assignee to see everything that’s not assigned.