How to use 'load_in_8bit=True' when training #494

Open
@Haoran1234567

Description

I want to train LLaMA-13B in the SFT stage. The 7B model trains fine on 8×24 GB GPUs (RTX 3090), but 13B runs out of memory (OOM). I have tried all the memory-reduction options in DeepSpeed-Chat, and every one of them still OOMs.
I then tried passing 'load_in_8bit=True' when loading the model, but that raises an error.
How should I modify the code?
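
For context, a rough back-of-the-envelope estimate may explain why 7B fits and 13B does not. This is only a sketch under stated assumptions, not a measurement from this setup: it assumes full fine-tuning with mixed-precision Adam (about 16 bytes of model state per parameter), perfect sharding across all GPUs, no CPU offload, and no LoRA, and it ignores activations and framework overhead. The helper names `training_state_gb` and `fits` are hypothetical.

```python
# Rough model-state memory for full fine-tuning with mixed-precision Adam.
# Assumption: fp16 weights (2) + fp16 gradients (2) + fp32 master weights (4)
# + fp32 Adam momentum (4) + fp32 Adam variance (4) = 16 bytes per parameter.
BYTES_PER_PARAM = 2 + 2 + 4 + 4 + 4


def training_state_gb(num_params: float) -> float:
    """Weights, gradients, and optimizer state in GB; activations excluded."""
    return num_params * BYTES_PER_PARAM / 1e9


def fits(num_params: float, num_gpus: int = 8, gpu_gb: float = 24.0) -> bool:
    """True if the model state fits in total GPU memory, assuming it is
    sharded perfectly across all GPUs with nothing offloaded to CPU."""
    return training_state_gb(num_params) <= num_gpus * gpu_gb


if __name__ == "__main__":
    for name, n in [("7B", 7e9), ("13B", 13e9)]:
        print(f"{name}: ~{training_state_gb(n):.0f} GB of model state, "
              f"fits in 8x24 GB: {fits(n)}")
```

Under these assumptions the 13B model state alone exceeds the combined memory of eight 24 GB cards, while the 7B model state does not, which matches the behaviour described above. Offloading or parameter-efficient methods change this arithmetic.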

Labels

deespeed chat (DeepSpeed Chat), new-config (A modified config from the given example)
