torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 394.00 MiB #23

@shifu-learner

Hello,
I am trying to fine-tune GPT-J-6B.
I followed the instructions provided in the documentation, but I get this error.

I have already tried setting the batch size to 1 and gradient_accumulation_steps to 4.
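For reference, here is a rough back-of-the-envelope sketch of how much memory the weights, gradients, and optimizer state alone might need. It assumes full fine-tuning in fp32 with an Adam-style optimizer (two state tensors per parameter) and roughly 6 billion parameters; none of these are confirmed by the setup described here, and the function name is purely illustrative. Activation memory, which is the part that batch size affects, is ignored.

```python
def estimate_training_memory_gib(num_params, bytes_per_param=4, optimizer_states=2):
    """Rough memory for weights + gradients + optimizer state (activations ignored).

    Hypothetical helper for illustration only; real usage depends on the
    framework, precision, and optimizer actually in use.
    """
    weights = num_params * bytes_per_param
    grads = num_params * bytes_per_param
    optim = num_params * bytes_per_param * optimizer_states
    return (weights + grads + optim) / 2**30


# Assumption: about 6 billion parameters, as the model name suggests.
ASSUMED_PARAMS = 6_000_000_000

# Settings from this report.
batch_size = 1
gradient_accumulation_steps = 4
effective_batch_size = batch_size * gradient_accumulation_steps

fixed_cost_gib = estimate_training_memory_gib(ASSUMED_PARAMS)
print(f"effective batch size: {effective_batch_size}")
print(f"weights + grads + optimizer state: about {fixed_cost_gib:.1f} GiB")
```

Under these assumptions the fixed cost comes to roughly 89 GiB regardless of batch size, which would explain why lowering the batch size alone does not remove the error.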

Any idea how I can solve this?
