Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! when resuming training

I tried to run example.py on an A100 (80GB) GPU. It seems there is a bug at line [41] https://github.com/datamllab/LongLM/blob/ee92c841eaf8c6e0989f49c2d63231ba06136345/example.py#L41

The current implementation doesn't load the input_ids tensors onto the device, which causes an error. I replaced the above code, and it's now working. Fixed the issue by adding: `input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to("cuda")`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! when resuming training #37

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! when resuming training #37

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions