Finetune LLaVaNeXT -> ValueError: Image features and image tokens do not match

Hi everyone,

I tried running the notebook provided here for finetuning LLaVaNeXT:
https://github.com/NielsRogge/Transformers-Tutorials/blob/master/LLaVa-NeXT/Fine_tune_LLaVaNeXT_on_a_custom_dataset_(with_PyTorch_Lightning).ipynb

However, during training, I encountered the following error:
`ValueError: Image features and image tokens do not match: tokens: 251, features 2160`

Im using transformers==4.51.3 and did not modify the notebook. I attempted to debug this by reviewing the code around the collate function, but couldn’t find the issue. Has anyone else run into this error or might have ideas on what’s going wrong?

Thanks


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Finetune LLaVaNeXT -> ValueError: Image features and image tokens do not match #492

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Finetune LLaVaNeXT -> ValueError: Image features and image tokens do not match #492

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions