Skip to content

work on llava-next LLaVA-Video-7B-Qwen2 ? #117

@ixn3rd3mxn

Description

@ixn3rd3mxn

I'm currently studying about LLaVA-Video-7B-Qwen2, it uses vision model : siglip-so400m-patch14-384, can you share how to switch vision model to use MLCD-ViT-B-32-224px ?

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions