How to use multimodal models

To use a multimodal model, select the appropriate text model when adding a chat, e.g. MobileVLM-3B-q3_K_S.gguf, then activate the CLIP option and select the matching CLIP (mmproj) model, e.g. MobileVLM-3B-mmproj-f16.gguf. Once both models are set, a button for attaching an image to a message appears in the chat. If the model does not respond to the image, check that both the text model and the CLIP model are selected.
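
If you keep several GGUF files in one folder, the "are both models present?" check can be sketched outside the app. The snippet below is only an illustration and not part of the app: the function names `split_models` and `missing_parts` are hypothetical, and it assumes that CLIP projector files carry `mmproj` in their file name, as in the example file names in this section.

```python
from pathlib import Path


def split_models(filenames):
    """Separate GGUF files into text models and CLIP (mmproj) models.

    Assumption: a CLIP projector file has "mmproj" in its name.
    """
    ggufs = [Path(name).name for name in filenames if name.lower().endswith(".gguf")]
    clip_models = [name for name in ggufs if "mmproj" in name.lower()]
    text_models = [name for name in ggufs if "mmproj" not in name.lower()]
    return text_models, clip_models


def missing_parts(filenames):
    """Return the kinds of model that are still missing for multimodal use."""
    text_models, clip_models = split_models(filenames)
    missing = []
    if not text_models:
        missing.append("text model")
    if not clip_models:
        missing.append("CLIP (mmproj) model")
    return missing


files = ["MobileVLM-3B-q3_K_S.gguf", "MobileVLM-3B-mmproj-f16.gguf"]
print(split_models(files))
print(missing_parts(files))
```

An empty list from `missing_parts` means both kinds of file are available to select. It does not verify that the two files belong to the same model family, which still has to be checked by hand.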
