Which Ollama models are recommended for OpenRAG? #715
I'm using Ollama as my model provider. Which models work best with OpenRAG?
Replies: 3 comments
OpenRAG isn't guaranteed to be compatible with all Ollama models. Some models might produce unexpected results (like JSON output instead of natural language) or aren't appropriate for RAG tasks.

Recommended models:

Language models:
- gpt-oss:20b (requires at least 16GB of RAM; consider using Ollama Cloud or a remote machine)
- mistral-nemo:12b

Embedding models:
- nomic-embed-text:latest
- mxbai-embed-large:latest
- embeddinggemma:latest

You can experiment with other models, but if you encounter issues that you can't resolve through RAG best practices (like context filters and prompt engineering), try switching to one of these recommended models. If you need support for a specific model, please submit a GitHub issue.
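If it helps, a minimal sketch of pulling the recommended models with the Ollama CLI before pointing OpenRAG at them (model tags are the ones recommended in this thread; skip any that don't fit your hardware):

```shell
# Language models — gpt-oss:20b needs roughly 16GB of RAM,
# so skip it on smaller machines and use mistral-nemo:12b instead
ollama pull gpt-oss:20b
ollama pull mistral-nemo:12b

# Embedding models
ollama pull nomic-embed-text:latest
ollama pull mxbai-embed-large:latest
ollama pull embeddinggemma:latest

# Confirm what is installed and how large each model is
ollama list
```

`ollama list` prints each model's name and on-disk size, which is a quick way to sanity-check whether a model will fit in your GPU's memory before configuring it in OpenRAG.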
mistral-nemo:12b doesn't work. It can't perform a retrieval_call and bring back information from the knowledge base. Any other suggestions would be appreciated; my test environment has an 11GB GPU, so I can't run the 20B gpt-oss model.
Has anyone gotten Qwen3.5:9b working? gpt-oss:20b is the only local model that works for me so far. I'm running on a 16GB AMD GPU.