Skip to content

Conversation

@BBC-Esq
Copy link

@BBC-Esq BBC-Esq commented Jan 8, 2025

I've added prompt formats for models that I've personally tested as being especially good for RAG:

Zephyr:
Lots of models use the name "zephyr." This new sub-class pertains to the 1.6b and 3b models from stabilityai.. Oldies but goodies (DESPITE the fact that they are "relatively" older models.).

Qwen - pertains to Qwen 2.5 that everybody knows. Not sure if it'll work with the "coder" variants, however.

Exaone - overall exceptional for RAG IMHO and any respectable RAG program should have pre-made templates for these models.

Granite - another that's very good for RAG, relatively new but IBM created especially for enterprise level customer service chat based on a company's knowledge base. Like these new ones a lot.

@BBC-Esq BBC-Esq closed this by deleting the head repository Jan 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant