Skip to content

Conversation

@jeremie-lesage
Copy link

Summary

Switch to BGE-M3 embedding model with lazy loading.

Why

I needed better French support for my documents. The default model is optimized for English and wasn't giving good results in French. BGE-M3 handles multilingual content much better.

I also added lazy loading because the model takes ~2 min to load, which was slowing down server startup.

What do you think about making the embedding model configurable in the future?

Breaking change

Vector dimension changed from 768 to 1024, so existing indexes need to be rebuilt.

- Replace DefaultEmbeddingFunction with BGEM3EmbeddingFunction
- Add lazy loading for embedding model to improve startup time
- Update vector dimension from 768 to 1024 for BGE-M3 compatibility
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant