You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Added support for batch embedding of multiple text strings if using the all-Mini-L6-v2 model. As it is so small it barely uses the GPU so can batch process embedding to maximize embedding performance. Runs much faster now.
Added support for 2 Embedding models:
Google's EmbeddingGemma (highest quality and works with larger texts but 300Mb - 1024 token input, 768 dimensional output)
all-Mini-L6-v2 (fastest and only 30mb - 128 token input, 384 dimensional output)