You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+6-6Lines changed: 6 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,15 +18,15 @@ A Gradio web app showcasing multilingual semantic search and retrieval-augmented
18
18
19
19
## Embedding Model
20
20
21
-
The app uses a SEA-LION embedding model from HuggingFace. By default it uses `aisingapore/SEA-LION-Embedding-300M`, which is downloaded automatically on first run into the HuggingFace cache (`~/.cache/huggingface/`).
21
+
The app uses a SEA-LION embedding model from HuggingFace. By default it uses `aisingapore/SEA-LION-ModernBERT-Embedding-300M`, which is downloaded automatically on first run into the HuggingFace cache (`~/.cache/huggingface/`).
22
22
23
23
Browse the full collection of SEA-LION embedding models here: <https://huggingface.co/aisingapore/collections>
> **Note:** If the model is saved to a non-default location (i.e. not `~/.cache/huggingface/`), set the `HF_CACHE_PATH` environment variable to point to that directory so the app can find it.
@@ -70,10 +70,10 @@ vLLM, Amazon Bedrock Access Gateway, Google Vertex AI, and others are supported.
Open <http://localhost:7860> once the Gradio app container is running.
77
77
78
78
ChromaDB and the Gradio app run as separate services. Document index data is persisted in a named Docker volume (`chroma_data`).
79
79
@@ -113,11 +113,11 @@ Copy `.env.example` to `.env` and edit as needed. All variables are optional —
113
113
114
114
| Variable | Default | Description |
115
115
|---|---|---|
116
-
|`EMBEDDING_MODEL`|`aisingapore/SEA-LION-Embedding-300M`| HuggingFace model ID for embeddings |
116
+
|`EMBEDDING_MODEL`|`aisingapore/SEA-LION-ModernBERT-Embedding-300M`| HuggingFace model ID for embeddings |
117
117
|`HF_CACHE_PATH`|*(unset — uses `~/.cache/huggingface/`)*| Override HuggingFace cache directory; required when the model is pre-downloaded to a non-default path and when running in Docker with a mounted model cache |
118
118
|`OPENAI_BASE_URL`|`http://host.docker.internal:11434/v1`| LLM API base URL, use `http://localhost:11434/v1` if running local dev setup |
119
119
|`OPENAI_API_KEY`|`ollama`| LLM API key (`ollama` for local Ollama) |
120
-
|`LLM_MODEL`|`llama3`| Model name passed to the LLM API |
120
+
|`LLM_MODEL`|`aisingapore/Qwen-SEA-LION-v4-32B-IT`| Model name passed to the LLM API |
121
121
|`LLM_TEMPERATURE`|`0.3`| Generation temperature for RAG answers |
122
122
|`CHROMA_HOST`|`localhost`| ChromaDB host (`chromadb` when using Docker Compose) |
0 commit comments