Skip to content

Commit ce2f4be

Browse files
committed
update naming conventions and setup instructions
1 parent 0ec19e5 commit ce2f4be

2 files changed

Lines changed: 7 additions & 7 deletions

File tree

.env.example

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# Embedding model (HuggingFace model name)
2-
EMBEDDING_MODEL=aisingapore/SEA-LION-Embedding-300M
2+
EMBEDDING_MODEL=aisingapore/SEA-LION-ModernBERT-Embedding-300M
33

44
# LLM (OpenAI-compatible endpoint)
55
OPENAI_BASE_URL=http://host.docker.internal:11434/v1

README.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -18,15 +18,15 @@ A Gradio web app showcasing multilingual semantic search and retrieval-augmented
1818

1919
## Embedding Model
2020

21-
The app uses a SEA-LION embedding model from HuggingFace. By default it uses `aisingapore/SEA-LION-Embedding-300M`, which is downloaded automatically on first run into the HuggingFace cache (`~/.cache/huggingface/`).
21+
The app uses a SEA-LION embedding model from HuggingFace. By default it uses `aisingapore/SEA-LION-ModernBERT-Embedding-300M`, which is downloaded automatically on first run into the HuggingFace cache (`~/.cache/huggingface/`).
2222

2323
Browse the full collection of SEA-LION embedding models here: <https://huggingface.co/aisingapore/collections>
2424

2525
To pre-download a model manually:
2626

2727
```bash
2828
pip install huggingface_hub
29-
huggingface-cli download aisingapore/SEA-LION-Embedding-300M
29+
hf download aisingapore/SEA-LION-ModernBERT-Embedding-300M
3030
```
3131

3232
> **Note:** If the model is saved to a non-default location (i.e. not `~/.cache/huggingface/`), set the `HF_CACHE_PATH` environment variable to point to that directory so the app can find it.
@@ -70,10 +70,10 @@ vLLM, Amazon Bedrock Access Gateway, Google Vertex AI, and others are supported.
7070

7171
```bash
7272
cp .env.example .env # edit LLM_MODEL, OPENAI_BASE_URL, OPENAI_API_KEY
73-
docker compose up --build
73+
docker compose up --build -d
7474
```
7575

76-
Open <http://localhost:7860>.
76+
Open <http://localhost:7860> once the Gradio app container is running.
7777

7878
ChromaDB and the Gradio app run as separate services. Document index data is persisted in a named Docker volume (`chroma_data`).
7979

@@ -113,11 +113,11 @@ Copy `.env.example` to `.env` and edit as needed. All variables are optional —
113113

114114
| Variable | Default | Description |
115115
|---|---|---|
116-
| `EMBEDDING_MODEL` | `aisingapore/SEA-LION-Embedding-300M` | HuggingFace model ID for embeddings |
116+
| `EMBEDDING_MODEL` | `aisingapore/SEA-LION-ModernBERT-Embedding-300M` | HuggingFace model ID for embeddings |
117117
| `HF_CACHE_PATH` | *(unset — uses `~/.cache/huggingface/`)* | Override HuggingFace cache directory; required when the model is pre-downloaded to a non-default path and when running in Docker with a mounted model cache |
118118
| `OPENAI_BASE_URL` | `http://host.docker.internal:11434/v1` | LLM API base URL, use `http://localhost:11434/v1` if running local dev setup |
119119
| `OPENAI_API_KEY` | `ollama` | LLM API key (`ollama` for local Ollama) |
120-
| `LLM_MODEL` | `llama3` | Model name passed to the LLM API |
120+
| `LLM_MODEL` | `aisingapore/Qwen-SEA-LION-v4-32B-IT` | Model name passed to the LLM API |
121121
| `LLM_TEMPERATURE` | `0.3` | Generation temperature for RAG answers |
122122
| `CHROMA_HOST` | `localhost` | ChromaDB host (`chromadb` when using Docker Compose) |
123123
| `CHROMA_PORT` | `8000` | ChromaDB port |

0 commit comments

Comments
 (0)