Skip to content

Commit 1c1f5b4

Browse files
committed
Add instructions to run it as a docker container
- Note that it requires Milvus to be installed elsewhere (also in docker or iron or cloud).
1 parent e8c90f6 commit 1c1f5b4

File tree

2 files changed

+38
-13
lines changed

2 files changed

+38
-13
lines changed

README.md

+37-12
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,15 @@ To get started with Wiki-RAG, ensure you have the following:
1717
- [Docker](https://www.docker.com/get-started) (if you intend to run the project using Docker)
1818
- [Milvus 2.5.5](https://milvus.io/docs/release_notes.md#v255) or later (for vector similarity search). Standalone or Distributed deployments are supported. Lite deployments are not supported. It's highly recommended to use the [Docker Compose deployment](https://milvus.io/docs/install_standalone-docker-compose.md) specially for testing and development purposes.
1919

20+
## Configuration
21+
22+
1. **Set Environment Variables (to be replaced soon by `config.yml`file)**
23+
- Using the [env_template](env_template) file as source, create a new `.env` file:
24+
```bash
25+
cp env_template .env
26+
```
27+
- Edit it to suit your needs (mediawiki site and namespaces or exclusions, models to use for both embeddings and generation, etc.)
28+
2029
## Installation
2130

2231
### Running Locally
@@ -42,29 +51,45 @@ To get started with Wiki-RAG, ensure you have the following:
4251
pip install -e .[dev]
4352
```
4453

45-
4. **Set Environment Variables**:
46-
- Using the [env_template](env_template) file as source, create a new `.env` file:
47-
```bash
48-
cp env_template .env
49-
```
50-
- Edit it to suit your needs (mediawiki site and namespaces or exclusions, models to use for both embeddings and generation, etc.)
51-
52-
5. **Run the Application**:
54+
4. **Run the Application**:
5355
The application comes with four different executables:
5456
- `wr-load`: Will parse all the configured pages in the source Mediawiki site, extracting contents and other important metadata. All the generated information will be stored into a JSON file in the `data` directory.
5557
- `wr-index`: In charge of creating the collection in the vector index (Milvus) with all the information extracted in the previous step.
5658
- `wr-search`: A tiny CLI utility to perform searches against the RAG system from the command line.
5759
- `wr-server`: A comprehensive and secure web service (documented with OpenAPI) that allows users to interact with the RAG system using the OpenAI API (`v1/models` and `v1/chat/completions` endpoints) as if it were a large language model (LLM).
5860

59-
### Running with Docker
61+
### Running with Docker (Milvus elsewhere)
6062

61-
_Coming soon..._
63+
1. Pull the image from GitHub [Container Registry](https://github.com/moodlehq/wiki-rag/pkgs/container/wiki-rag):
64+
```bash
65+
docker pull ghcr.io/moodlehq/wiki-rag:latest # or specify a tag
66+
```
67+
68+
2. Create local "data" directory and run the container:
69+
```bash
70+
mkdir data
71+
docker run --rm --detach \
72+
--volume $(pwd)/data:/app/data \
73+
--volume $(pwd)/.env:/app/.env \
74+
--env MILVUS_URL=http://milvus-standalone:19530 \
75+
--network milvus \
76+
--publish 8080:8080 \
77+
--env OPENAI_API_KEY=YOUR_OPENAI_API_KEY \
78+
--env LOG_LEVEL=info \
79+
--name wiki-rag \
80+
wiki-rag:latest
81+
```
82+
- Note 1: The command above will start the `wr-server` automatically, listening on the configured port (8080) and the `OPENAI_API_KEY` is required to interact with the embedding and LLM models. If, instead, you want to execute any of the other commands (`wr-load`, `wr-index`, `wr-search`), you can specify it as the last argument.
83+
- Note 2: The 2 lines related to Milvus are required to connect to the Milvus server **if also running in Docker**. If it's running elsewhere, you can replace the `MILVUS_URL` with the appropriate URL or configure it in the `.env` file instead
84+
- Note 3: You can use `docker logs wiki-rag` to check the logs of the running container (`wr-server` logs).
85+
- Note 4: When running the `wr-server`, you still can execute any of the commands (`wr-load`, `wr-index`, `wr-search`) using `docker exec -it wiki-rag <command>`.
86+
- Note 5: To stop and remove the container, you can use `docker stop wiki-rag`.
6287
63-
### Features
88+
## Features
6489
6590
_Coming soon..._
6691
67-
### Future Work
92+
## Future Work
6893
6994
_Coming soon..._
7095

env_template

+1-1
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ EMBEDDING_DIMENSIONS=768
2424
LLM_MODEL="your_llm_model"
2525

2626
# OpenAI wrapper base URL.
27-
WRAPPER_API_BASE="0.0.0.0:8000"
27+
WRAPPER_API_BASE="0.0.0.0:8080"
2828
# To limit the number of OpenAI history turns allowed in a chat (0 = no limit).
2929
WRAPPER_CHAT_MAX_TURNS=10
3030
# To limit the number of OpenAI history tokens allowed in a chat (0 = no limit).

0 commit comments

Comments
 (0)