You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+37-12
Original file line number
Diff line number
Diff line change
@@ -17,6 +17,15 @@ To get started with Wiki-RAG, ensure you have the following:
17
17
-[Docker](https://www.docker.com/get-started) (if you intend to run the project using Docker)
18
18
-[Milvus 2.5.5](https://milvus.io/docs/release_notes.md#v255) or later (for vector similarity search). Standalone or Distributed deployments are supported. Lite deployments are not supported. It's highly recommended to use the [Docker Compose deployment](https://milvus.io/docs/install_standalone-docker-compose.md) specially for testing and development purposes.
19
19
20
+
## Configuration
21
+
22
+
1.**Set Environment Variables (to be replaced soon by `config.yml`file)**
23
+
- Using the [env_template](env_template) file as source, create a new `.env` file:
24
+
```bash
25
+
cp env_template .env
26
+
```
27
+
- Edit it to suit your needs (mediawiki site and namespaces or exclusions, models to use for both embeddings and generation, etc.)
28
+
20
29
## Installation
21
30
22
31
### Running Locally
@@ -42,29 +51,45 @@ To get started with Wiki-RAG, ensure you have the following:
42
51
pip install -e .[dev]
43
52
```
44
53
45
-
4. **Set Environment Variables**:
46
-
- Using the [env_template](env_template) file as source, create a new `.env` file:
47
-
```bash
48
-
cp env_template .env
49
-
```
50
-
- Edit it to suit your needs (mediawiki site and namespaces or exclusions, models to use for both embeddings and generation, etc.)
51
-
52
-
5. **Run the Application**:
54
+
4. **Run the Application**:
53
55
The application comes with four different executables:
54
56
- `wr-load`: Will parse all the configured pages in the source Mediawiki site, extracting contents and other important metadata. All the generated information will be stored into a JSON file in the `data` directory.
55
57
- `wr-index`: In charge of creating the collection in the vector index (Milvus) with all the information extracted in the previous step.
56
58
- `wr-search`: A tiny CLI utility to perform searches against the RAG system from the command line.
57
59
- `wr-server`: A comprehensive and secure web service (documented with OpenAPI) that allows users to interact with the RAG system using the OpenAI API (`v1/models` and `v1/chat/completions` endpoints) as if it were a large language model (LLM).
58
60
59
-
### Running with Docker
61
+
### Running with Docker (Milvus elsewhere)
60
62
61
-
_Coming soon..._
63
+
1. Pull the image from GitHub [Container Registry](https://github.com/moodlehq/wiki-rag/pkgs/container/wiki-rag):
64
+
```bash
65
+
docker pull ghcr.io/moodlehq/wiki-rag:latest # or specify a tag
66
+
```
67
+
68
+
2. Create local"data" directory and run the container:
69
+
```bash
70
+
mkdir data
71
+
docker run --rm --detach \
72
+
--volume $(pwd)/data:/app/data \
73
+
--volume $(pwd)/.env:/app/.env \
74
+
--env MILVUS_URL=http://milvus-standalone:19530 \
75
+
--network milvus \
76
+
--publish 8080:8080 \
77
+
--env OPENAI_API_KEY=YOUR_OPENAI_API_KEY \
78
+
--env LOG_LEVEL=info \
79
+
--name wiki-rag \
80
+
wiki-rag:latest
81
+
```
82
+
- Note 1: The command above will start the `wr-server` automatically, listening on the configured port (8080) and the `OPENAI_API_KEY` is required to interact with the embedding and LLM models. If, instead, you want to execute any of the other commands (`wr-load`, `wr-index`, `wr-search`), you can specify it as the last argument.
83
+
- Note 2: The 2 lines related to Milvus are required to connect to the Milvus server **if also running in Docker**. If it's running elsewhere, you can replace the `MILVUS_URL` with the appropriate URL or configure it in the `.env` file instead
84
+
- Note 3: You can use `docker logs wiki-rag` to check the logs of the running container (`wr-server` logs).
85
+
- Note 4: When running the `wr-server`, you still can execute any of the commands (`wr-load`, `wr-index`, `wr-search`) using `docker exec -it wiki-rag <command>`.
86
+
- Note 5: To stop and remove the container, you can use `docker stop wiki-rag`.
0 commit comments