You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/source/core_features/colvision.md
+76-64Lines changed: 76 additions & 64 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,10 +1,49 @@
1
-
# 🖼️ ColPali Integration
1
+
# 🖼️ ColVision Integration
2
2
3
-
## Overview
3
+
PDF retrieval pipeline using ColVision embeddings, stored in Milvus.
4
4
5
-
This module provides a complete pipeline for processing PDF documents with ColPali embeddings, storing them in a Milvus vector database, and performing semantic search.
5
+
## Installation
6
6
7
-
It is designed for efficient document retrieval and RAG applications.
7
+
The `[colvision]` extra is mutually exclusive with `[process]` — use a dedicated venv.
8
+
9
+
```bash
10
+
uv sync --extra colvision
11
+
```
12
+
13
+
## Supported Models
14
+
15
+
| Model |`model_name`|
16
+
|---|---|
17
+
| ColPali v1.3 |`vidore/colpali-v1.3`|
18
+
| ColQwen2 v1.0 |`vidore/colqwen2-v1.0`|
19
+
| ColQwen2.5 v0.2 |`vidore/colqwen2.5-v0.2`|
20
+
| ColGemma3 |`Cognitive-Lab/ColNetraEmbed`|
21
+
| ColSmol 256M |`vidore/colSmol-256M`|
22
+
| ColSmol 500M |`vidore/colSmol-500M`|
23
+
24
+
All models are installed with the single `[colvision]` extra.
25
+
26
+
The model/processor class is auto-detected from `model_name`, and the embedding dimension is inferred at every stage (from the loaded model at `process` / `retrieve` time, from the parquet contents at `index` time).
27
+
28
+
## Choosing a Model
29
+
30
+
Set `model_name` in the YAML config, or override it via the `-m` / `--model` CLI flag on the `process` and `retrieve` commands.
31
+
32
+
The pipeline runs in three steps — `process`, then `index`, then `retrieve` — and the
33
+
`-m` / `--model` flag must be passed to both `process` and `retrieve`:
34
+
35
+
```bash
36
+
# 1. Process PDFs into embeddings
37
+
python3 -m mmore colvision process --config-file examples/colvision/config_process.yml -m vidore/colqwen2.5-v0.2
38
+
39
+
# 2. Index the embeddings into Milvus (no model needed here)
40
+
python3 -m mmore colvision index --config-file examples/colvision/config_index.yml
41
+
42
+
# 3. Retrieve with the same model used at processing time
Copy file name to clipboardExpand all lines: docs/source/getting_started/architecture.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,7 +16,7 @@ MMORE is designed as a multimodal ingestion and retrieval framework for heteroge
16
16
MMORE is organized around three main executable stages:
17
17
18
18
-`run_process`, which handles ingestion, crawling, dispatching, and document processing
19
-
-`run_indexer`, which builds the searchable index and can integrate multimodal retrieval components such as ColPali
19
+
-`run_indexer`, which builds the searchable index and can integrate multimodal retrieval components such as ColVision models
20
20
-`run_rag`, which serves retrieval and RAG workflows through interfaces such as the API and CLI
21
21
22
22
These stages interact with intermediate outputs, the vector database, and optional external components such as hosted LLM endpoints, WebRAG, or Live RAG.
@@ -104,9 +104,9 @@ That means the framework may work with:
104
104
- plain text documents
105
105
- structured metadata
106
106
- images or layout-aware representations
107
-
- multimodal retrieval models such as ColPali-related components
107
+
- multimodal retrieval models such as ColVision-related components
108
108
109
-
See [ColPali](../core_features/colpali.md) for the multimodal retrieval side.
109
+
See [ColVision](../core_features/colvision.md) for the multimodal retrieval side.
Copy file name to clipboardExpand all lines: docs/source/getting_started/installation.md
+7-1Lines changed: 7 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -102,8 +102,14 @@ cd mmore
102
102
103
103
### Step 4: Install the project and dependencies
104
104
105
+
Pick the pipeline you intend to use. The standard pipeline (`[process]`, with text/document extraction) and the ColVision pipeline (`[colvision]`, with vision-based RAG) are **mutually exclusive** — set up a separate venv if you need both.
106
+
105
107
```bash
106
-
uv sync
108
+
# Standard pipeline (document processing + text RAG)
109
+
uv sync --extra process --extra rag
110
+
111
+
# OR ColVision pipeline (vision-based RAG: ColPali, ColQwen2/2.5, ColGemma3, ColSmol)
0 commit comments