**Total recommended setup**: 3× H100 + 1× L40S (or 4× H100 for a uniform configuration). The embeddings model can be deployed on the same GPU as the FLUX or TRELLIS models.

### Deployment Options

@@ -146,6 +149,10 @@ Make sure you have accepted [https://huggingface.co/black-forest-labs/FLUX.1-Kon

 trellis:
   url: "http://localhost:8004/v1/infer"  # Your TRELLIS NIM endpoint
+
+embeddings:
+  url: "http://localhost:8005/v1"  # Your Embeddings NIM endpoint
+  model: "nvidia/nv-embedqa-e5-v5"
 ```

See the **[Docker Deployment Guide](docs/DOCKER.md)** for instructions on deploying these NIMs.
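
Once the embeddings NIM from the config above is running, a quick request can confirm that the endpoint and model name line up. The following is a minimal sketch, not part of this change: it assumes the NIM exposes an OpenAI-style `POST /embeddings` route under the configured base URL, accepts an `input_type` field, and returns vectors under `data[i]["embedding"]`. None of that is stated in this diff, and the helper names `build_embedding_request` and `embed` are hypothetical.

```python
import json
import urllib.request

# Values copied from the `embeddings` block of the config shown above.
EMBEDDINGS_URL = "http://localhost:8005/v1"
EMBEDDINGS_MODEL = "nvidia/nv-embedqa-e5-v5"


def build_embedding_request(texts, base_url=EMBEDDINGS_URL, model=EMBEDDINGS_MODEL):
    """Build (but do not send) a POST request for a batch of texts.

    Assumption: the route is `<base_url>/embeddings` and the body follows
    the OpenAI embeddings shape plus an `input_type` field.
    """
    payload = {"input": list(texts), "model": model, "input_type": "query"}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(texts):
    """Send the request and return one embedding vector per input text."""
    with urllib.request.urlopen(build_embedding_request(texts), timeout=30) as resp:
        body = json.load(resp)
    # Assumption: the response lists vectors under data[i]["embedding"].
    return [item["embedding"] for item in body["data"]]
```

Calling `embed(["What is the return policy?"])` against a running container should return a single vector if the assumptions hold; building the request separately makes it easy to inspect the URL and payload without any network access.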
@@ -166,7 +173,7 @@ The frontend at `http://localhost:3000`.

 ### Docker Deployment (Self-Hosted NIMs)

-The Docker deployment includes all required self-hosted NVIDIA NIM containers (Nemotron VLM, Nemotron LLM, FLUX, and TRELLIS). The `shared/config/config.yaml` is pre-configured with the correct service URLs for Docker networking.
+The Docker deployment includes all required self-hosted NVIDIA NIM containers (Nemotron VLM, Nemotron LLM, FLUX, and TRELLIS). If you want to use uploaded policy PDFs in the UI, start the companion Milvus stack from `docker-compose.rag.yml` as well. The `shared/config/config.yaml` is pre-configured with the correct service URLs for Docker networking.

 For complete Docker deployment instructions, see the **[Docker Deployment Guide](docs/DOCKER.md)**.

@@ -185,15 +192,27 @@ For complete Docker deployment instructions, see the **[Docker Deployment Guide]
 chmod a+w "$LOCAL_NIM_CACHE"
 ```

-3. **Start all services**:
+3. **Create the shared Docker network**:
+```bash
+docker network create catalog-network || true
+```
+
+4. **Start the policy RAG stack**:
+```bash
+docker compose -f docker-compose.rag.yml up -d
+```
+
+5. **Start the application stack**:
 ```bash
-docker-compose up -d
+docker compose up -d
 ```

-4. **Access the application**:
+6. **Access the application**:
 - Frontend: `http://localhost:3000`
 - Backend API: `http://localhost:8000`
 - Health Check: `http://localhost:8000/health`
+- Milvus: `localhost:19530`
+- MinIO Console: `http://localhost:9001`

 ## API Endpoints

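
After the stacks come up, the endpoints listed in the access step can be polled until they respond. The sketch below is illustrative only: it assumes the backend health route returns HTTP 200 once it is ready, and it treats an open TCP port on `19530` as a rough proxy for Milvus being reachable (which is weaker than Milvus being fully ready). The helper names `http_ok`, `port_open`, and `wait_until` are hypothetical.

```python
import socket
import time
import urllib.error
import urllib.request


def http_ok(url, timeout=5.0):
    """Return True if a GET to `url` answers with HTTP 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        return False


def port_open(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def wait_until(check, attempts=30, delay=10.0, sleep=time.sleep):
    """Call `check` until it returns True or `attempts` run out."""
    for attempt in range(attempts):
        if check():
            return True
        if attempt < attempts - 1:
            sleep(delay)  # No sleep after the final failed attempt.
    return False


if __name__ == "__main__":
    backend = wait_until(lambda: http_ok("http://localhost:8000/health"))
    milvus = wait_until(lambda: port_open("localhost", 19530))
    print(f"backend healthy: {backend}, milvus reachable: {milvus}")
```

The `sleep` parameter is injectable so the retry loop can be exercised without real delays; the default of 30 attempts at 10 seconds is an arbitrary choice, not a value taken from the project.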
@@ -211,12 +230,12 @@ For detailed API documentation with request/response examples, see **[API Docume

 ## License

-GOVERNING TERMS: The Blueprint scripts are governed by Apache License, Version 2.0, and enables use of separate open source and proprietary software governed by their respective licenses: [NVIDIA-Nemotron-Nano-12B-v2-VL](https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/nemotron-nano-12b-v2-vl?version=1), [Nemotron-Nano-V3](https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/nemotron-3-nano?version=1.7.0), [FLUX.1-Kontext-Dev](https://huggingface.co/black-forest-labs/FLUX.1-Kontext-dev/blob/main/LICENSE.md), and [Microsoft TRELLIS](https://catalog.ngc.nvidia.com/orgs/nim/teams/microsoft/containers/trellis?version=1).
+GOVERNING TERMS: The Blueprint scripts are governed by Apache License, Version 2.0, and enables use of separate open source and proprietary software governed by their respective licenses: [NVIDIA-Nemotron-Nano-12B-v2-VL](https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/nemotron-nano-12b-v2-vl?version=1), [Nemotron-Nano-V3](https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/nemotron-3-nano?version=1.7.0), [nv-embedqa-e5-v5](https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/nv-embedqa-e5-v5?version=latest), [FLUX.1-Kontext-Dev](https://huggingface.co/black-forest-labs/FLUX.1-Kontext-dev/blob/main/LICENSE.md), and [Microsoft TRELLIS](https://catalog.ngc.nvidia.com/orgs/nim/teams/microsoft/containers/trellis?version=1).
 The FLUX Kontext model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see link to: black-forest-labs/FLUX.1-Kontext-dev Model Card - [https://huggingface.co/black-forest-labs/FLUX.1-Kontext-dev](https://huggingface.co/black-forest-labs/FLUX.1-Kontext-dev).

-This project will download and install additional third-party open source software projects. Review the license terms of these open source projects before use.
+This project will download and install additional third-party open source software projects. Review the license terms of these open source projects before use.
deploy/1_Deploy_Catalog_Enrichment.ipynb: 14 additions & 6 deletions
@@ -357,7 +357,7 @@
 "source": [
 "<a id=\"spin-up-blueprint\"></a>\n",
 "## Spin Up Blueprint\n",
-"Docker compose scripts are provided which spin up the microservices on a single node. This docker-compose yaml file will start the agents as well as dependant microservices. This may take up to **15 minutes** to complete.\n"
+"Docker compose scripts are provided which spin up the microservices on a single node. Start by creating the shared Docker network, then launch the Milvus policy RAG stack from `docker-compose.rag.yml`, and finally bring up the main application stack. This may take up to **15 minutes** to complete.\n"
-"To shut down the microservices, run the following command"
+"To shut down the microservices, run the following commands"
 ]
 },
 {
@@ -539,7 +545,8 @@
 "metadata": {},
 "outputs": [],
 "source": [
-"!docker compose down > /dev/null 2>&1"
+"!docker compose down > /dev/null 2>&1\n",
+"!docker compose -f docker-compose.rag.yml down > /dev/null 2>&1"
 ]
 },
 {
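
The bring-up order from the README and the teardown order from the notebook cell above can be captured in one place so that both stay in sync. This is a rough sketch rather than anything shipped in this change: the function names `startup_commands`, `shutdown_commands`, and `run_all` are hypothetical, and tolerating a failed `docker network create` is meant to mirror the `|| true` in the README step.

```python
import subprocess

RAG_COMPOSE_FILE = "docker-compose.rag.yml"
NETWORK = "catalog-network"


def startup_commands():
    """Commands in the order the README lists them."""
    return [
        ["docker", "network", "create", NETWORK],
        ["docker", "compose", "-f", RAG_COMPOSE_FILE, "up", "-d"],
        ["docker", "compose", "up", "-d"],
    ]


def shutdown_commands():
    """Commands in the order the notebook shutdown cell runs them."""
    return [
        ["docker", "compose", "down"],
        ["docker", "compose", "-f", RAG_COMPOSE_FILE, "down"],
    ]


def run_all(commands, tolerate_failure=frozenset(), runner=subprocess.run):
    """Run each command; raise on failure unless its index is tolerated."""
    for index, argv in enumerate(commands):
        result = runner(argv)
        if result.returncode != 0 and index not in tolerate_failure:
            raise RuntimeError(f"command failed: {' '.join(argv)}")
```

For example, `run_all(startup_commands(), tolerate_failure={0})` ignores an "already exists" error from the network step but stops on any compose failure, and `run_all(shutdown_commands())` stops the application stack before the RAG stack.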
@@ -577,7 +584,8 @@
 "\n",
 "**Explanation:** When running the blueprint for the first time, all models need to be downloaded from their respective sources. Depending on your internet connection speed, this process can take 20-30 minutes or longer. The models include:\n",
 "- NVIDIA Nemotron VLM\n",
-"- NVIDIA Nemotron LLM \n",
+"- NVIDIA Nemotron LLM \n",
+"- NVIDIA Embeddings \n",
 "- FLUX image generation model\n",
 "- TRELLIS 3D asset generation model\n",
 "\n",
@@ -596,7 +604,7 @@
 "source": [
 "## LICENSE\n",
 "\n",
-"GOVERNING TERMS: The Blueprint scripts are governed by Apache License, Version 2.0, and enables use of separate open source and proprietary software governed by their respective licenses: [NVIDIA-Nemotron-Nano-12B-v2-VL](https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/nemotron-nano-12b-v2-vl?version=1), [Nemotron-Nano-V3](https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/nemotron-3-nano?version=1.7.0), [FLUX.1-Kontext-Dev](https://huggingface.co/black-forest-labs/FLUX.1-Kontext-dev/blob/main/LICENSE.md), and [Microsoft TRELLIS](https://catalog.ngc.nvidia.com/orgs/nim/teams/microsoft/containers/trellis?version=1).\n",
+"GOVERNING TERMS: The Blueprint scripts are governed by Apache License, Version 2.0, and enables use of separate open source and proprietary software governed by their respective licenses: [NVIDIA-Nemotron-Nano-12B-v2-VL](https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/nemotron-nano-12b-v2-vl?version=1), [Nemotron-Nano-V3](https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/nemotron-3-nano?version=1.7.0), [nv-embedqa-e5-v5](https://catalog.ngc.nvidia.com/orgs/nim/teams/nvidia/containers/nv-embedqa-e5-v5?version=latest), [FLUX.1-Kontext-Dev](https://huggingface.co/black-forest-labs/FLUX.1-Kontext-dev/blob/main/LICENSE.md), and [Microsoft TRELLIS](https://catalog.ngc.nvidia.com/orgs/nim/teams/microsoft/containers/trellis?version=1).\n",