This section shows how to deploy the Visual Search and QA (VSQA) Application using a Helm chart.
Before you begin, ensure that you have the following:
- Kubernetes* cluster set up and running.
- The cluster must support dynamic provisioning of Persistent Volumes (PV). Refer to the Kubernetes Dynamic Provisioning Guide for more details.
- `kubectl` installed on your system and configured with access to the Kubernetes cluster. See the Installation Guide.
- Helm installed on your system. See the Installation Guide.
Do the following to deploy VSQA using the Helm chart.
Use the following command to pull the Helm chart from Docker Hub:
```bash
helm pull oci://registry-1.docker.io/intel/metro-ai-suite-vsqa-chart
```
You may add `--version <version-no>` to specify a version number. Refer to the release notes for details on the latest version number to use for the sample application.
After pulling the chart, extract the `.tgz` file:
```bash
tar -xvf metro-ai-suite-vsqa-chart-<version-no>.tgz
```
This creates a directory named `metro-ai-suite-vsqa-chart` containing the chart files. Navigate to the extracted directory to access the charts:
```bash
cd metro-ai-suite-vsqa-chart
```
Clone the source repository:
```bash
git clone https://github.com/open-edge-platform/edge-ai-suites.git
```
Navigate to the chart directory:
```bash
cd edge-ai-suites/metro-ai-suite/visual-search-question-and-answering/deployment/helm-chart
```
Edit the `values.yaml` file to set the necessary environment variables. At minimum, set the models and proxy settings as required.
| Key | Description | Example Value |
|---|---|---|
| `global.proxy.http_proxy` | HTTP proxy, if required | `http://proxy-example.com:000` |
| `global.proxy.https_proxy` | HTTPS proxy, if required | `http://proxy-example.com:000` |
| `global.VLM_MODEL_NAME` | VLM model to be used by vlm-openvino-serving | `Qwen/Qwen2.5-VL-7B-Instruct` |
| `global.EMBEDDING_MODEL_NAME` | Embedding model to be used for feature extraction by multimodal-embedding-serving | `CLIP/clip-vit-h-14` |
| `global.registry` | Remote registry to pull images from. Default is blank. | `intel/` |
| `global.env.keeppvc` | Set to `true` to persist the storage. Default is `false`. | `false` |
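Assuming the keys in the table map onto nested YAML in `values.yaml` (the exact layout may differ between chart versions, so verify against the file you extracted), the edit might look like this:

```yaml
global:
  proxy:
    http_proxy: "http://proxy-example.com:000"    # your HTTP proxy, if required
    https_proxy: "http://proxy-example.com:000"   # your HTTPS proxy, if required
  VLM_MODEL_NAME: "Qwen/Qwen2.5-VL-7B-Instruct"
  EMBEDDING_MODEL_NAME: "CLIP/clip-vit-h-14"
  registry: ""          # blank by default; e.g. "intel/" to pull from a remote registry
  env:
    keeppvc: false      # set to true to persist the storage
```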
Navigate to the chart directory and build the Helm dependencies using the following command:
```bash
helm dependency update
```
Create a namespace for Milvus:
```bash
kubectl create namespace milvus
```
Install the latest Milvus Helm chart:
```bash
helm repo add milvus https://zilliztech.github.io/milvus-helm/
helm repo update
```
Deploy Milvus in a simplified standalone mode:
```bash
helm install my-milvus milvus/milvus -n milvus --set image.all.tag=v2.6.0 --set cluster.enabled=false --set etcd.replicaCount=1 --set minio.mode=standalone --set pulsar.enabled=false --set pulsarv3.enabled=false
```
Note: If you need customized settings for Milvus, refer to the official guide.
Check the pod status with `kubectl get po -n milvus`. Restarts are possible; as long as the three pods stabilize after a while, the deployment is successful.
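The check above can also be scripted. This sketch parses the status column of `kubectl get po` output and reports whether every pod is `Running`; sample output (with illustrative pod names) is inlined so it can be tried without a cluster — in practice, replace the heredoc with the real `kubectl get po -n milvus --no-headers` output:

```bash
#!/bin/sh
# Extract the STATUS column (3rd field) from sample `kubectl get po` output.
# Replace the heredoc with: kubectl get po -n milvus --no-headers
status_col=$(awk '{print $3}' <<'EOF'
my-milvus-etcd-0        1/1   Running   2     5m
my-milvus-minio-0       1/1   Running   0     5m
my-milvus-standalone-0  1/1   Running   1     5m
EOF
)
# If any line is not "Running", the deployment is not stable yet.
if echo "$status_col" | grep -qv '^Running$'; then
  echo "some pods are not Running yet"
else
  echo "all pods Running"
fi
```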
Create the host data directory:
```bash
mkdir -p $HOME/data
```
Make sure the host directories are available to the cluster nodes, and that the host paths under the `volumes.hostDataPath` section in the `values.yaml` file match the correct directories. In particular, the default path in `values.yaml` is `/home/user/data`, which corresponds to a host username of `user`.
Note: Supported media types are jpg, png, and mp4.
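As a quick sanity check, you can list any files in the data directory that are not one of the supported types before ingestion. The directory and filenames below are illustrative; point `find` at your actual data directory (e.g. `$HOME/data`):

```bash
#!/bin/sh
# Illustrative data directory; in practice, use your real one (e.g. $HOME/data).
mkdir -p /tmp/vsqa-data
touch /tmp/vsqa-data/cat.jpg /tmp/vsqa-data/dog.png \
      /tmp/vsqa-data/clip.mp4 /tmp/vsqa-data/notes.txt
# Print files whose extension is NOT one of the supported media types.
find /tmp/vsqa-data -type f \
  ! \( -name '*.jpg' -o -name '*.png' -o -name '*.mp4' \) -print
```

Any file it prints (here, `notes.txt`) would not be a supported media type and should be removed before ingestion.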
Create a namespace for the VSQA app:
```bash
kubectl create namespace vsqa
```
Install the chart:
```bash
helm install vsqa . --values values.yaml -n vsqa
```
Check the status of the deployed resources to ensure everything is running correctly:
```bash
kubectl get pods -n vsqa
kubectl get services -n vsqa
```
Ensure all pods are in the "Running" state before proceeding.
For simpler access, you can set up a port forward:
```bash
kubectl port-forward -n vsqa svc/visual-search-qa-app 17580:17580
```
Leave the session alive, then access http://localhost:17580 to view the application.
To uninstall, use the following command:
```bash
helm uninstall vsqa -n vsqa
helm uninstall my-milvus -n milvus
```
- Ensure that all pods are running and the services are accessible.
- If you encounter any issues during the deployment process, check the Kubernetes logs for errors:
  ```bash
  kubectl logs <pod-name> -n <your-namespace>
  ```
- If the data preparation pod shows an error while loading a large dataset, the dataset may be too large. Try breaking the dataset into smaller subsets and ingesting each of them instead.
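One way to break a dataset into smaller subsets is to move the files into numbered batch directories and ingest one directory at a time. This is a sketch only: the directory, filenames, and batch size below are illustrative, and the source does not prescribe a particular batch size.

```bash
#!/bin/sh
# Split files in DATA_DIR into subset directories of at most BATCH files
# each, so they can be ingested one subset at a time.
DATA_DIR=/tmp/vsqa-dataset   # illustrative; use your real data directory
BATCH=2                      # illustrative; pick a size your ingestion handles
mkdir -p "$DATA_DIR"
touch "$DATA_DIR/a.jpg" "$DATA_DIR/b.jpg" "$DATA_DIR/c.jpg" \
      "$DATA_DIR/d.jpg" "$DATA_DIR/e.jpg"
i=0
n=0
for f in "$DATA_DIR"/*.jpg; do
  # Start a new subset directory every BATCH files.
  if [ $((n % BATCH)) -eq 0 ]; then
    i=$((i + 1))
    mkdir -p "$DATA_DIR/subset-$i"
  fi
  mv "$f" "$DATA_DIR/subset-$i/"
  n=$((n + 1))
done
```

With five files and a batch size of 2, this produces `subset-1` and `subset-2` with two files each and `subset-3` with one; each subset can then be ingested separately.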