Vespa Core

Standalone Vespa deployment for Xyne projects. This repository contains all Vespa-related schemas, deployment scripts, and configuration that can be run independently from the main Xyne application.

Overview

Vespa runs as its own stack, separate from the main Xyne application. Applications connect to it over HTTP: one endpoint for document indexing (feed) and one for search (query).

Directory Structure

vespa-core/
├── vespa/
│   ├── schemas/              # 14 Vespa schema definitions
│   │   ├── file.sd
│   │   ├── user.sd
│   │   ├── mail.sd
│   │   └── ...
│   ├── services.xml          # Vespa service configuration
│   ├── deploy.sh             # Deployment script (local)
│   ├── deploy-docker.sh      # Deployment script (Docker)
│   ├── deploy-pod.sh         # Deployment script (Kubernetes)
│   ├── reindex.sh            # Reindexing script
│   ├── replaceDIMS.ts        # Replace embedding dimensions
│   └── models/               # Embedding models (downloaded)
├── deployment/
│   ├── Dockerfile-vespa-gpu  # GPU-enabled Vespa image
│   └── vespa-deploy/         # Deployment container
├── monitoring/
│   ├── prometheus.yml        # Prometheus configuration
│   └── vespa-detailed-monitoring.json  # Grafana dashboard
├── docker-compose.yml        # Standalone Vespa deployment
├── init-vespa.sh             # Initialize Vespa data directories
└── README.md

Schemas

This instance includes 14 schemas:

file - Drive files with embeddings
user - User profiles
mail - Email documents
mail_attachment - Email attachments
event - Calendar events
chat_message - Slack/Teams messages
chat_container - Slack channels/Teams
chat_user - Chat platform users
chat_team - Slack workspaces/Teams
chat_attachment - Chat attachments
datasource - Custom data sources
datasource_file - Data source files
kb_items - Knowledge base items
user_query - User search history

Quick Start

1. Local Development

# Start Vespa
docker-compose up -d

# Wait for Vespa to be ready (30-60 seconds)
docker logs vespa -f

# Deploy schemas
cd vespa
./deploy-docker.sh

# Verify deployment
curl http://localhost:8080/status.html
curl http://localhost:8081/status.html

2. VPC Deployment (Recommended for Production)

# On your VPC server (e.g., 10.0.2.50)
git clone <vespa-core-repo>
cd vespa-core

# Set embedding model
export EMBEDDING_MODEL=bge-base-en-v1.5

# Start Vespa
docker-compose up -d

# Deploy schemas
cd vespa
./deploy-docker.sh

Applications connect via the server's address (10.0.2.50 in this example):

const client = new VespaClient({
  feedEndpoint: 'http://10.0.2.50:8080',
  queryEndpoint: 'http://10.0.2.50:8081'
})

3. With Monitoring

# Start with Prometheus & Grafana
docker-compose --profile monitoring up -d

# Access Grafana: http://localhost:3002
# Import dashboard from monitoring/vespa-detailed-monitoring.json

Deployment

Environment Variables

# Required
EMBEDDING_MODEL=bge-base-en-v1.5  # or bge-small-en-v1.5, bge-large-en-v1.5

# Optional
VESPA_FEED_PORT=8080
VESPA_QUERY_PORT=8081
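
The variables above could be read and validated in an application roughly as follows. This is a minimal sketch, not code from this repository: `readVespaEnv`, `parsePort`, and `VespaSettings` are hypothetical names, and treating 8080 and 8081 as fallback values when the optional variables are unset is an assumption.

```typescript
// Environment passed in explicitly so the function is easy to test.
type Env = Record<string, string | undefined>

// Hypothetical settings shape; not part of this repository.
interface VespaSettings {
  embeddingModel: string
  feedPort: number
  queryPort: number
}

// Parses a port string, falling back when the variable is unset or empty.
function parsePort(raw: string | undefined, fallback: number): number {
  if (raw === undefined || raw === '') return fallback
  const port = Number(raw)
  if (!Number.isInteger(port) || port < 1 || port > 65535) {
    throw new Error(`Invalid port: ${raw}`)
  }
  return port
}

// EMBEDDING_MODEL is required; the two ports are optional.
function readVespaEnv(env: Env): VespaSettings {
  const embeddingModel = env['EMBEDDING_MODEL']
  if (!embeddingModel) {
    throw new Error('EMBEDDING_MODEL is required')
  }
  return {
    embeddingModel,
    feedPort: parsePort(env['VESPA_FEED_PORT'], 8080), // assumed default
    queryPort: parsePort(env['VESPA_QUERY_PORT'], 8081), // assumed default
  }
}

// Usage in Node or Bun: const settings = readVespaEnv(process.env)
```

Failing fast on a missing EMBEDDING_MODEL surfaces the problem before any deployment step runs.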

Embedding Models

Choose one based on your needs:

| Model | Dimensions | Memory | Speed | Quality |
|---|---|---|---|---|
| bge-small-en-v1.5 | 384 | Low | Fast | Good |
| bge-base-en-v1.5 | 768 | Medium | Medium | Better |
| bge-large-en-v1.5 | 1024 | High | Slow | Best |

Initial Deployment

cd vespa

# 1. Set your embedding model
export EMBEDDING_MODEL=bge-base-en-v1.5

# 2. Deploy schemas
./deploy-docker.sh

# This will:
# - Download embedding models from HuggingFace
# - Replace DIMS placeholder with correct dimensions
# - Deploy all schemas to Vespa
# - Validate deployment
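
The DIMS replacement step can be pictured as a lookup followed by a text substitution. The sketch below is an illustration only and is not the contents of replaceDIMS.ts: the function names are hypothetical, and the schema line in the usage comment is an invented example of where a DIMS placeholder might appear. The dimensions come from the Embedding Models table.

```typescript
// Model name -> embedding dimensions, from the Embedding Models table.
const MODEL_DIMS = new Map<string, number>([
  ['bge-small-en-v1.5', 384],
  ['bge-base-en-v1.5', 768],
  ['bge-large-en-v1.5', 1024],
])

// Looks up the dimensions for a model and rejects unknown names.
function resolveDims(model: string): number {
  const dims = MODEL_DIMS.get(model)
  if (dims === undefined) {
    throw new Error(`Unknown embedding model: ${model}`)
  }
  return dims
}

// Replaces every standalone DIMS token in the schema text with the number.
// The \b anchors keep identifiers that merely contain "DIMS" untouched.
function replaceDimsPlaceholder(schemaText: string, model: string): string {
  return schemaText.replace(/\bDIMS\b/g, String(resolveDims(model)))
}

// Usage (hypothetical schema line):
// replaceDimsPlaceholder('field e type tensor<float>(v[DIMS])', 'bge-base-en-v1.5')
```

Because the dimension is baked into the deployed schema, switching models means rerunning the deployment so the placeholder is substituted again.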

Redeployment (Schema Updates)

cd vespa

# Deploy updated schemas
./deploy-docker.sh

# Note: Vespa will automatically restart and reload schemas

Client Usage

Connect to Vespa from your application:

import { VespaClient } from '@xyne/vespa-ts'

const client = new VespaClient({
  feedEndpoint: 'http://localhost:8080',
  queryEndpoint: 'http://localhost:8081'
})

// Index documents
await client.feed({
  schema: 'file',
  id: 'doc-1',
  fields: {
    title: 'My Document',
    content: 'Document content here'
  }
})

// Search documents
const results = await client.search({
  schema: 'file',
  query: 'document content'
})
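
For readers who want to see what such calls might look like on the wire, the sketch below builds requests for Vespa's standard HTTP APIs: Document v1 (`/document/v1/...`) for feeding and the Query API (`/search/`) for searching. Whether `@xyne/vespa-ts` issues exactly these requests is not confirmed by this README. The function names and the `my_namespace` namespace are assumptions made for illustration.

```typescript
// Plain description of an HTTP request; no network call is made here.
interface HttpRequest {
  method: 'POST'
  url: string
  body: string
}

// Document v1 API: POST /document/v1/<namespace>/<doctype>/docid/<id>
function buildFeedRequest(
  feedEndpoint: string,
  namespace: string, // hypothetical; use the namespace your deployment expects
  schema: string,
  id: string,
  fields: Record<string, unknown>
): HttpRequest {
  const path = ['document', 'v1', namespace, schema, 'docid', id]
    .map((part) => encodeURIComponent(part))
    .join('/')
  return {
    method: 'POST',
    url: `${feedEndpoint}/${path}`,
    body: JSON.stringify({ fields }),
  }
}

// Query API: POST /search/ with a YQL statement and the user's query text.
function buildQueryRequest(
  queryEndpoint: string,
  schema: string,
  query: string,
  hits = 10
): HttpRequest {
  return {
    method: 'POST',
    url: `${queryEndpoint}/search/`,
    body: JSON.stringify({
      yql: `select * from ${schema} where userQuery()`,
      query,
      hits,
    }),
  }
}
```

Both requests would be sent with a `Content-Type: application/json` header, the feed request to port 8080 and the query request to port 8081 in this deployment.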

Endpoints

| Endpoint | Port | Purpose | Used By |
|---|---|---|---|
| Feed | 8080 | Document ingestion | Applications |
| Query | 8081 | Search queries | Applications |
| Admin | 19071 | Schema deployment | Deployment scripts |
| Metrics | 19092 | Prometheus metrics | Monitoring |

Data Persistence

Data is stored in Docker volumes:

# List volumes
docker volume ls | grep vespa

# Backup data
docker run --rm -v vespa-core_vespa-data:/data -v $(pwd):/backup \
  alpine tar czf /backup/vespa-backup.tar.gz -C /data .

# Restore data
docker run --rm -v vespa-core_vespa-data:/data -v $(pwd):/backup \
  alpine tar xzf /backup/vespa-backup.tar.gz -C /data

Maintenance

Health Check

# Check Vespa status
curl http://localhost:19071/state/v1/health

# Check containers
curl http://localhost:8080/status.html  # Feed
curl http://localhost:8081/status.html  # Query
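
The same health check can be automated so that a script waits for Vespa instead of watching logs. The sketch below assumes the `/state/v1/health` response carries a `status.code` field that reads `up` when the service is healthy. `isUp` and `waitUntilUp` are hypothetical helper names, and the retry count and delay are arbitrary defaults.

```typescript
// The part of the /state/v1/health JSON payload this sketch relies on.
interface HealthResponse {
  status?: { code?: string }
}

// True when the payload reports the service as up.
function isUp(body: HealthResponse): boolean {
  return body.status?.code === 'up'
}

// Polls `probe` until it reports "up" or the attempts run out.
// `probe` and `sleep` are injected so the loop needs no network in tests.
async function waitUntilUp(
  probe: () => Promise<HealthResponse>,
  sleep: (ms: number) => Promise<void>,
  attempts = 30,
  delayMs = 2000
): Promise<boolean> {
  for (let i = 0; i < attempts; i++) {
    try {
      if (isUp(await probe())) return true
    } catch {
      // A failed probe (for example, connection refused during startup)
      // counts as "not ready yet" and is retried.
    }
    await sleep(delayMs)
  }
  return false
}

// Usage (assumes a runtime with global fetch, such as Node 18+ or Bun):
// const ready = await waitUntilUp(
//   () => fetch('http://localhost:19071/state/v1/health').then((r) => r.json()),
//   (ms) => new Promise((resolve) => setTimeout(resolve, ms))
// )
```

With the defaults, the loop gives up after roughly one minute, which matches the 30-60 second startup time mentioned in Quick Start.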

Logs

# View Vespa logs
docker logs vespa -f

# View specific component logs
docker exec vespa tail -f /opt/vespa/logs/vespa/vespa.log

Reindexing

If you change schema field types or need to rebuild indexes:

cd vespa
./reindex.sh

Networking

Docker Network

Vespa runs on the xyne bridge network. To allow other Docker containers to access it:

# In your application's docker-compose.yml
networks:
  xyne:
    external: true

Production Deployment

For production deployment:

Deploy this stack on a dedicated server
Ensure firewall rules allow inbound traffic on ports 8080 (feed) and 8081 (query)
Applications connect via the server's IP address
Use TLS/SSL for production traffic (configure reverse proxy)

Troubleshooting

Deployment Fails

# Check container status
docker ps -a | grep vespa

# Check logs for errors
docker logs vespa | grep -i error

# Verify health
curl http://localhost:19071/state/v1/health

Out of Memory

# Increase memory limit in docker-compose.yml
deploy:
  resources:
    limits:
      memory: 12G  # Increase from 6G

Schema Deployment Timeout

# Increase wait time in deploy-docker.sh
vespa deploy --wait 1800  # 30 minutes instead of 960 seconds

Cannot Connect from Application

# Check network connectivity
docker exec -it xyne-app ping vespa

# Check firewall rules (VPC)
telnet 10.0.2.50 8081

# Verify endpoints in application config
echo $VESPA_QUERY_URL

Performance Tuning

Memory Settings

Edit docker-compose.yml:

environment:
  - VESPA_CONFIGSERVER_JVMARGS=-Xms2g -Xmx32g -XX:+UseG1GC
  - VESPA_CONFIGPROXY_JVMARGS=-Xms1g -Xmx16g -XX:+UseG1GC

Thread Pool

Edit vespa/services.xml:

<config name="container.handler.threadpool">
  <maxthreads>16</maxthreads>  <!-- Increase based on CPU cores -->
</config>

Development Workflow

Starting Vespa

# Start Vespa container
docker compose -f docker-compose.dev.yml up -d

# Wait for Vespa to be ready
docker logs vespa -f

# Deploy schemas
cd vespa && ./deploy-docker.sh

Making Schema Changes

# Edit schemas in vespa/schemas/
vim vespa/schemas/file.sd

# Redeploy
cd vespa && ./deploy-docker.sh

Stopping Vespa

docker compose -f docker-compose.dev.yml down

Contributing

When adding new schemas:

Add .sd file to vespa/schemas/
Update vespa/services.xml to include new schema
Update this README's schema list
Redeploy: cd vespa && ./deploy-docker.sh
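
A forgotten services.xml entry is easy to miss, so a small check can compare the two before redeploying. The sketch below assumes schemas are registered in services.xml with `<document type="..."/>` elements, which is the usual Vespa convention but has not been verified against this repository's file. `findUnregisteredSchemas` is a hypothetical helper.

```typescript
// Returns schema names that have a .sd file but no <document type="..."> entry.
function findUnregisteredSchemas(
  schemaFiles: string[],
  servicesXml: string
): string[] {
  const registered = new Set<string>()
  // Matches <document ... type="name" ...>; the required whitespace after
  // "<document" keeps the enclosing <documents> element from matching.
  const pattern = /<document\s+[^>]*\btype="([^"]+)"/g
  let match: RegExpExecArray | null
  while ((match = pattern.exec(servicesXml)) !== null) {
    if (match[1]) registered.add(match[1])
  }
  return schemaFiles
    .filter((file) => file.endsWith('.sd'))
    .map((file) => file.slice(0, -3)) // strip the ".sd" suffix
    .filter((name) => !registered.has(name))
}

// Usage: pass the file names from vespa/schemas/ and the text of
// vespa/services.xml; an empty result means every schema is registered.
```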
