diff --git a/README.md b/README.md
index c6a7840bf..ff80740ba 100644
--- a/README.md
+++ b/README.md
@@ -80,7 +80,58 @@ Detect if the prompt is a jailbreak prompt, avoiding sending jailbreak prompts t
 
 ## Quick Start 🚀
 
-Get up and running in seconds with our interactive setup script:
+### Using VSR CLI (Recommended)
+
+The `vsr` CLI tool provides a unified interface for managing the vLLM Semantic Router across all environments. It reduces setup time from hours to minutes with intelligent auto-detection, comprehensive diagnostics, and beautiful CLI output.
+
+#### Installation
+
+```bash
+# Clone and build
+cd semantic-router/src/semantic-router
+make build-cli
+export PATH=$PATH:$(pwd)/bin
+
+# Verify installation
+vsr --version
+```
+
+#### Get Started in 4 Commands
+
+```bash
+vsr init                    # Initialize configuration
+make download-models        # Download AI models
+vsr config validate        # Validate setup
+vsr deploy docker          # Deploy with Docker Compose
+```
+
+#### Key Features
+
+- **Multi-Environment Support**: Deploy to Local, Docker, Kubernetes, or Helm
+- **Model Management**: Download, validate, list, and inspect models
+- **Health Monitoring**: Status checks, diagnostics, and health reports
+- **Debug Tools**: Interactive debugging and troubleshooting
+- **Dashboard Integration**: Auto-detect and open dashboard in browser
+- **Enhanced Logging**: Multi-environment log fetching with filtering
+
+#### Common Commands
+
+```bash
+vsr status                  # Check deployment status
+vsr logs --follow          # View logs in real-time
+vsr health                 # Quick health check
+vsr dashboard              # Open dashboard
+vsr model list             # List available models
+vsr debug                  # Run diagnostics
+vsr upgrade docker         # Upgrade deployment
+vsr undeploy docker        # Stop deployment
+```
+
+For complete CLI documentation, see [src/semantic-router/cmd/vsr/README.md](src/semantic-router/cmd/vsr/README.md) or [Quick Start Guide](src/semantic-router/cmd/vsr/QUICKSTART.md).
+
+### Using Quickstart Script
+
+Alternatively, get up and running in seconds with our interactive setup script:
 
 ```bash
 bash ./scripts/quickstart.sh
diff --git a/src/semantic-router/cmd/vsr/QUICKSTART.md b/src/semantic-router/cmd/vsr/QUICKSTART.md
new file mode 100644
index 000000000..c9ec0bcaf
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/QUICKSTART.md
@@ -0,0 +1,165 @@
+# VSR Quick Start Guide
+
+Get the vLLM Semantic Router up and running in minutes.
+
+## Prerequisites
+
+- **Go 1.21+** (for building)
+- **Docker** (for Docker deployments)
+- **kubectl** (for Kubernetes deployments)
+- **Helm** (for Helm deployments)
+
+## 1. Build VSR
+
+```bash
+cd semantic-router/src/semantic-router
+make build-cli
+export PATH=$PATH:$(pwd)/bin
+```
+
+## 2. Initialize Configuration
+
+```bash
+vsr init
+```
+
+This creates `config/config.yaml`. Edit it to configure your model and endpoints.
+
+## 3. Download Models
+
+```bash
+make download-models
+```
+
+## 4. Validate Configuration
+
+```bash
+vsr config validate
+```
+
+Fix any errors reported before proceeding.
+
+## 5. Deploy
+
+Choose your deployment environment:
+
+### Local (Development)
+
+```bash
+vsr deploy local
+```
+
+### Docker Compose (Recommended)
+
+```bash
+vsr deploy docker
+```
+
+### Kubernetes
+
+```bash
+vsr deploy kubernetes --namespace default
+```
+
+### Helm
+
+```bash
+vsr deploy helm --namespace default
+```
+
+## 6. Check Status
+
+```bash
+vsr status
+```
+
+## 7. Test the Router
+
+```bash
+vsr test-prompt "What is the weather today?"
+```
+
+## 8. View Logs
+
+```bash
+vsr logs --follow
+```
+
+## Common Commands
+
+| Command | Purpose |
+|---------|---------|
+| `vsr status` | Check deployment status |
+| `vsr logs` | View logs |
+| `vsr health` | Quick health check |
+| `vsr dashboard` | Open dashboard in browser |
+| `vsr model list` | List available models |
+| `vsr undeploy [env]` | Stop deployment |
+| `vsr upgrade [env]` | Upgrade to latest version |
+| `vsr debug` | Run diagnostics |
+
+## Troubleshooting
+
+### Configuration Issues
+
+```bash
+vsr config validate --verbose
+```
+
+### Deployment Issues
+
+```bash
+vsr debug
+```
+
+### Port Conflicts
+
+Check which ports are in use:
+
+```bash
+vsr debug
+```
+
+### Can't Connect to Dashboard
+
+```bash
+# For Docker/Local
+vsr dashboard
+
+# For Kubernetes/Helm
+vsr dashboard --namespace [your-namespace]
+```
+
+## Next Steps
+
+- Read the [full documentation](README.md) for advanced features
+- Learn about [model management](README.md#model-commands)
+- Explore [deployment options](README.md#-deployment)
+- Set up [monitoring and metrics](README.md#monitoring-commands)
+
+## Getting Help
+
+```bash
+# General help
+vsr --help
+
+# Command-specific help
+vsr deploy --help
+vsr model --help
+```
+
+## Quick Reference
+
+```bash
+# Full workflow
+vsr init                        # Initialize config
+make download-models            # Download models
+vsr config validate            # Validate
+vsr deploy docker              # Deploy
+vsr status                     # Check status
+vsr test-prompt "hello"        # Test
+vsr logs --follow              # Monitor
+vsr undeploy docker            # Clean up
+```
+
+For complete documentation, see [README.md](README.md).
diff --git a/src/semantic-router/cmd/vsr/README.md b/src/semantic-router/cmd/vsr/README.md
new file mode 100644
index 000000000..3b1b0870a
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/README.md
@@ -0,0 +1,719 @@
+# VSR - vLLM Semantic Router CLI
+
+[![Go Version](https://img.shields.io/badge/Go-1.21+-00ADD8?style=flat&logo=go)](https://golang.org/doc/install)
+[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](LICENSE)
+
+VSR is a comprehensive command-line tool for managing the vLLM Semantic Router. It reduces setup time from hours to minutes and provides a unified interface for deployment, monitoring, and troubleshooting across multiple environments.
+
+## 🚀 Quick Start
+
+```bash
+# Initialize configuration
+vsr init
+
+# Validate configuration
+vsr config validate
+
+# Deploy locally
+vsr deploy local
+
+# Check status
+vsr status
+
+# Test a prompt
+vsr test-prompt "What is the weather today?"
+
+# View logs
+vsr logs --follow
+```
+
+## 📋 Table of Contents
+
+- [Features](#-features)
+- [Installation](#-installation)
+- [Configuration](#️-configuration)
+- [Deployment](#-deployment)
+- [Commands](#-commands)
+- [Workflows](#-common-workflows)
+- [Troubleshooting](#-troubleshooting)
+- [Contributing](#-contributing)
+
+## ✨ Features
+
+### 🎯 Core Features
+
+- **Multi-Environment Deployment**: Support for Local, Docker Compose, Kubernetes, and Helm
+- **Lifecycle Management**: Deploy, undeploy, upgrade, start, stop, restart
+- **Model Management**: Download, list, validate, remove, and inspect models
+- **Health Monitoring**: Status checks, health monitoring, and diagnostics
+- **Debug Tools**: Interactive debugging, health checks, and diagnostic reports
+
+### 🔧 Advanced Features
+
+- **Enhanced Logging**: Multi-environment log fetching with filtering and following
+- **Dashboard Integration**: Auto-detect and open dashboard in browser
+- **Metrics Display**: View request counts, latency, and model usage
+- **Configuration Validation**: Pre-deployment config validation
+- **Port Forwarding**: Automatic port-forwarding for Kubernetes/Helm deployments
+
+### 🎨 User Experience
+
+- **Beautiful CLI Output**: Box drawing, colors, and status symbols
+- **Smart Auto-Detection**: Automatically detects deployment types
+- **Helpful Error Messages**: Actionable suggestions for every error
+- **Comprehensive Help**: Detailed help text with examples for every command
+- **Progress Indicators**: Visual feedback for long-running operations
+
+## 📦 Installation
+
+### Prerequisites
+
+- **Go 1.21+** (for building from source)
+- **kubectl** (optional, for Kubernetes deployments)
+- **docker** (optional, for Docker deployments)
+- **helm** (optional, for Helm deployments)
+- **make** (optional, for building and downloading models)
+
+### From Source
+
+```bash
+# Clone the repository
+git clone https://github.com/vllm-project/semantic-router.git
+cd semantic-router/src/semantic-router
+
+# Build the CLI
+make build-cli
+
+# Or use go directly
+go build -o bin/vsr ./cmd/vsr
+
+# Add to PATH
+export PATH=$PATH:$(pwd)/bin
+
+# Verify installation
+vsr --version
+```
+
+### Using Pre-built Binary
+
+```bash
+# Download the latest release
+wget https://github.com/vllm-project/semantic-router/releases/latest/download/vsr-linux-amd64
+
+# Make executable
+chmod +x vsr-linux-amd64
+mv vsr-linux-amd64 /usr/local/bin/vsr
+
+# Verify installation
+vsr --version
+```
+
+## ⚙️ Configuration
+
+### Initialize Configuration
+
+```bash
+# Create a new configuration file
+vsr init
+
+# Create with template
+vsr init --template basic
+
+# Specify output location
+vsr init --output config/my-config.yaml
+```
+
+### Validate Configuration
+
+```bash
+# Validate configuration file
+vsr config validate
+
+# Validate specific file
+vsr config validate --config path/to/config.yaml
+
+# Validate and show details
+vsr config validate --verbose
+```
+
+### Configuration File Structure
+
+```yaml
+# config/config.yaml
+bert_model:
+  model_id: "your-model-id"
+  threshold: 0.8
+
+vllm_endpoints:
+  - name: "primary"
+    address: "127.0.0.1"
+    port: 8000
+
+model_config:
+  your-model-id:
+    pricing:
+      prompt: 0.01
+      completion: 0.02
+
+default_model: "your-model-id"
+```
+
+## 🚢 Deployment
+
+### Local Deployment
+
+```bash
+# Deploy locally (runs as background process)
+vsr deploy local
+
+# Deploy with custom config
+vsr deploy local --config custom-config.yaml
+
+# Check status
+vsr status
+
+# Stop
+vsr undeploy local
+```
+
+### Docker Compose Deployment
+
+```bash
+# Deploy with Docker Compose
+vsr deploy docker
+
+# Deploy with observability disabled
+vsr deploy docker --with-observability=false
+
+# Stop and remove volumes
+vsr undeploy docker --volumes
+```
+
+### Kubernetes Deployment
+
+```bash
+# Deploy to Kubernetes
+vsr deploy kubernetes
+
+# Deploy to specific namespace
+vsr deploy kubernetes --namespace production
+
+# Check status
+vsr status --namespace production
+
+# Undeploy and wait for cleanup
+vsr undeploy kubernetes --namespace production --wait
+```
+
+### Helm Deployment
+
+```bash
+# Deploy using Helm
+vsr deploy helm
+
+# Deploy with custom release name
+vsr deploy helm --release-name my-router --namespace production
+
+# Deploy with custom values
+vsr deploy helm --set replicas=3 --set resources.memory=4Gi
+
+# Upgrade release
+vsr upgrade helm --namespace production
+
+# Undeploy
+vsr undeploy helm --namespace production --wait
+```
+
+## 📖 Commands
+
+### Deployment Commands
+
+| Command | Description |
+|---------|-------------|
+| `vsr deploy [env]` | Deploy router to specified environment |
+| `vsr undeploy [env]` | Remove router deployment |
+| `vsr upgrade [env]` | Upgrade router to latest version |
+| `vsr status` | Check router and components status |
+| `vsr start` | Start router service (deprecated) |
+| `vsr stop` | Stop router service (deprecated) |
+| `vsr restart` | Restart router service (deprecated) |
+
+### Configuration Commands
+
+| Command | Description |
+|---------|-------------|
+| `vsr init` | Initialize new configuration file |
+| `vsr config validate` | Validate configuration |
+| `vsr config view` | View current configuration |
+| `vsr config set [key] [value]` | Set configuration value |
+
+### Model Commands
+
+| Command | Description |
+|---------|-------------|
+| `vsr model list` | List all models |
+| `vsr model info [id]` | Show model details |
+| `vsr model validate [id]` | Validate model integrity |
+| `vsr model remove [id]` | Remove downloaded model |
+| `vsr model download` | Download models |
+
+### Monitoring Commands
+
+| Command | Description |
+|---------|-------------|
+| `vsr logs` | Fetch router logs |
+| `vsr status` | Check deployment status |
+| `vsr health` | Quick health check |
+| `vsr metrics` | Display router metrics |
+| `vsr dashboard` | Open dashboard in browser |
+
+### Debug Commands
+
+| Command | Description |
+|---------|-------------|
+| `vsr debug` | Run interactive debugging session |
+| `vsr health` | Perform health check |
+| `vsr diagnose` | Generate diagnostic report |
+
+### Other Commands
+
+| Command | Description |
+|---------|-------------|
+| `vsr test-prompt [text]` | Send test prompt to router |
+| `vsr install` | Install semantic router |
+| `vsr get [resource]` | Get resource information |
+
+## 🔄 Common Workflows
+
+### First-Time Setup
+
+```bash
+# 1. Initialize configuration
+vsr init
+
+# 2. Download models
+make download-models
+
+# 3. Validate configuration
+vsr config validate
+
+# 4. Deploy locally for testing
+vsr deploy local
+
+# 5. Test with a prompt
+vsr test-prompt "Hello, router!"
+
+# 6. Check status and logs
+vsr status
+vsr logs --tail 50
+```
+
+### Development Workflow
+
+```bash
+# Start local deployment
+vsr deploy local
+
+# Make code changes
+# ...
+
+# Upgrade deployment
+vsr upgrade local --force
+
+# View logs in real-time
+vsr logs --follow
+
+# Test changes
+vsr test-prompt "Test prompt"
+
+# Stop when done
+vsr undeploy local
+```
+
+### Production Deployment
+
+```bash
+# 1. Validate configuration
+vsr config validate
+
+# 2. Run diagnostics
+vsr debug
+
+# 3. Deploy to Kubernetes
+vsr deploy kubernetes --namespace production
+
+# 4. Verify deployment
+vsr status --namespace production
+vsr health
+
+# 5. Monitor
+vsr logs --namespace production --follow
+vsr metrics --watch
+
+# 6. Access dashboard
+vsr dashboard --namespace production
+```
+
+### Troubleshooting Workflow
+
+```bash
+# 1. Check health
+vsr health
+
+# 2. Run full diagnostics
+vsr debug
+
+# 3. Check deployment status
+vsr status
+
+# 4. View recent logs
+vsr logs --tail 100 --grep error
+
+# 5. Generate diagnostic report
+vsr diagnose --output diagnostics.txt
+
+# 6. Validate models
+vsr model validate --all
+
+# 7. Check specific components
+vsr logs --component router --since 10m
+```
+
+### Upgrade Workflow
+
+```bash
+# 1. Check current status
+vsr status
+
+# 2. Backup configuration
+cp config/config.yaml config/config.yaml.bak
+
+# 3. Pull latest changes
+git pull origin main
+
+# 4. Rebuild
+make build-cli
+
+# 5. Upgrade deployment
+vsr upgrade kubernetes --namespace production --wait
+
+# 6. Verify upgrade
+vsr health
+vsr logs --tail 50
+
+# 7. Rollback if needed
+git checkout <previous-commit>
+vsr upgrade kubernetes --namespace production
+```
+
+## 🔍 Troubleshooting
+
+### Common Issues
+
+#### 1. Configuration Validation Fails
+
+```bash
+# Check what's wrong
+vsr config validate --verbose
+
+# Common issues:
+# - Missing required fields
+# - Invalid YAML syntax
+# - Model references not found
+# - Invalid endpoint addresses
+
+# Solution: Fix the issues and validate again
+vsr config validate
+```
+
+#### 2. Models Not Found
+
+```bash
+# Check model status
+vsr model list
+
+# Download models
+make download-models
+
+# Or manually download specific model
+# (future feature)
+vsr model download [model-id]
+
+# Validate models
+vsr model validate --all
+```
+
+#### 3. Deployment Fails
+
+```bash
+# Run diagnostics
+vsr debug
+
+# Check prerequisites
+# - kubectl installed? (for K8s)
+# - docker running? (for Docker)
+# - helm installed? (for Helm)
+
+# Check resources
+# - Disk space available?
+# - Ports available?
+# - Network connectivity?
+
+# View detailed logs
+vsr logs --tail 100
+```
+
+#### 4. Port Already in Use
+
+```bash
+# Check which ports are in use
+vsr debug
+
+# Find process using port
+netstat -tulpn | grep 8080
+
+# Kill process or use different port
+# (configure in config.yaml)
+```
+
+#### 5. Kubernetes Deployment Issues
+
+```bash
+# Check cluster connection
+kubectl cluster-info
+
+# Check namespace
+kubectl get namespaces
+
+# Check pods
+kubectl get pods -n [namespace]
+
+# View pod logs
+kubectl logs -n [namespace] [pod-name]
+
+# Or use vsr
+vsr logs --namespace [namespace] --follow
+```
+
+### Debug Mode
+
+```bash
+# Run comprehensive diagnostics
+vsr debug
+
+# This checks:
+# ✓ Prerequisites (Go, kubectl, docker, helm, make)
+# ✓ Configuration (file exists, valid YAML, passes validation)
+# ✓ Models (directory exists, models downloaded)
+# ✓ Resources (disk space, port availability)
+# ✓ Connectivity (endpoint reachability)
+
+# Provides recommendations based on failures
+```
+
+### Health Check
+
+```bash
+# Quick health check
+vsr health
+
+# Status indicators:
+# 🟢 GOOD - All systems operational
+# 🟡 DEGRADED - Environment ready, router not running
+# 🔴 POOR - Critical issues detected
+```
+
+### Getting Help
+
+```bash
+# General help
+vsr --help
+
+# Command-specific help
+vsr deploy --help
+vsr model list --help
+
+# View examples
+vsr upgrade --help  # Shows examples in help text
+```
+
+## 📊 Advanced Features
+
+### Log Filtering
+
+```bash
+# Filter by component
+vsr logs --component router
+
+# Filter by time
+vsr logs --since 10m
+vsr logs --since 1h
+
+# Filter by pattern
+vsr logs --grep error
+vsr logs --grep "HTTP 500"
+
+# Combine filters
+vsr logs --component router --since 10m --grep error --follow
+```
+
+### Multi-Format Output
+
+```bash
+# JSON output
+vsr model list --output json
+
+# YAML output
+vsr model list --output yaml
+
+# Table output (default)
+vsr model list --output table
+```
+
+### Environment Variables
+
+```bash
+# Set default config path
+export VSR_CONFIG=config/production.yaml
+
+# Set default namespace
+export VSR_NAMESPACE=production
+
+# Enable verbose output
+export VSR_VERBOSE=true
+
+# Use in commands
+vsr deploy kubernetes  # Uses VSR_CONFIG and VSR_NAMESPACE
+```
+
+### Shell Completion
+
+```bash
+# Generate bash completion
+vsr completion bash > /etc/bash_completion.d/vsr
+
+# Generate zsh completion
+vsr completion zsh > "${fpath[1]}/_vsr"
+
+# Generate fish completion
+vsr completion fish > ~/.config/fish/completions/vsr.fish
+
+# Source completion
+source <(vsr completion bash)
+```
+
+## 🏗️ Architecture
+
+### Command Structure
+
+```
+vsr
+├── config          # Configuration management
+├── deploy          # Deployment operations
+├── undeploy        # Removal operations
+├── upgrade         # Upgrade operations
+├── status          # Status checking
+├── logs            # Log fetching
+├── model           # Model management
+│   ├── list
+│   ├── info
+│   ├── validate
+│   ├── remove
+│   └── download
+├── debug           # Debugging tools
+├── health          # Health checking
+├── diagnose        # Diagnostics
+├── dashboard       # Dashboard access
+├── metrics         # Metrics display
+├── test-prompt     # Testing
+├── install         # Installation
+├── init            # Initialization
+└── get             # Resource querying
+```
+
+### Deployment Detection
+
+VSR automatically detects active deployments:
+
+1. **Local**: Checks for PID file at `/tmp/vsr-local-deployment.pid`
+2. **Docker**: Queries Docker for containers matching `semantic-router`
+3. **Kubernetes**: Queries kubectl for pods with label `app=semantic-router`
+4. **Helm**: Lists Helm releases containing `semantic-router`
+
+### Model Discovery
+
+VSR discovers models using intelligent architecture selection:
+
+1. **Priority**: BERT → RoBERTa → ModernBERT
+2. **Types**: LoRA models (preferred) or Legacy models
+3. **Categories**: Intent, PII, Security classifiers
+
+## 🤝 Contributing
+
+### Development Setup
+
+```bash
+# Clone repository
+git clone https://github.com/vllm-project/semantic-router.git
+cd semantic-router/src/semantic-router
+
+# Install dependencies
+go mod download
+
+# Build
+make build-cli
+
+# Run tests
+go test ./cmd/vsr/commands/... -v
+go test ./pkg/cli/... -v
+
+# Run linting
+golangci-lint run
+```
+
+### Adding a New Command
+
+1. Create command file in `cmd/vsr/commands/`
+2. Implement `New[Command]Cmd() *cobra.Command`
+3. Add command to `main.go`
+4. Add help text and examples
+5. Write tests
+6. Update documentation
+
+### Code Style
+
+- Follow Go best practices
+- Use Cobra patterns for commands
+- Include comprehensive help text
+- Add examples to help text
+- Write table-driven tests
+- Use existing CLI utilities (`pkg/cli`)
+
+## 📝 License
+
+Apache License 2.0 - See [LICENSE](../../LICENSE) for details.
+
+## 🔗 Links
+
+- [Main Repository](https://github.com/vllm-project/semantic-router)
+- [Documentation](https://docs.vllm-project.com)
+- [Issue Tracker](https://github.com/vllm-project/semantic-router/issues)
+- [Discussions](https://github.com/vllm-project/semantic-router/discussions)
+
+## 📮 Support
+
+- **Issues**: [GitHub Issues](https://github.com/vllm-project/semantic-router/issues)
+- **Discussions**: [GitHub Discussions](https://github.com/vllm-project/semantic-router/discussions)
+- **Email**: support@vllm-project.com
+
+## 🙏 Acknowledgments
+
+Built with:
+
+- [Cobra](https://github.com/spf13/cobra) - CLI framework
+- [vLLM](https://github.com/vllm-project/vllm) - Inference engine
+- [ModernBERT](https://huggingface.co/answerdotai/ModernBERT-base) - Model architecture
+
+---
+
+**Made with ❤️ by the vLLM Semantic Router team**
diff --git a/src/semantic-router/cmd/vsr/commands/completion.go b/src/semantic-router/cmd/vsr/commands/completion.go
new file mode 100644
index 000000000..d1c69a20f
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/completion.go
@@ -0,0 +1,82 @@
+package commands
+
+import (
+	"os"
+
+	"github.com/spf13/cobra"
+)
+
+// NewCompletionCmd creates the completion command
+func NewCompletionCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "completion [bash|zsh|fish|powershell]",
+		Short: "Generate shell completion script",
+		Long: `Generate shell completion script for VSR.
+
+To load completions:
+
+Bash:
+  # Linux:
+  $ vsr completion bash > /etc/bash_completion.d/vsr
+
+  # macOS:
+  $ vsr completion bash > /usr/local/etc/bash_completion.d/vsr
+
+  # Current session:
+  $ source <(vsr completion bash)
+
+Zsh:
+  # If shell completion is not already enabled:
+  $ echo "autoload -U compinit; compinit" >> ~/.zshrc
+
+  # Generate completion:
+  $ vsr completion zsh > "${fpath[1]}/_vsr"
+
+  # Current session:
+  $ source <(vsr completion zsh)
+
+Fish:
+  $ vsr completion fish > ~/.config/fish/completions/vsr.fish
+
+  # Current session:
+  $ vsr completion fish | source
+
+PowerShell:
+  PS> vsr completion powershell | Out-String | Invoke-Expression
+
+  # To load completions for every session:
+  PS> vsr completion powershell > vsr.ps1
+  # And source this file from your PowerShell profile.
+
+Examples:
+  # Generate bash completion
+  vsr completion bash
+
+  # Generate zsh completion and save to file
+  vsr completion zsh > /usr/local/share/zsh/site-functions/_vsr
+
+  # Generate fish completion
+  vsr completion fish > ~/.config/fish/completions/vsr.fish
+
+  # Generate PowerShell completion
+  vsr completion powershell > vsr.ps1`,
+		DisableFlagsInUseLine: true,
+		ValidArgs:             []string{"bash", "zsh", "fish", "powershell"},
+		Args:                  cobra.ExactArgs(1),
+		RunE: func(cmd *cobra.Command, args []string) error {
+			switch args[0] {
+			case "bash":
+				return cmd.Root().GenBashCompletion(os.Stdout)
+			case "zsh":
+				return cmd.Root().GenZshCompletion(os.Stdout)
+			case "fish":
+				return cmd.Root().GenFishCompletion(os.Stdout, true)
+			case "powershell":
+				return cmd.Root().GenPowerShellCompletion(os.Stdout)
+			}
+			return nil
+		},
+	}
+
+	return cmd
+}
diff --git a/src/semantic-router/cmd/vsr/commands/completion_test.go b/src/semantic-router/cmd/vsr/commands/completion_test.go
new file mode 100644
index 000000000..acbf62766
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/completion_test.go
@@ -0,0 +1,132 @@
+package commands
+
+import (
+	"testing"
+
+	"github.com/spf13/cobra"
+)
+
+func TestCompletionCommandStructure(t *testing.T) {
+	tests := []struct {
+		name           string
+		expectedUse    string
+		expectedShort  string
+		validArgs      []string
+		validArgsCount int
+	}{
+		{
+			name:           "completion command has correct structure",
+			expectedUse:    "completion [bash|zsh|fish|powershell]",
+			expectedShort:  "Generate shell completion script",
+			validArgs:      []string{"bash", "zsh", "fish", "powershell"},
+			validArgsCount: 4,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewCompletionCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+
+			if len(cmd.ValidArgs) != tt.validArgsCount {
+				t.Errorf("expected %d valid args, got %d", tt.validArgsCount, len(cmd.ValidArgs))
+			}
+
+			// Verify valid args
+			for _, expectedArg := range tt.validArgs {
+				found := false
+				for _, validArg := range cmd.ValidArgs {
+					if validArg == expectedArg {
+						found = true
+						break
+					}
+				}
+				if !found {
+					t.Errorf("expected valid arg %q not found", expectedArg)
+				}
+			}
+		})
+	}
+}
+
+func TestCompletionCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "completion bash",
+			args:      []string{"completion", "bash"},
+			wantError: false,
+		},
+		{
+			name:      "completion zsh",
+			args:      []string{"completion", "zsh"},
+			wantError: false,
+		},
+		{
+			name:      "completion fish",
+			args:      []string{"completion", "fish"},
+			wantError: false,
+		},
+		{
+			name:      "completion powershell",
+			args:      []string{"completion", "powershell"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+
+			completionCmd := NewCompletionCmd()
+			rootCmd.AddCommand(completionCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, err := rootCmd.ExecuteC()
+
+			if tt.wantError && err == nil {
+				t.Error("expected error, got nil")
+			}
+			if !tt.wantError && err != nil {
+				t.Errorf("unexpected error: %v", err)
+			}
+		})
+	}
+}
+
+func TestCompletionCommandRequiresShell(t *testing.T) {
+	rootCmd := &cobra.Command{Use: "vsr"}
+	completionCmd := NewCompletionCmd()
+	rootCmd.AddCommand(completionCmd)
+
+	rootCmd.SetArgs([]string{"completion"})
+	_, err := rootCmd.ExecuteC()
+
+	if err == nil {
+		t.Error("expected error when no shell specified, got nil")
+	}
+}
+
+func TestCompletionCommandInvalidShell(t *testing.T) {
+	rootCmd := &cobra.Command{Use: "vsr"}
+	completionCmd := NewCompletionCmd()
+	rootCmd.AddCommand(completionCmd)
+
+	rootCmd.SetArgs([]string{"completion", "invalid-shell"})
+	_, err := rootCmd.ExecuteC()
+
+	// Cobra will return an error for invalid arg, which is expected
+	if err == nil {
+		t.Error("expected error for invalid shell, got nil")
+	}
+}
diff --git a/src/semantic-router/cmd/vsr/commands/config.go b/src/semantic-router/cmd/vsr/commands/config.go
new file mode 100644
index 000000000..f9be551ca
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/config.go
@@ -0,0 +1,237 @@
+package commands
+
+import (
+	"fmt"
+	"os"
+	"os/exec"
+	"strings"
+
+	"github.com/spf13/cobra"
+	"gopkg.in/yaml.v3"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/config"
+)
+
+// NewConfigCmd creates the config command
+func NewConfigCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "config",
+		Short: "Manage router configuration",
+		Long: `View, edit, validate, and modify router configuration files.
+
+The config command provides subcommands for managing your router's YAML configuration:
+  view      - Display the current configuration
+  edit      - Open configuration in your editor
+  validate  - Validate configuration file syntax and semantics
+  set       - Set specific configuration values
+  get       - Retrieve specific configuration values`,
+	}
+
+	cmd.AddCommand(newConfigViewCmd())
+	cmd.AddCommand(newConfigEditCmd())
+	cmd.AddCommand(newConfigValidateCmd())
+	cmd.AddCommand(newConfigSetCmd())
+	cmd.AddCommand(newConfigGetCmd())
+
+	return cmd
+}
+
+func newConfigViewCmd() *cobra.Command {
+	return &cobra.Command{
+		Use:   "view",
+		Short: "Display current configuration",
+		RunE: func(cmd *cobra.Command, args []string) error {
+			configPath := cmd.Parent().Parent().Flag("config").Value.String()
+
+			// Read the config file
+			data, err := os.ReadFile(configPath)
+			if err != nil {
+				return fmt.Errorf("failed to read config: %w", err)
+			}
+
+			outputFormat := cmd.Parent().Parent().Flag("output").Value.String()
+
+			switch outputFormat {
+			case "json":
+				// Convert YAML to JSON for output
+				var yamlData interface{}
+				if err := yaml.Unmarshal(data, &yamlData); err != nil {
+					return fmt.Errorf("failed to parse config: %w", err)
+				}
+				return cli.PrintJSON(yamlData)
+			case "yaml", "table":
+				// Just print the raw YAML
+				fmt.Println(string(data))
+				return nil
+			default:
+				return fmt.Errorf("unsupported output format: %s", outputFormat)
+			}
+		},
+	}
+}
+
+func newConfigEditCmd() *cobra.Command {
+	return &cobra.Command{
+		Use:   "edit",
+		Short: "Edit configuration in your default editor",
+		RunE: func(cmd *cobra.Command, args []string) error {
+			configPath := cmd.Parent().Parent().Flag("config").Value.String()
+
+			editor := os.Getenv("EDITOR")
+			if editor == "" {
+				editor = "vi" // fallback to vi
+			}
+
+			editorCmd := exec.Command(editor, configPath)
+			editorCmd.Stdin = os.Stdin
+			editorCmd.Stdout = os.Stdout
+			editorCmd.Stderr = os.Stderr
+
+			if err := editorCmd.Run(); err != nil {
+				return fmt.Errorf("failed to run editor: %w", err)
+			}
+
+			cli.Success(fmt.Sprintf("Configuration edited: %s", configPath))
+			cli.Warning("Remember to validate your changes with: vsr config validate")
+			return nil
+		},
+	}
+}
+
+func newConfigValidateCmd() *cobra.Command {
+	return &cobra.Command{
+		Use:   "validate",
+		Short: "Validate configuration file",
+		RunE: func(cmd *cobra.Command, args []string) error {
+			configPath := cmd.Parent().Parent().Flag("config").Value.String()
+
+			// Parse the configuration
+			cfg, err := config.Parse(configPath)
+			if err != nil {
+				cli.Error(fmt.Sprintf("Validation failed: %v", err))
+				return err
+			}
+
+			// Perform additional semantic validation
+			if err := cli.ValidateConfig(cfg); err != nil {
+				cli.Error(fmt.Sprintf("Semantic validation failed: %v", err))
+				return err
+			}
+
+			cli.Success(fmt.Sprintf("Configuration is valid: %s", configPath))
+			return nil
+		},
+	}
+}
+
+func newConfigSetCmd() *cobra.Command {
+	return &cobra.Command{
+		Use:   "set <key> <value>",
+		Short: "Set a configuration value",
+		Args:  cobra.ExactArgs(2),
+		Example: `  vsr config set bert_model.threshold 0.7
+  vsr config set default_model my-model`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			configPath := cmd.Parent().Parent().Flag("config").Value.String()
+			key := args[0]
+			value := args[1]
+
+			// Read current config
+			data, err := os.ReadFile(configPath)
+			if err != nil {
+				return fmt.Errorf("failed to read config: %w", err)
+			}
+
+			var configData map[string]interface{}
+			if unmarshalErr := yaml.Unmarshal(data, &configData); unmarshalErr != nil {
+				return fmt.Errorf("failed to parse config: %w", unmarshalErr)
+			}
+
+			// Set the value using dot notation
+			if setErr := setNestedValue(configData, key, value); setErr != nil {
+				return setErr
+			}
+
+			// Write back to file
+			newData, err := yaml.Marshal(configData)
+			if err != nil {
+				return fmt.Errorf("failed to serialize config: %w", err)
+			}
+
+			if err := os.WriteFile(configPath, newData, 0o644); err != nil {
+				return fmt.Errorf("failed to write config: %w", err)
+			}
+
+			cli.Success(fmt.Sprintf("Set %s = %s", key, value))
+			cli.Warning("Validate changes with: vsr config validate")
+			return nil
+		},
+	}
+}
+
+func newConfigGetCmd() *cobra.Command {
+	return &cobra.Command{
+		Use:   "get <key>",
+		Short: "Get a configuration value",
+		Args:  cobra.ExactArgs(1),
+		Example: `  vsr config get bert_model.threshold
+  vsr config get default_model`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			configPath := cmd.Parent().Parent().Flag("config").Value.String()
+			key := args[0]
+
+			// Read config
+			data, err := os.ReadFile(configPath)
+			if err != nil {
+				return fmt.Errorf("failed to read config: %w", err)
+			}
+
+			var configData map[string]interface{}
+			if unmarshalErr := yaml.Unmarshal(data, &configData); unmarshalErr != nil {
+				return fmt.Errorf("failed to parse config: %w", unmarshalErr)
+			}
+
+			// Get the value
+			value, err := getNestedValue(configData, key)
+			if err != nil {
+				return err
+			}
+
+			fmt.Printf("%s: %v\n", key, value)
+			return nil
+		},
+	}
+}
+
+// Helper functions for nested key access
+func setNestedValue(data map[string]interface{}, key string, value string) error {
+	keys := strings.Split(key, ".")
+	current := data
+
+	for i := 0; i < len(keys)-1; i++ {
+		if next, ok := current[keys[i]].(map[string]interface{}); ok {
+			current = next
+		} else {
+			return fmt.Errorf("key not found: %s", strings.Join(keys[:i+1], "."))
+		}
+	}
+
+	current[keys[len(keys)-1]] = value
+	return nil
+}
+
+func getNestedValue(data map[string]interface{}, key string) (interface{}, error) {
+	keys := strings.Split(key, ".")
+	var current interface{} = data
+
+	for _, k := range keys {
+		if m, ok := current.(map[string]interface{}); ok {
+			current = m[k]
+		} else {
+			return nil, fmt.Errorf("key not found: %s", key)
+		}
+	}
+
+	return current, nil
+}
diff --git a/src/semantic-router/cmd/vsr/commands/config_test.go b/src/semantic-router/cmd/vsr/commands/config_test.go
new file mode 100644
index 000000000..df4fc52b3
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/config_test.go
@@ -0,0 +1,339 @@
+package commands
+
+import (
+	"os"
+	"path/filepath"
+	"testing"
+
+	"github.com/spf13/cobra"
+)
+
+func TestConfigCommandStructure(t *testing.T) {
+	tests := []struct {
+		name            string
+		expectedUse     string
+		expectedShort   string
+		subcommandCount int
+		subcommands     []string
+	}{
+		{
+			name:            "config command has correct structure",
+			expectedUse:     "config",
+			expectedShort:   "Manage router configuration",
+			subcommandCount: 5,
+			subcommands:     []string{"view", "edit", "validate", "set", "get"},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewConfigCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+
+			if len(cmd.Commands()) != tt.subcommandCount {
+				t.Errorf("expected %d subcommands, got %d", tt.subcommandCount, len(cmd.Commands()))
+			}
+
+			// Verify subcommands exist
+			for _, subcmd := range tt.subcommands {
+				found := false
+				for _, c := range cmd.Commands() {
+					if c.Use == subcmd {
+						found = true
+						break
+					}
+				}
+				if !found {
+					t.Errorf("expected subcommand %q not found", subcmd)
+				}
+			}
+		})
+	}
+}
+
+func TestConfigViewCmd(t *testing.T) {
+	// Create temporary config file
+	tmpDir := t.TempDir()
+	configPath := filepath.Join(tmpDir, "config.yaml")
+	configContent := `bert_model:
+  model_id: "test-model"
+  threshold: 0.8
+
+default_model: "test-model"
+`
+	if err := os.WriteFile(configPath, []byte(configContent), 0o644); err != nil {
+		t.Fatalf("Failed to create test config: %v", err)
+	}
+
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "view config with yaml format",
+			args:      []string{"config", "view", "-c", configPath, "-o", "yaml"},
+			wantError: false,
+		},
+		{
+			name:      "view config with table format",
+			args:      []string{"config", "view", "-c", configPath, "-o", "table"},
+			wantError: false,
+		},
+		{
+			name:      "view config with json format",
+			args:      []string{"config", "view", "-c", configPath, "-o", "json"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config/config.yaml", "Path to configuration file")
+			rootCmd.PersistentFlags().StringP("output", "o", "table", "Output format")
+
+			configCmd := NewConfigCmd()
+			rootCmd.AddCommand(configCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, err := rootCmd.ExecuteC()
+
+			if tt.wantError && err == nil {
+				t.Error("expected error, got nil")
+			}
+			if !tt.wantError && err != nil {
+				t.Errorf("unexpected error: %v", err)
+			}
+		})
+	}
+}
+
+func TestConfigValidateCmd(t *testing.T) {
+	tmpDir := t.TempDir()
+
+	tests := []struct {
+		name          string
+		configContent string
+		wantError     bool
+	}{
+		{
+			name: "valid config",
+			configContent: `bert_model:
+  model_id: "test-model"
+  threshold: 0.8
+
+vllm_endpoints:
+  - name: "primary"
+    address: "127.0.0.1"
+    port: 8000
+
+default_model: "test-model"
+`,
+			wantError: false,
+		},
+		{
+			name:          "invalid yaml syntax",
+			configContent: `bert_model: [invalid yaml`,
+			wantError:     true,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			configPath := filepath.Join(tmpDir, tt.name+".yaml")
+			if err := os.WriteFile(configPath, []byte(tt.configContent), 0o644); err != nil {
+				t.Fatalf("Failed to create test config: %v", err)
+			}
+
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", configPath, "Path to configuration file")
+
+			configCmd := NewConfigCmd()
+			rootCmd.AddCommand(configCmd)
+
+			rootCmd.SetArgs([]string{"config", "validate", "-c", configPath})
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestConfigSetGetCmd(t *testing.T) {
+	tmpDir := t.TempDir()
+	configPath := filepath.Join(tmpDir, "config.yaml")
+	configContent := `bert_model:
+  model_id: "test-model"
+  threshold: 0.8
+
+default_model: "test-model"
+`
+	if err := os.WriteFile(configPath, []byte(configContent), 0o644); err != nil {
+		t.Fatalf("Failed to create test config: %v", err)
+	}
+
+	tests := []struct {
+		name      string
+		command   string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "set top-level value",
+			command:   "set",
+			args:      []string{"config", "set", "default_model", "new-model", "-c", configPath},
+			wantError: false,
+		},
+		{
+			name:      "set nested value",
+			command:   "set",
+			args:      []string{"config", "set", "bert_model.threshold", "0.9", "-c", configPath},
+			wantError: false,
+		},
+		{
+			name:      "get top-level value",
+			command:   "get",
+			args:      []string{"config", "get", "default_model", "-c", configPath},
+			wantError: false,
+		},
+		{
+			name:      "get nested value",
+			command:   "get",
+			args:      []string{"config", "get", "bert_model.threshold", "-c", configPath},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config/config.yaml", "Path to configuration file")
+
+			configCmd := NewConfigCmd()
+			rootCmd.AddCommand(configCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestNestedValueHelpers(t *testing.T) {
+	tests := []struct {
+		name      string
+		data      map[string]interface{}
+		key       string
+		value     string
+		operation string // "get" or "set"
+		wantError bool
+		expected  interface{}
+	}{
+		{
+			name: "set simple key",
+			data: map[string]interface{}{
+				"key1": "value1",
+			},
+			key:       "key1",
+			value:     "new-value",
+			operation: "set",
+			wantError: false,
+		},
+		{
+			name: "set nested key",
+			data: map[string]interface{}{
+				"parent": map[string]interface{}{
+					"child": "value",
+				},
+			},
+			key:       "parent.child",
+			value:     "new-value",
+			operation: "set",
+			wantError: false,
+		},
+		{
+			name: "get simple key",
+			data: map[string]interface{}{
+				"key1": "value1",
+			},
+			key:       "key1",
+			operation: "get",
+			wantError: false,
+			expected:  "value1",
+		},
+		{
+			name: "get nested key",
+			data: map[string]interface{}{
+				"parent": map[string]interface{}{
+					"child": "value",
+				},
+			},
+			key:       "parent.child",
+			operation: "get",
+			wantError: false,
+			expected:  "value",
+		},
+		{
+			name: "get non-existent key",
+			data: map[string]interface{}{
+				"key1": "value1",
+			},
+			key:       "nonexistent",
+			operation: "get",
+			wantError: false,
+			expected:  nil,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			switch tt.operation {
+			case "set":
+				err := setNestedValue(tt.data, tt.key, tt.value)
+				if tt.wantError && err == nil {
+					t.Error("expected error, got nil")
+				}
+				if !tt.wantError && err != nil {
+					t.Errorf("unexpected error: %v", err)
+				}
+			case "get":
+				value, err := getNestedValue(tt.data, tt.key)
+				if tt.wantError && err == nil {
+					t.Error("expected error, got nil")
+				}
+				if !tt.wantError && err != nil {
+					t.Errorf("unexpected error: %v", err)
+				}
+				if !tt.wantError && value != tt.expected {
+					t.Errorf("expected %v, got %v", tt.expected, value)
+				}
+			}
+		})
+	}
+}
+
+func TestConfigEditCmd(t *testing.T) {
+	// This test just verifies the command exists and has correct structure
+	// Actual editor interaction is hard to test in unit tests
+	cmd := NewConfigCmd()
+	var editCmd *cobra.Command
+	for _, c := range cmd.Commands() {
+		if c.Use == "edit" {
+			editCmd = c
+			break
+		}
+	}
+
+	if editCmd == nil {
+		t.Fatal("edit subcommand not found")
+	}
+
+	if editCmd.Short != "Edit configuration in your default editor" {
+		t.Errorf("unexpected Short description: %s", editCmd.Short)
+	}
+}
diff --git a/src/semantic-router/cmd/vsr/commands/dashboard.go b/src/semantic-router/cmd/vsr/commands/dashboard.go
new file mode 100644
index 000000000..eea968e1d
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/dashboard.go
@@ -0,0 +1,268 @@
+package commands
+
+import (
+	"fmt"
+	"os"
+	"os/exec"
+	"os/signal"
+	"runtime"
+	"syscall"
+	"time"
+
+	"github.com/spf13/cobra"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli/deployment"
+)
+
+// NewDashboardCmd creates the dashboard command
+func NewDashboardCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "dashboard",
+		Short: "Open router dashboard in browser",
+		Long: `Open the router dashboard in your default web browser.
+
+Auto-detects the dashboard URL based on your deployment type:
+  - Docker: http://localhost:8700
+  - Kubernetes: Port-forwards and opens dashboard
+  - Helm: Port-forwards and opens dashboard
+  - Local: http://localhost:8700 (if available)
+
+Examples:
+  # Open dashboard (auto-detect deployment)
+  vsr dashboard
+
+  # Open dashboard for specific namespace
+  vsr dashboard --namespace production
+
+  # Open without auto-launching browser
+  vsr dashboard --no-open`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			namespace, _ := cmd.Flags().GetString("namespace")
+			noOpen, _ := cmd.Flags().GetBool("no-open")
+
+			cli.Info("Detecting dashboard deployment...")
+
+			// Auto-detect deployment type
+			deployType := detectActiveDeployment(namespace)
+
+			if deployType == "" {
+				cli.Warning("No active deployment detected")
+				cli.Info("Deploy the router first with: vsr deploy [local|docker|kubernetes|helm]")
+				return fmt.Errorf("no active deployment found")
+			}
+
+			cli.Info(fmt.Sprintf("Detected deployment type: %s", deployType))
+
+			var dashboardURL string
+			var portForwardCmd *exec.Cmd
+
+			switch deployType {
+			case "docker", "local":
+				dashboardURL = "http://localhost:8700"
+				cli.Info("Dashboard should be available at: " + dashboardURL)
+
+			case "kubernetes", "helm":
+				// Set up port forwarding
+				cli.Info("Setting up port forwarding...")
+
+				// Find dashboard pod
+				dashboardURL = "http://localhost:8700"
+
+				portForwardCmd = exec.Command("kubectl", "port-forward",
+					"-n", namespace,
+					"svc/semantic-router-dashboard",
+					"8700:8700")
+
+				// Start port-forward in background
+				if err := portForwardCmd.Start(); err != nil {
+					cli.Warning(fmt.Sprintf("Failed to start port-forward: %v", err))
+					cli.Info("Try manually: kubectl port-forward -n " + namespace + " svc/semantic-router-dashboard 8700:8700")
+					return err
+				}
+
+				// Give it a moment to establish
+				time.Sleep(2 * time.Second)
+				cli.Success("Port forwarding established")
+
+				// Clean up on exit
+				defer func() {
+					if portForwardCmd != nil && portForwardCmd.Process != nil {
+						_ = portForwardCmd.Process.Kill()
+						cli.Info("Port forwarding stopped")
+					}
+				}()
+			}
+
+			// Open browser
+			if !noOpen {
+				cli.Info("Opening dashboard in browser...")
+				if err := openBrowser(dashboardURL); err != nil {
+					cli.Warning(fmt.Sprintf("Failed to open browser: %v", err))
+					cli.Info("Please open manually: " + dashboardURL)
+				} else {
+					cli.Success("Dashboard opened!")
+				}
+			} else {
+				cli.Info("Dashboard URL: " + dashboardURL)
+			}
+
+			// For K8s/Helm, keep port-forward alive (Issue #3: Add signal handling)
+			if portForwardCmd != nil {
+				cli.Info("\nPort forwarding active. Press Ctrl+C to stop.")
+
+				// Handle interrupt signal for graceful shutdown
+				sigChan := make(chan os.Signal, 1)
+				signal.Notify(sigChan, os.Interrupt, syscall.SIGTERM)
+
+				// Wait for either process exit or interrupt signal
+				done := make(chan error)
+				go func() {
+					done <- portForwardCmd.Wait()
+				}()
+
+				select {
+				case <-sigChan:
+					cli.Info("Stopping port forwarding...")
+					if portForwardCmd.Process != nil {
+						_ = portForwardCmd.Process.Kill()
+					}
+				case err := <-done:
+					if err != nil {
+						cli.Warning(fmt.Sprintf("Port forwarding exited with error: %v", err))
+					}
+				}
+			}
+
+			return nil
+		},
+	}
+
+	cmd.Flags().String("namespace", "default", "Kubernetes namespace")
+	cmd.Flags().Bool("no-open", false, "Don't open browser automatically")
+
+	return cmd
+}
+
+// NewMetricsCmd creates the metrics command
+func NewMetricsCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "metrics",
+		Short: "Display router metrics",
+		Long: `Display key metrics for the router.
+
+Shows:
+  - Request counts
+  - Latency statistics
+  - Error rates
+  - Model usage
+  - Cost tracking (if configured)
+
+Examples:
+  # Show current metrics
+  vsr metrics
+
+  # Show metrics for specific time range
+  vsr metrics --since 1h
+
+  # Show metrics with auto-refresh
+  vsr metrics --watch`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			since, _ := cmd.Flags().GetString("since")
+			watch, _ := cmd.Flags().GetBool("watch")
+
+			if watch {
+				// Watch mode - refresh every 5 seconds
+				cli.Info("Metrics (refreshing every 5s, Ctrl+C to stop)")
+				cli.Info("")
+
+				for {
+					displayMetrics(since)
+					time.Sleep(5 * time.Second)
+					// Clear screen
+					fmt.Print("\033[H\033[2J")
+					cli.Info("Metrics (refreshing every 5s, Ctrl+C to stop)")
+					cli.Info("")
+				}
+			} else {
+				// One-time display
+				displayMetrics(since)
+			}
+
+			return nil
+		},
+	}
+
+	cmd.Flags().String("since", "5m", "Time range (e.g., 5m, 1h, 24h)")
+	cmd.Flags().Bool("watch", false, "Auto-refresh metrics")
+
+	return cmd
+}
+
+// detectActiveDeployment detects the active deployment type
+func detectActiveDeployment(namespace string) string {
+	// Check in order of specificity
+	if status := deployment.DetectHelmDeployment(namespace); status != nil && status.IsRunning {
+		return "helm"
+	}
+	if status := deployment.DetectKubernetesDeployment(namespace); status != nil && status.IsRunning {
+		return "kubernetes"
+	}
+	if status := deployment.DetectDockerDeployment(); status != nil && status.IsRunning {
+		return "docker"
+	}
+	if status := deployment.DetectLocalDeployment(); status != nil && status.IsRunning {
+		return "local"
+	}
+	return ""
+}
+
+// openBrowser opens a URL in the default browser
+func openBrowser(url string) error {
+	var cmd *exec.Cmd
+
+	switch runtime.GOOS {
+	case "linux":
+		cmd = exec.Command("xdg-open", url)
+	case "darwin":
+		cmd = exec.Command("open", url)
+	case "windows":
+		cmd = exec.Command("rundll32", "url.dll,FileProtocolHandler", url)
+	default:
+		return fmt.Errorf("unsupported platform: %s", runtime.GOOS)
+	}
+
+	return cmd.Start()
+}
+
+// displayMetrics displays metrics (placeholder implementation)
+func displayMetrics(since string) {
+	cli.Info("╔═══════════════════════════════════════════════════════════════╗")
+	cli.Info("║                     Router Metrics                            ║")
+	cli.Info("╠═══════════════════════════════════════════════════════════════╣")
+	cli.Info(fmt.Sprintf("║ Time Range: %-48s║", since))
+	cli.Info("╠═══════════════════════════════════════════════════════════════╣")
+
+	// Placeholder metrics
+	cli.Info("║                                                               ║")
+	cli.Info("║ 📊 Request Statistics                                         ║")
+	cli.Info("║   Total Requests:        N/A                                  ║")
+	cli.Info("║   Success Rate:          N/A                                  ║")
+	cli.Info("║   Error Rate:            N/A                                  ║")
+	cli.Info("║                                                               ║")
+	cli.Info("║ ⏱️  Latency                                                    ║")
+	cli.Info("║   Avg Response Time:     N/A                                  ║")
+	cli.Info("║   P95 Response Time:     N/A                                  ║")
+	cli.Info("║   P99 Response Time:     N/A                                  ║")
+	cli.Info("║                                                               ║")
+	cli.Info("║ 🤖 Model Usage                                                ║")
+	cli.Info("║   Intent Classifier:     N/A                                  ║")
+	cli.Info("║   PII Detector:          N/A                                  ║")
+	cli.Info("║   Security Classifier:   N/A                                  ║")
+	cli.Info("║                                                               ║")
+	cli.Info("╚═══════════════════════════════════════════════════════════════╝")
+
+	cli.Warning("\nNote: Metrics collection is not yet implemented")
+	cli.Info("This feature requires the router to be running with observability enabled")
+	cli.Info("Deploy with observability: vsr deploy docker --with-observability")
+}
diff --git a/src/semantic-router/cmd/vsr/commands/dashboard_test.go b/src/semantic-router/cmd/vsr/commands/dashboard_test.go
new file mode 100644
index 000000000..3ee2bc019
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/dashboard_test.go
@@ -0,0 +1,172 @@
+package commands
+
+import (
+	"testing"
+)
+
+func TestDashboardCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+		hasFlags      []string
+	}{
+		{
+			name:          "dashboard command has correct structure",
+			expectedUse:   "dashboard",
+			expectedShort: "Open router dashboard in browser",
+			hasFlags:      []string{"namespace", "no-open"},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewDashboardCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+
+			// Verify flags exist
+			for _, flagName := range tt.hasFlags {
+				if cmd.Flags().Lookup(flagName) == nil {
+					t.Errorf("expected flag %q not found", flagName)
+				}
+			}
+		})
+	}
+}
+
+func TestDashboardCommandFlags(t *testing.T) {
+	cmd := NewDashboardCmd()
+
+	tests := []struct {
+		flagName     string
+		flagType     string
+		defaultValue string
+	}{
+		{
+			flagName:     "namespace",
+			flagType:     "string",
+			defaultValue: "default",
+		},
+		{
+			flagName:     "no-open",
+			flagType:     "bool",
+			defaultValue: "false",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("flag_"+tt.flagName, func(t *testing.T) {
+			flag := cmd.Flags().Lookup(tt.flagName)
+			if flag == nil {
+				t.Fatalf("flag %q not found", tt.flagName)
+			}
+
+			if flag.Value.Type() != tt.flagType {
+				t.Errorf("expected flag type %q, got %q", tt.flagType, flag.Value.Type())
+			}
+
+			if flag.DefValue != tt.defaultValue {
+				t.Errorf("expected default value %q, got %q", tt.defaultValue, flag.DefValue)
+			}
+		})
+	}
+}
+
+func TestMetricsCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+		hasFlags      []string
+	}{
+		{
+			name:          "metrics command has correct structure",
+			expectedUse:   "metrics",
+			expectedShort: "Display router metrics",
+			hasFlags:      []string{"since", "watch"},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewMetricsCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+
+			// Verify flags exist
+			for _, flagName := range tt.hasFlags {
+				if cmd.Flags().Lookup(flagName) == nil {
+					t.Errorf("expected flag %q not found", flagName)
+				}
+			}
+		})
+	}
+}
+
+func TestMetricsCommandFlags(t *testing.T) {
+	cmd := NewMetricsCmd()
+
+	tests := []struct {
+		flagName     string
+		flagType     string
+		defaultValue string
+	}{
+		{
+			flagName:     "since",
+			flagType:     "string",
+			defaultValue: "5m",
+		},
+		{
+			flagName:     "watch",
+			flagType:     "bool",
+			defaultValue: "false",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("flag_"+tt.flagName, func(t *testing.T) {
+			flag := cmd.Flags().Lookup(tt.flagName)
+			if flag == nil {
+				t.Fatalf("flag %q not found", tt.flagName)
+			}
+
+			if flag.Value.Type() != tt.flagType {
+				t.Errorf("expected flag type %q, got %q", tt.flagType, flag.Value.Type())
+			}
+
+			if flag.DefValue != tt.defaultValue {
+				t.Errorf("expected default value %q, got %q", tt.defaultValue, flag.DefValue)
+			}
+		})
+	}
+}
+
+func TestDetectActiveDeploymentExists(t *testing.T) {
+	// Test that the function exists and can be called
+	result := detectActiveDeployment("default")
+	// Result can be empty string or a deployment type
+	if result != "" && result != "local" && result != "docker" && result != "kubernetes" && result != "helm" {
+		t.Errorf("unexpected deployment type: %s", result)
+	}
+}
+
+func TestOpenBrowserFunction(t *testing.T) {
+	// Test that the function exists
+	// We can't actually test browser opening, just that the function is callable
+	err := openBrowser("http://example.com")
+	// Error is expected as we likely don't have display, but function should exist
+	_ = err // Just testing function exists
+}
diff --git a/src/semantic-router/cmd/vsr/commands/debug.go b/src/semantic-router/cmd/vsr/commands/debug.go
new file mode 100644
index 000000000..ee8629342
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/debug.go
@@ -0,0 +1,253 @@
+package commands
+
+import (
+	"fmt"
+
+	"github.com/spf13/cobra"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli/debug"
+)
+
+// NewDebugCmd creates the debug command
+func NewDebugCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "debug",
+		Short: "Run interactive debugging session",
+		Long: `Run a comprehensive debugging session to identify issues.
+
+This command performs the following checks:
+  - Prerequisites (kubectl, docker, helm, make, git)
+  - Configuration file validation
+  - Model availability and integrity
+  - System resources (disk space, ports)
+  - Network connectivity
+
+Examples:
+  # Run full diagnostics
+  vsr debug
+
+  # Run with custom config
+  vsr debug --config /path/to/config.yaml
+
+  # Run with custom models directory
+  vsr debug --models-dir /path/to/models`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			configPath := cmd.Parent().Flag("config").Value.String()
+			modelsDir, _ := cmd.Flags().GetString("models-dir")
+
+			cli.Info("Starting interactive debug session...")
+			cli.Info("This will check your environment, configuration, and resources.")
+			cli.Info("")
+
+			// Run full diagnostics
+			report := debug.RunFullDiagnostics(configPath, modelsDir)
+
+			// Display report
+			debug.DisplayReport(report)
+
+			// Check if there are critical failures
+			hasCriticalFailures := false
+			for _, results := range [][]debug.CheckResult{
+				report.Prerequisites,
+				report.Configuration,
+				report.ModelStatus,
+			} {
+				for _, result := range results {
+					if result.Status == "fail" && result.Severity == "critical" {
+						hasCriticalFailures = true
+						break
+					}
+				}
+				if hasCriticalFailures {
+					break
+				}
+			}
+
+			if hasCriticalFailures {
+				cli.Error("\n❌ Critical issues found. Please resolve them before proceeding.")
+				return fmt.Errorf("critical diagnostic failures")
+			}
+
+			cli.Success("\n✅ Debug session complete!")
+			return nil
+		},
+	}
+
+	cmd.Flags().String("models-dir", "./models", "Models directory to check")
+
+	return cmd
+}
+
+// NewHealthCmd creates the health command
+func NewHealthCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "health",
+		Short: "Check router health status",
+		Long: `Perform a quick health check of the router and its components.
+
+This is a lightweight check that verifies:
+  - Configuration validity
+  - Model availability
+  - System resources
+  - Service connectivity
+
+Examples:
+  # Quick health check
+  vsr health
+
+  # Health check with custom config
+  vsr health --config /path/to/config.yaml`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			configPath := cmd.Parent().Flag("config").Value.String()
+			modelsDir := "./models"
+
+			cli.Info("Running health check...")
+			cli.Info("")
+
+			// Run quick checks
+			configResults := debug.CheckConfiguration(configPath)
+			modelResults := debug.CheckModelStatus(modelsDir)
+			resourceResults := debug.CheckResources()
+			connectivityResults := debug.CheckConnectivity(nil)
+
+			// Display results
+			allPass := true
+
+			// Config
+			for _, result := range configResults {
+				switch result.Status {
+				case "fail":
+					cli.Error(fmt.Sprintf("✗ %s: %s", result.Name, result.Message))
+					allPass = false
+				case "warn":
+					cli.Warning(fmt.Sprintf("⚠ %s: %s", result.Name, result.Message))
+				default:
+					cli.Success(fmt.Sprintf("✓ %s", result.Name))
+				}
+			}
+
+			// Models
+			for _, result := range modelResults {
+				switch result.Status {
+				case "fail":
+					cli.Error(fmt.Sprintf("✗ %s: %s", result.Name, result.Message))
+					allPass = false
+				case "warn":
+					cli.Warning(fmt.Sprintf("⚠ %s: %s", result.Name, result.Message))
+				default:
+					cli.Success(fmt.Sprintf("✓ %s", result.Name))
+				}
+			}
+
+			// Resources
+			for _, result := range resourceResults {
+				switch result.Status {
+				case "fail":
+					cli.Error(fmt.Sprintf("✗ %s: %s", result.Name, result.Message))
+					allPass = false
+				case "warn":
+					cli.Warning(fmt.Sprintf("⚠ %s: %s", result.Name, result.Message))
+				default:
+					cli.Success(fmt.Sprintf("✓ %s", result.Name))
+				}
+			}
+
+			// Connectivity
+			hasConnectivity := false
+			for _, result := range connectivityResults {
+				switch result.Status {
+				case "pass":
+					cli.Success(fmt.Sprintf("✓ %s is reachable", result.Name))
+					hasConnectivity = true
+				case "warn":
+					cli.Warning(fmt.Sprintf("⚠ %s: %s", result.Name, result.Message))
+				default:
+					// Don't fail on connectivity issues, just warn
+					cli.Warning(fmt.Sprintf("⚠ %s is not reachable", result.Name))
+				}
+			}
+
+			cli.Info("")
+
+			// Overall status
+			if allPass && hasConnectivity {
+				cli.Success("🟢 Overall Health: GOOD")
+				cli.Info("All systems operational")
+			} else if allPass {
+				cli.Warning("🟡 Overall Health: DEGRADED")
+				cli.Info("Router is not running but environment is ready")
+				cli.Info("Deploy with: vsr deploy [local|docker|kubernetes|helm]")
+			} else {
+				cli.Error("🔴 Overall Health: POOR")
+				cli.Info("Critical issues detected")
+				cli.Info("Run 'vsr debug' for detailed diagnostics")
+				return fmt.Errorf("health check failed")
+			}
+
+			return nil
+		},
+	}
+
+	return cmd
+}
+
+// NewDiagnoseCmd creates the diagnose command
+func NewDiagnoseCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "diagnose",
+		Short: "Generate diagnostic report",
+		Long: `Generate a comprehensive diagnostic report for troubleshooting.
+
+The report includes:
+  - System information
+  - Environment checks
+  - Configuration status
+  - Model status
+  - Resource availability
+  - Network connectivity
+  - Recommendations
+
+The report can be saved to a file for support tickets.
+
+Examples:
+  # Generate report to stdout
+  vsr diagnose
+
+  # Save report to file
+  vsr diagnose --output report.txt
+
+  # Generate with custom config
+  vsr diagnose --config /path/to/config.yaml --output report.txt`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			configPath := cmd.Parent().Flag("config").Value.String()
+			modelsDir, _ := cmd.Flags().GetString("models-dir")
+			outputFile, _ := cmd.Flags().GetString("output")
+
+			// Run diagnostics
+			report := debug.RunFullDiagnostics(configPath, modelsDir)
+
+			// Display to stdout
+			if outputFile == "" {
+				debug.DisplayReport(report)
+			} else {
+				// Save to file
+				cli.Info(fmt.Sprintf("Generating diagnostic report to: %s", outputFile))
+
+				// TODO: Implement file output
+				// For now, display and inform user
+				debug.DisplayReport(report)
+
+				cli.Info(fmt.Sprintf("\n📄 Report would be saved to: %s", outputFile))
+				cli.Info("Note: File output not yet implemented")
+			}
+
+			return nil
+		},
+	}
+
+	cmd.Flags().String("models-dir", "./models", "Models directory to check")
+	cmd.Flags().String("output", "", "Output file for the report")
+
+	return cmd
+}
diff --git a/src/semantic-router/cmd/vsr/commands/debug_test.go b/src/semantic-router/cmd/vsr/commands/debug_test.go
new file mode 100644
index 000000000..04db153b6
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/debug_test.go
@@ -0,0 +1,219 @@
+package commands
+
+import (
+	"testing"
+
+	"github.com/spf13/cobra"
+)
+
+func TestDebugCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+	}{
+		{
+			name:          "debug command has correct structure",
+			expectedUse:   "debug",
+			expectedShort: "Run interactive debugging session",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewDebugCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+		})
+	}
+}
+
+func TestDebugCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "debug command runs",
+			args:      []string{"debug"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config/config.yaml", "Path to configuration file")
+
+			debugCmd := NewDebugCmd()
+			rootCmd.AddCommand(debugCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestHealthCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+	}{
+		{
+			name:          "health command has correct structure",
+			expectedUse:   "health",
+			expectedShort: "Check router health",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewHealthCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+		})
+	}
+}
+
+func TestHealthCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "health command runs",
+			args:      []string{"health"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config/config.yaml", "Path to configuration file")
+
+			healthCmd := NewHealthCmd()
+			rootCmd.AddCommand(healthCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestDiagnoseCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+		hasFlags      []string
+	}{
+		{
+			name:          "diagnose command has correct structure",
+			expectedUse:   "diagnose",
+			expectedShort: "Generate diagnostic report",
+			hasFlags:      []string{"output"},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewDiagnoseCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+
+			// Verify flags exist
+			for _, flagName := range tt.hasFlags {
+				if cmd.Flags().Lookup(flagName) == nil {
+					t.Errorf("expected flag %q not found", flagName)
+				}
+			}
+		})
+	}
+}
+
+func TestDiagnoseCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "diagnose command runs",
+			args:      []string{"diagnose"},
+			wantError: false,
+		},
+		{
+			name:      "diagnose with output flag",
+			args:      []string{"diagnose", "--output", "/tmp/diagnose.txt"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config/config.yaml", "Path to configuration file")
+
+			diagnoseCmd := NewDiagnoseCmd()
+			rootCmd.AddCommand(diagnoseCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestDiagnoseCommandFlags(t *testing.T) {
+	cmd := NewDiagnoseCmd()
+
+	tests := []struct {
+		flagName     string
+		flagType     string
+		defaultValue string
+	}{
+		{
+			flagName:     "output",
+			flagType:     "string",
+			defaultValue: "",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("flag_"+tt.flagName, func(t *testing.T) {
+			flag := cmd.Flags().Lookup(tt.flagName)
+			if flag == nil {
+				t.Fatalf("flag %q not found", tt.flagName)
+			}
+
+			if flag.Value.Type() != tt.flagType {
+				t.Errorf("expected flag type %q, got %q", tt.flagType, flag.Value.Type())
+			}
+
+			if flag.DefValue != tt.defaultValue {
+				t.Errorf("expected default value %q, got %q", tt.defaultValue, flag.DefValue)
+			}
+		})
+	}
+}
diff --git a/src/semantic-router/cmd/vsr/commands/deploy.go b/src/semantic-router/cmd/vsr/commands/deploy.go
new file mode 100644
index 000000000..99dbee6c1
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/deploy.go
@@ -0,0 +1,148 @@
+package commands
+
+import (
+	"fmt"
+
+	"github.com/spf13/cobra"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli/deployment"
+)
+
+// NewDeployCmd creates the deploy command
+func NewDeployCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "deploy [local|docker|kubernetes|helm]",
+		Short: "Deploy the router to specified environment",
+		Long: `Deploy the vLLM Semantic Router to different environments.
+
+Supported environments:
+  local       - Run router as local process
+  docker      - Deploy using Docker Compose
+  kubernetes  - Deploy to Kubernetes cluster
+  helm        - Deploy using Helm chart`,
+		Args: cobra.ExactArgs(1),
+		RunE: func(cmd *cobra.Command, args []string) error {
+			env := args[0]
+			configPath := cmd.Parent().Flag("config").Value.String()
+			withObs, _ := cmd.Flags().GetBool("with-observability")
+			namespace, _ := cmd.Flags().GetString("namespace")
+			releaseName, _ := cmd.Flags().GetString("release-name")
+			setValues, _ := cmd.Flags().GetStringArray("set")
+			force, _ := cmd.Flags().GetBool("force")
+
+			switch env {
+			case "local":
+				return deployment.DeployLocal(configPath, force)
+			case "docker":
+				return deployment.DeployDocker(configPath, withObs)
+			case "kubernetes":
+				return deployment.DeployKubernetes(configPath, namespace, withObs)
+			case "helm":
+				return deployment.DeployHelm(configPath, namespace, releaseName, withObs, setValues)
+			default:
+				return fmt.Errorf("unknown environment: %s", env)
+			}
+		},
+	}
+
+	cmd.Flags().Bool("with-observability", true, "Deploy with Grafana/Prometheus observability stack")
+	cmd.Flags().String("namespace", "default", "Kubernetes namespace for deployment")
+	cmd.Flags().String("release-name", "", "Helm release name (default: semantic-router)")
+	cmd.Flags().StringArray("set", []string{}, "Set values for Helm chart (can be used multiple times)")
+	cmd.Flags().Bool("dry-run", false, "Show commands without executing")
+	cmd.Flags().Bool("force", false, "Force replacement of existing local deployment")
+
+	return cmd
+}
+
+// NewUndeployCmd creates the undeploy command
+func NewUndeployCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "undeploy [local|docker|kubernetes|helm]",
+		Short: "Remove router deployment",
+		Long: `Remove the vLLM Semantic Router deployment from the specified environment.
+
+Examples:
+  # Undeploy local router
+  vsr undeploy local
+
+  # Undeploy Docker deployment
+  vsr undeploy docker
+
+  # Undeploy Docker and remove volumes
+  vsr undeploy docker --volumes
+
+  # Undeploy Kubernetes and wait for cleanup
+  vsr undeploy kubernetes --wait
+
+  # Undeploy from specific namespace
+  vsr undeploy kubernetes --namespace production --wait
+
+  # Undeploy Helm release
+  vsr undeploy helm --namespace production --wait`,
+		Args: cobra.ExactArgs(1),
+		RunE: func(cmd *cobra.Command, args []string) error {
+			env := args[0]
+			namespace, _ := cmd.Flags().GetString("namespace")
+			removeVolumes, _ := cmd.Flags().GetBool("volumes")
+			wait, _ := cmd.Flags().GetBool("wait")
+			releaseName, _ := cmd.Flags().GetString("release-name")
+
+			switch env {
+			case "local":
+				return deployment.UndeployLocal()
+			case "docker":
+				return deployment.UndeployDocker(removeVolumes)
+			case "kubernetes":
+				return deployment.UndeployKubernetes(namespace, wait)
+			case "helm":
+				return deployment.UndeployHelm(namespace, releaseName, wait)
+			default:
+				return fmt.Errorf("unknown environment: %s", env)
+			}
+		},
+	}
+
+	cmd.Flags().String("namespace", "default", "Kubernetes namespace")
+	cmd.Flags().String("release-name", "", "Helm release name (default: semantic-router)")
+	cmd.Flags().Bool("volumes", false, "Remove volumes (Docker only)")
+	cmd.Flags().Bool("wait", false, "Wait for complete cleanup (Kubernetes/Helm only)")
+	return cmd
+}
+
+// NewStartCmd creates the start command
+func NewStartCmd() *cobra.Command {
+	return &cobra.Command{
+		Use:   "start",
+		Short: "Start the router service",
+		RunE: func(cmd *cobra.Command, args []string) error {
+			cli.Warning("Not implemented: use 'vsr deploy' instead")
+			return nil
+		},
+	}
+}
+
+// NewStopCmd creates the stop command
+func NewStopCmd() *cobra.Command {
+	return &cobra.Command{
+		Use:   "stop",
+		Short: "Stop the router service",
+		RunE: func(cmd *cobra.Command, args []string) error {
+			cli.Warning("Not implemented: use 'vsr undeploy' instead")
+			return nil
+		},
+	}
+}
+
+// NewRestartCmd creates the restart command
+func NewRestartCmd() *cobra.Command {
+	return &cobra.Command{
+		Use:   "restart",
+		Short: "Restart the router service",
+		RunE: func(cmd *cobra.Command, args []string) error {
+			cli.Warning("Not implemented: use 'vsr undeploy' then 'vsr deploy' instead")
+			return nil
+		},
+	}
+}
diff --git a/src/semantic-router/cmd/vsr/commands/deploy_test.go b/src/semantic-router/cmd/vsr/commands/deploy_test.go
new file mode 100644
index 000000000..40a979205
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/deploy_test.go
@@ -0,0 +1,326 @@
+package commands
+
+import (
+	"bytes"
+	"strings"
+	"testing"
+
+	"github.com/spf13/cobra"
+)
+
+func TestDeployCommand(t *testing.T) {
+	tests := []struct {
+		name        string
+		args        []string
+		expectError bool
+		errorMsg    string
+	}{
+		{
+			name:        "deploy without environment",
+			args:        []string{},
+			expectError: true,
+			errorMsg:    "accepts 1 arg(s)",
+		},
+		{
+			name:        "deploy with valid environment - local",
+			args:        []string{"local"},
+			expectError: false,
+		},
+		{
+			name:        "deploy with valid environment - docker",
+			args:        []string{"docker"},
+			expectError: false,
+		},
+		{
+			name:        "deploy with valid environment - kubernetes",
+			args:        []string{"kubernetes"},
+			expectError: false,
+		},
+		{
+			name:        "deploy with invalid environment",
+			args:        []string{"invalid"},
+			expectError: false, // Command parsing succeeds, execution would fail
+		},
+		{
+			name:        "deploy with too many args",
+			args:        []string{"docker", "extra"},
+			expectError: true,
+			errorMsg:    "accepts 1 arg(s)",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewDeployCmd()
+
+			// Create a root command to attach flags properly
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config.yaml", "config file")
+			rootCmd.AddCommand(cmd)
+
+			// Set args
+			rootCmd.SetArgs(append([]string{"deploy"}, tt.args...))
+
+			// Capture output
+			buf := new(bytes.Buffer)
+			rootCmd.SetOut(buf)
+			rootCmd.SetErr(buf)
+
+			// Execute
+			err := rootCmd.Execute()
+
+			if tt.expectError {
+				if err == nil {
+					t.Errorf("Expected error but got none")
+				} else if tt.errorMsg != "" && !strings.Contains(err.Error(), tt.errorMsg) {
+					t.Errorf("Expected error containing %q, got %q", tt.errorMsg, err.Error())
+				}
+			}
+		})
+	}
+}
+
+func TestDeployCommandFlags(t *testing.T) {
+	tests := []struct {
+		name          string
+		args          []string
+		expectedFlags map[string]string
+	}{
+		{
+			name: "default flags",
+			args: []string{"docker"},
+			expectedFlags: map[string]string{
+				"with-observability": "true",
+				"namespace":          "default",
+				"dry-run":            "false",
+			},
+		},
+		{
+			name: "with custom namespace",
+			args: []string{"kubernetes", "--namespace", "production"},
+			expectedFlags: map[string]string{
+				"namespace": "production",
+			},
+		},
+		{
+			name: "without observability",
+			args: []string{"docker", "--with-observability=false"},
+			expectedFlags: map[string]string{
+				"with-observability": "false",
+			},
+		},
+		{
+			name: "dry-run enabled",
+			args: []string{"kubernetes", "--dry-run"},
+			expectedFlags: map[string]string{
+				"dry-run": "true",
+			},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewDeployCmd()
+
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config.yaml", "config file")
+			rootCmd.AddCommand(cmd)
+
+			// Set args and parse
+			fullArgs := append([]string{"deploy"}, tt.args...)
+			rootCmd.SetArgs(fullArgs)
+
+			// Parse command (this will parse the subcommand flags)
+			_, err := rootCmd.ExecuteC()
+			// Ignore execution errors, we're just testing flag parsing
+			_ = err
+
+			// Check flags
+			for flagName, expectedValue := range tt.expectedFlags {
+				flag := cmd.Flags().Lookup(flagName)
+				if flag == nil {
+					t.Errorf("Flag %q not found", flagName)
+					continue
+				}
+				if flag.Value.String() != expectedValue {
+					t.Errorf("Flag %q: expected %q, got %q", flagName, expectedValue, flag.Value.String())
+				}
+			}
+		})
+	}
+}
+
+func TestUndeployCommand(t *testing.T) {
+	tests := []struct {
+		name        string
+		args        []string
+		expectError bool
+		errorMsg    string
+	}{
+		{
+			name:        "undeploy without environment",
+			args:        []string{},
+			expectError: true,
+			errorMsg:    "accepts 1 arg(s)",
+		},
+		{
+			name:        "undeploy local",
+			args:        []string{"local"},
+			expectError: false,
+		},
+		{
+			name:        "undeploy docker",
+			args:        []string{"docker"},
+			expectError: false,
+		},
+		{
+			name:        "undeploy kubernetes",
+			args:        []string{"kubernetes"},
+			expectError: false,
+		},
+		{
+			name:        "undeploy with too many args",
+			args:        []string{"docker", "extra"},
+			expectError: true,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewUndeployCmd()
+
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.AddCommand(cmd)
+			rootCmd.SetArgs(append([]string{"undeploy"}, tt.args...))
+
+			buf := new(bytes.Buffer)
+			rootCmd.SetOut(buf)
+			rootCmd.SetErr(buf)
+
+			err := rootCmd.Execute()
+
+			if tt.expectError {
+				if err == nil {
+					t.Errorf("Expected error but got none")
+				} else if tt.errorMsg != "" && !strings.Contains(err.Error(), tt.errorMsg) {
+					t.Errorf("Expected error containing %q, got %q", tt.errorMsg, err.Error())
+				}
+			}
+		})
+	}
+}
+
+func TestUndeployCommandFlags(t *testing.T) {
+	tests := []struct {
+		name          string
+		args          []string
+		expectedFlags map[string]string
+	}{
+		{
+			name: "default flags",
+			args: []string{"docker"},
+			expectedFlags: map[string]string{
+				"namespace": "default",
+				"volumes":   "false",
+				"wait":      "false",
+			},
+		},
+		{
+			name: "with volumes flag",
+			args: []string{"docker", "--volumes"},
+			expectedFlags: map[string]string{
+				"volumes": "true",
+			},
+		},
+		{
+			name: "with wait flag",
+			args: []string{"kubernetes", "--wait"},
+			expectedFlags: map[string]string{
+				"wait": "true",
+			},
+		},
+		{
+			name: "with custom namespace and wait",
+			args: []string{"kubernetes", "--namespace", "prod", "--wait"},
+			expectedFlags: map[string]string{
+				"namespace": "prod",
+				"wait":      "true",
+			},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewUndeployCmd()
+
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.AddCommand(cmd)
+
+			fullArgs := append([]string{"undeploy"}, tt.args...)
+			rootCmd.SetArgs(fullArgs)
+
+			// Parse command
+			_, err := rootCmd.ExecuteC()
+			_ = err // Ignore execution errors
+
+			for flagName, expectedValue := range tt.expectedFlags {
+				flag := cmd.Flags().Lookup(flagName)
+				if flag == nil {
+					t.Errorf("Flag %q not found", flagName)
+					continue
+				}
+				if flag.Value.String() != expectedValue {
+					t.Errorf("Flag %q: expected %q, got %q", flagName, expectedValue, flag.Value.String())
+				}
+			}
+		})
+	}
+}
+
+func TestStartStopRestartCommands(t *testing.T) {
+	tests := []struct {
+		name    string
+		cmdFunc func() *cobra.Command
+		cmdName string
+	}{
+		{
+			name:    "start command",
+			cmdFunc: NewStartCmd,
+			cmdName: "start",
+		},
+		{
+			name:    "stop command",
+			cmdFunc: NewStopCmd,
+			cmdName: "stop",
+		},
+		{
+			name:    "restart command",
+			cmdFunc: NewRestartCmd,
+			cmdName: "restart",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := tt.cmdFunc()
+
+			if cmd.Use != tt.cmdName {
+				t.Errorf("Expected Use=%q, got %q", tt.cmdName, cmd.Use)
+			}
+
+			// These commands should run without error (they just show warnings)
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.AddCommand(cmd)
+			rootCmd.SetArgs([]string{tt.cmdName})
+
+			err := rootCmd.Execute()
+			if err != nil {
+				t.Errorf("Command should not error: %v", err)
+			}
+
+			// The commands run successfully and print warnings
+			// We can't easily capture the cli.Warning output
+			// so we just verify they execute without error
+		})
+	}
+}
diff --git a/src/semantic-router/cmd/vsr/commands/get.go b/src/semantic-router/cmd/vsr/commands/get.go
new file mode 100644
index 000000000..403318b4d
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/get.go
@@ -0,0 +1,164 @@
+package commands
+
+import (
+	"fmt"
+
+	"github.com/spf13/cobra"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/config"
+)
+
+// NewGetCmd creates the get command
+func NewGetCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "get [models|categories|decisions|endpoints]",
+		Short: "Get information about router resources",
+		Long: `Retrieve and display information about configured resources.
+
+Available resources:
+  models      - List all configured models
+  categories  - List all routing categories
+  decisions   - List all routing decisions
+  endpoints   - List all backend endpoints`,
+		Args: cobra.ExactArgs(1),
+		RunE: func(cmd *cobra.Command, args []string) error {
+			resource := args[0]
+			configPath := cmd.Parent().Flag("config").Value.String()
+
+			cfg, err := config.Load(configPath)
+			if err != nil {
+				return fmt.Errorf("failed to load config: %w", err)
+			}
+
+			outputFormat := cmd.Parent().Flag("output").Value.String()
+
+			switch resource {
+			case "models":
+				return displayModels(cfg, outputFormat)
+			case "categories":
+				return displayCategories(cfg, outputFormat)
+			case "decisions":
+				return displayDecisions(cfg, outputFormat)
+			case "endpoints":
+				return displayEndpoints(cfg, outputFormat)
+			default:
+				return fmt.Errorf("unknown resource: %s (valid options: models, categories, decisions, endpoints)", resource)
+			}
+		},
+	}
+
+	return cmd
+}
+
+func displayModels(cfg *config.RouterConfig, format string) error {
+	switch format {
+	case "json":
+		return cli.PrintJSON(cfg.ModelConfig)
+	case "yaml":
+		return cli.PrintYAML(cfg.ModelConfig)
+	}
+
+	// Table format
+	headers := []string{"Model Name", "Endpoints", "Pricing"}
+	var rows [][]string
+
+	for modelName, modelCfg := range cfg.ModelConfig {
+		endpoints := "N/A"
+		if len(modelCfg.PreferredEndpoints) > 0 {
+			endpoints = fmt.Sprintf("%v", modelCfg.PreferredEndpoints)
+		}
+
+		pricing := "N/A"
+		if modelCfg.Pricing.Currency != "" {
+			pricing = fmt.Sprintf("%s %.2f/%.2f per 1M",
+				modelCfg.Pricing.Currency,
+				modelCfg.Pricing.PromptPer1M,
+				modelCfg.Pricing.CompletionPer1M)
+		}
+
+		rows = append(rows, []string{modelName, endpoints, pricing})
+	}
+
+	cli.PrintTable(headers, rows)
+	return nil
+}
+
+func displayCategories(cfg *config.RouterConfig, format string) error {
+	switch format {
+	case "json":
+		return cli.PrintJSON(cfg.Categories)
+	case "yaml":
+		return cli.PrintYAML(cfg.Categories)
+	}
+
+	// Table format
+	headers := []string{"Category", "Description", "MMLU Categories"}
+	var rows [][]string
+
+	for _, category := range cfg.Categories {
+		rows = append(rows, []string{
+			category.Name,
+			category.Description,
+			fmt.Sprintf("%v", category.MMLUCategories),
+		})
+	}
+
+	cli.PrintTable(headers, rows)
+	return nil
+}
+
+func displayDecisions(cfg *config.RouterConfig, format string) error {
+	switch format {
+	case "json":
+		return cli.PrintJSON(cfg.Decisions)
+	case "yaml":
+		return cli.PrintYAML(cfg.Decisions)
+	}
+
+	// Table format
+	headers := []string{"Decision", "Description", "Priority", "Models"}
+	var rows [][]string
+
+	for _, decision := range cfg.Decisions {
+		var models []string
+		for _, ref := range decision.ModelRefs {
+			models = append(models, ref.Model)
+		}
+
+		rows = append(rows, []string{
+			decision.Name,
+			decision.Description,
+			fmt.Sprintf("%d", decision.Priority),
+			fmt.Sprintf("%v", models),
+		})
+	}
+
+	cli.PrintTable(headers, rows)
+	return nil
+}
+
+func displayEndpoints(cfg *config.RouterConfig, format string) error {
+	switch format {
+	case "json":
+		return cli.PrintJSON(cfg.VLLMEndpoints)
+	case "yaml":
+		return cli.PrintYAML(cfg.VLLMEndpoints)
+	}
+
+	// Table format
+	headers := []string{"Name", "Address", "Port", "Weight"}
+	var rows [][]string
+
+	for _, endpoint := range cfg.VLLMEndpoints {
+		rows = append(rows, []string{
+			endpoint.Name,
+			endpoint.Address,
+			fmt.Sprintf("%d", endpoint.Port),
+			fmt.Sprintf("%d", endpoint.Weight),
+		})
+	}
+
+	cli.PrintTable(headers, rows)
+	return nil
+}
diff --git a/src/semantic-router/cmd/vsr/commands/get_test.go b/src/semantic-router/cmd/vsr/commands/get_test.go
new file mode 100644
index 000000000..d4356db82
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/get_test.go
@@ -0,0 +1,206 @@
+package commands
+
+import (
+	"os"
+	"path/filepath"
+	"testing"
+
+	"github.com/spf13/cobra"
+)
+
+func TestGetCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+	}{
+		{
+			name:          "get command has correct structure",
+			expectedUse:   "get [models|categories|decisions|endpoints]",
+			expectedShort: "Get information about router resources",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewGetCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+		})
+	}
+}
+
+func TestGetCommand(t *testing.T) {
+	// Create temporary config file
+	tmpDir := t.TempDir()
+	configPath := filepath.Join(tmpDir, "config.yaml")
+	configContent := `bert_model:
+  model_id: "test-model"
+  threshold: 0.8
+
+vllm_endpoints:
+  - name: "endpoint1"
+    address: "127.0.0.1"
+    port: 8000
+    weight: 1
+
+model_config:
+  test-model:
+    preferred_endpoints: ["endpoint1"]
+    pricing:
+      currency: "USD"
+      prompt_per_1m: 0.5
+      completion_per_1m: 1.5
+
+categories:
+  - name: "math"
+    description: "Math queries"
+    mmlu_categories: []
+
+decisions:
+  - name: "test_decision"
+    description: "Test decision"
+    priority: 10
+    modelRefs:
+      - model: "test-model"
+
+default_model: "test-model"
+`
+	if err := os.WriteFile(configPath, []byte(configContent), 0o644); err != nil {
+		t.Fatalf("Failed to create test config: %v", err)
+	}
+
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "get models",
+			args:      []string{"get", "models", "-c", configPath},
+			wantError: false,
+		},
+		{
+			name:      "get categories",
+			args:      []string{"get", "categories", "-c", configPath},
+			wantError: false,
+		},
+		{
+			name:      "get decisions",
+			args:      []string{"get", "decisions", "-c", configPath},
+			wantError: false,
+		},
+		{
+			name:      "get endpoints",
+			args:      []string{"get", "endpoints", "-c", configPath},
+			wantError: false,
+		},
+		{
+			name:      "get unknown resource",
+			args:      []string{"get", "unknown", "-c", configPath},
+			wantError: true,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config/config.yaml", "Path to configuration file")
+			rootCmd.PersistentFlags().StringP("output", "o", "table", "Output format")
+
+			getCmd := NewGetCmd()
+			rootCmd.AddCommand(getCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestGetCommandWithDifferentOutputFormats(t *testing.T) {
+	tmpDir := t.TempDir()
+	configPath := filepath.Join(tmpDir, "config.yaml")
+	configContent := `bert_model:
+  model_id: "test-model"
+  threshold: 0.8
+
+vllm_endpoints:
+  - name: "endpoint1"
+    address: "127.0.0.1"
+    port: 8000
+    weight: 1
+
+model_config:
+  test-model:
+    preferred_endpoints: ["endpoint1"]
+
+default_model: "test-model"
+`
+	if err := os.WriteFile(configPath, []byte(configContent), 0o644); err != nil {
+		t.Fatalf("Failed to create test config: %v", err)
+	}
+
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "get models in json format",
+			args:      []string{"get", "models", "-c", configPath, "-o", "json"},
+			wantError: false,
+		},
+		{
+			name:      "get models in yaml format",
+			args:      []string{"get", "models", "-c", configPath, "-o", "yaml"},
+			wantError: false,
+		},
+		{
+			name:      "get models in table format",
+			args:      []string{"get", "models", "-c", configPath, "-o", "table"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config/config.yaml", "Path to configuration file")
+			rootCmd.PersistentFlags().StringP("output", "o", "table", "Output format")
+
+			getCmd := NewGetCmd()
+			rootCmd.AddCommand(getCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestGetCommandRequiresResource(t *testing.T) {
+	tmpDir := t.TempDir()
+	configPath := filepath.Join(tmpDir, "config.yaml")
+	configContent := `default_model: "test"`
+	if err := os.WriteFile(configPath, []byte(configContent), 0o644); err != nil {
+		t.Fatalf("Failed to create test config: %v", err)
+	}
+
+	rootCmd := &cobra.Command{Use: "vsr"}
+	rootCmd.PersistentFlags().StringP("config", "c", configPath, "Path to configuration file")
+
+	getCmd := NewGetCmd()
+	rootCmd.AddCommand(getCmd)
+
+	rootCmd.SetArgs([]string{"get"})
+	_, err := rootCmd.ExecuteC()
+
+	if err == nil {
+		t.Error("expected error when no resource specified, got nil")
+	}
+}
diff --git a/src/semantic-router/cmd/vsr/commands/http_timeout_test.go b/src/semantic-router/cmd/vsr/commands/http_timeout_test.go
new file mode 100644
index 000000000..c3539f613
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/http_timeout_test.go
@@ -0,0 +1,164 @@
+package commands
+
+import (
+	"bytes"
+	"encoding/json"
+	"net/http"
+	"net/http/httptest"
+	"testing"
+	"time"
+)
+
+// TestHTTPTimeout verifies 30-second HTTP timeout prevents hanging requests
+func TestHTTPTimeout(t *testing.T) {
+	t.Run("request times out after 30 seconds", func(t *testing.T) {
+		// Create a server that never responds
+		hangingServer := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
+			time.Sleep(60 * time.Second) // Never responds
+		}))
+		defer hangingServer.Close()
+
+		// Create HTTP client with 30-second timeout
+		client := &http.Client{
+			Timeout: 30 * time.Second,
+		}
+
+		reqBody := map[string]string{"text": "test"}
+		jsonData, _ := json.Marshal(reqBody)
+
+		startTime := time.Now()
+		resp, err := client.Post(
+			hangingServer.URL,
+			"application/json",
+			bytes.NewBuffer(jsonData),
+		)
+		elapsed := time.Since(startTime)
+
+		if err == nil {
+			resp.Body.Close()
+			t.Error("Request should have timed out but succeeded")
+		}
+
+		// Verify timeout occurred within expected range (30-31 seconds)
+		if elapsed < 29*time.Second || elapsed > 31*time.Second {
+			t.Errorf("Timeout should occur around 30s, took %v", elapsed)
+		}
+	})
+
+	t.Run("fast responses still work", func(t *testing.T) {
+		// Create a server that responds quickly
+		fastServer := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
+			w.WriteHeader(http.StatusOK)
+			_ = json.NewEncoder(w).Encode(map[string]string{"status": "ok"})
+		}))
+		defer fastServer.Close()
+
+		client := &http.Client{
+			Timeout: 30 * time.Second,
+		}
+
+		reqBody := map[string]string{"text": "test"}
+		jsonData, _ := json.Marshal(reqBody)
+
+		startTime := time.Now()
+		resp, err := client.Post(
+			fastServer.URL,
+			"application/json",
+			bytes.NewBuffer(jsonData),
+		)
+		elapsed := time.Since(startTime)
+
+		if err != nil {
+			t.Errorf("Fast request should succeed, got error: %v", err)
+		}
+
+		if resp != nil {
+			resp.Body.Close()
+		}
+
+		// Verify response is fast (< 1 second)
+		if elapsed > 1*time.Second {
+			t.Errorf("Response should be fast (<1s), took %v", elapsed)
+		}
+	})
+
+	t.Run("client timeout is configurable", func(t *testing.T) {
+		// Verify we can create clients with different timeouts
+		shortTimeout := &http.Client{Timeout: 5 * time.Second}
+		longTimeout := &http.Client{Timeout: 60 * time.Second}
+
+		if shortTimeout.Timeout != 5*time.Second {
+			t.Error("Short timeout not set correctly")
+		}
+		if longTimeout.Timeout != 60*time.Second {
+			t.Error("Long timeout not set correctly")
+		}
+	})
+}
+
+// TestInputValidation verifies 10k character limit on prompts
+func TestInputValidation(t *testing.T) {
+	t.Run("prompt under 10k characters is valid", func(t *testing.T) {
+		prompt := "This is a valid prompt"
+		if len(prompt) > 10000 {
+			t.Errorf("Test prompt should be under 10k characters, got %d", len(prompt))
+		}
+	})
+
+	t.Run("prompt over 10k characters is invalid", func(t *testing.T) {
+		// Create a prompt over 10k characters
+		longPrompt := make([]byte, 10001)
+		for i := range longPrompt {
+			longPrompt[i] = 'a'
+		}
+
+		if len(longPrompt) <= 10000 {
+			t.Error("Test prompt should be over 10k characters")
+		}
+
+		// Verify validation would fail
+		if len(longPrompt) <= 10000 {
+			t.Errorf("Prompt length validation should fail for %d characters", len(longPrompt))
+		}
+	})
+
+	t.Run("exact 10k character limit", func(t *testing.T) {
+		exactPrompt := make([]byte, 10000)
+		for i := range exactPrompt {
+			exactPrompt[i] = 'a'
+		}
+
+		if len(exactPrompt) != 10000 {
+			t.Errorf("Prompt should be exactly 10000 characters, got %d", len(exactPrompt))
+		}
+
+		// At exactly 10k, should be valid
+		if len(exactPrompt) > 10000 {
+			t.Error("Prompt at exactly 10000 characters should be valid")
+		}
+	})
+}
+
+// TestHTTPClientConfiguration verifies HTTP client is properly configured
+func TestHTTPClientConfiguration(t *testing.T) {
+	t.Run("HTTP client has timeout set", func(t *testing.T) {
+		client := &http.Client{
+			Timeout: 30 * time.Second,
+		}
+
+		if client.Timeout == 0 {
+			t.Error("HTTP client should have timeout set")
+		}
+
+		if client.Timeout != 30*time.Second {
+			t.Errorf("HTTP client timeout = %v, expected 30s", client.Timeout)
+		}
+	})
+
+	t.Run("default HTTP client has no timeout", func(t *testing.T) {
+		defaultClient := &http.Client{}
+		if defaultClient.Timeout != 0 {
+			t.Error("Default HTTP client should have no timeout")
+		}
+	})
+}
diff --git a/src/semantic-router/cmd/vsr/commands/install.go b/src/semantic-router/cmd/vsr/commands/install.go
new file mode 100644
index 000000000..4e4f567b1
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/install.go
@@ -0,0 +1,212 @@
+package commands
+
+import (
+	"fmt"
+	"os"
+	"path/filepath"
+
+	"github.com/spf13/cobra"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+)
+
+// NewInstallCmd creates the install command
+func NewInstallCmd() *cobra.Command {
+	return &cobra.Command{
+		Use:   "install",
+		Short: "Install vLLM Semantic Router",
+		Long: `Guide for installing the router in your environment.
+
+This command detects your environment and provides installation instructions.`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			cli.Warning("Installation Guide")
+			fmt.Println("\nThe vsr CLI is already installed if you're running this command!")
+			fmt.Println("\nTo install globally on Linux/macOS:")
+			fmt.Println("  sudo cp bin/vsr /usr/local/bin/vsr")
+			fmt.Println("  sudo chmod +x /usr/local/bin/vsr")
+			fmt.Println("  # Or run: make install-cli")
+
+			fmt.Println("\nTo deploy the router:")
+			fmt.Println("  1. Initialize configuration: vsr init")
+			fmt.Println("  2. Edit your config: vsr config edit")
+			fmt.Println("  3. Deploy: vsr deploy [local|docker|kubernetes]")
+			fmt.Println("\nFor detailed installation guides, see:")
+			fmt.Println("  https://github.com/vllm-project/semantic-router/tree/main/website/docs/installation")
+			return nil
+		},
+	}
+}
+
+// NewInitCmd creates the init command
+func NewInitCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "init",
+		Short: "Initialize a new configuration file",
+		Long: `Create a new configuration file from a template.
+
+Available templates:
+  default  - Full-featured configuration with all options
+  minimal  - Minimal configuration to get started
+  full     - Comprehensive configuration with comments`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			output, _ := cmd.Flags().GetString("output")
+			template, _ := cmd.Flags().GetString("template")
+
+			return initializeConfig(output, template)
+		},
+	}
+
+	cmd.Flags().String("output", "config/config.yaml", "Output path for the configuration file")
+	cmd.Flags().String("template", "default", "Template to use: default, minimal, full")
+
+	return cmd
+}
+
+func initializeConfig(outputPath, template string) error {
+	// Create directory if it doesn't exist
+	dir := filepath.Dir(outputPath)
+	if err := os.MkdirAll(dir, 0o755); err != nil {
+		return fmt.Errorf("failed to create directory: %w", err)
+	}
+
+	// Check if file exists
+	if _, err := os.Stat(outputPath); err == nil {
+		return fmt.Errorf("config file already exists at %s (use --output to specify different path)", outputPath)
+	}
+
+	// Get template content
+	templateContent := getTemplate(template)
+
+	// Write to file
+	if err := os.WriteFile(outputPath, []byte(templateContent), 0o644); err != nil {
+		return fmt.Errorf("failed to write config: %w", err)
+	}
+
+	cli.Success(fmt.Sprintf("Created configuration file: %s", outputPath))
+	fmt.Println("\nNext steps:")
+	fmt.Println("  1. Edit the configuration: vsr config edit")
+	fmt.Println("  2. Validate your config: vsr config validate")
+	fmt.Println("  3. Deploy the router: vsr deploy docker")
+
+	return nil
+}
+
+func getTemplate(template string) string {
+	switch template {
+	case "minimal":
+		return minimalTemplate
+	case "full":
+		return fullTemplate
+	default:
+		return defaultTemplate
+	}
+}
+
+const defaultTemplate = `# vLLM Semantic Router Configuration
+
+# BERT model for semantic similarity
+bert_model:
+  model_id: sentence-transformers/all-MiniLM-L12-v2
+  threshold: 0.6
+  use_cpu: true
+
+# vLLM endpoints - your backend models
+vllm_endpoints:
+  - name: "endpoint1"
+    address: "127.0.0.1"
+    port: 11434
+    weight: 1
+
+# Model configuration
+model_config:
+  "your-model":
+    preferred_endpoints: ["endpoint1"]
+    pricing:
+      currency: "USD"
+      prompt_per_1m: 0.50
+      completion_per_1m: 1.50
+
+# Categories (Metadata)
+categories:
+- name: math
+  description: "Mathematics related queries"
+- name: coding
+  description: "Programming and code generation"
+
+# Routing Rules
+keyword_rules:
+- name: math_keywords
+  operator: "OR"
+  keywords: ["math", "calculus", "algebra"]
+
+# Routing Decisions
+decisions:
+- name: math_decision
+  description: "Route math queries to model"
+  priority: 10
+  rules:
+    operator: "AND"
+    conditions:
+    - type: "keyword"
+      name: "math_keywords"
+  modelRefs:
+  - model: your-model
+    use_reasoning: true
+
+default_model: your-model
+
+# Classification models
+classifier:
+  category_model:
+    model_id: "models/category_classifier_modernbert-base_model"
+    use_modernbert: true
+    threshold: 0.6
+    use_cpu: true
+  pii_model:
+    model_id: "models/pii_classifier_modernbert-base_presidio_token_model"
+    use_modernbert: true
+    threshold: 0.7
+    use_cpu: true
+
+# Security features (optional)
+prompt_guard:
+  enabled: false
+  use_modernbert: true
+  threshold: 0.7
+  use_cpu: true
+
+# Semantic caching (optional)
+semantic_cache:
+  enabled: false
+  backend_type: "memory"
+  similarity_threshold: 0.8
+  max_entries: 1000
+  ttl_seconds: 3600
+  eviction_policy: "fifo"
+`
+
+const minimalTemplate = `# Minimal vLLM Semantic Router Configuration
+
+bert_model:
+  model_id: sentence-transformers/all-MiniLM-L12-v2
+  threshold: 0.6
+  use_cpu: true
+
+vllm_endpoints:
+  - name: "endpoint1"
+    address: "127.0.0.1"
+    port: 11434
+    weight: 1
+
+model_config:
+  "your-model":
+    preferred_endpoints: ["endpoint1"]
+
+categories:
+- name: general
+  description: "General queries"
+
+default_model: your-model
+`
+
+const fullTemplate = defaultTemplate // For now, full is same as default
diff --git a/src/semantic-router/cmd/vsr/commands/install_test.go b/src/semantic-router/cmd/vsr/commands/install_test.go
new file mode 100644
index 000000000..4e3c376db
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/install_test.go
@@ -0,0 +1,353 @@
+package commands
+
+import (
+	"os"
+	"path/filepath"
+	"testing"
+
+	"github.com/spf13/cobra"
+)
+
+func TestInstallCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+	}{
+		{
+			name:          "install command has correct structure",
+			expectedUse:   "install",
+			expectedShort: "Install vLLM Semantic Router",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewInstallCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+		})
+	}
+}
+
+func TestInstallCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "install command runs",
+			args:      []string{"install"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			installCmd := NewInstallCmd()
+			rootCmd.AddCommand(installCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, err := rootCmd.ExecuteC()
+
+			if tt.wantError && err == nil {
+				t.Error("expected error, got nil")
+			}
+			if !tt.wantError && err != nil {
+				t.Errorf("unexpected error: %v", err)
+			}
+		})
+	}
+}
+
+func TestInitCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+		hasFlags      []string
+	}{
+		{
+			name:          "init command has correct structure",
+			expectedUse:   "init",
+			expectedShort: "Initialize a new configuration file",
+			hasFlags:      []string{"output", "template"},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewInitCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+
+			// Verify flags exist
+			for _, flagName := range tt.hasFlags {
+				if cmd.Flags().Lookup(flagName) == nil {
+					t.Errorf("expected flag %q not found", flagName)
+				}
+			}
+		})
+	}
+}
+
+func TestInitCommand(t *testing.T) {
+	tmpDir := t.TempDir()
+
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+		checkFile bool
+	}{
+		{
+			name:      "init with default template",
+			args:      []string{"init", "--output", filepath.Join(tmpDir, "test1.yaml")},
+			wantError: false,
+			checkFile: true,
+		},
+		{
+			name:      "init with minimal template",
+			args:      []string{"init", "--output", filepath.Join(tmpDir, "test2.yaml"), "--template", "minimal"},
+			wantError: false,
+			checkFile: true,
+		},
+		{
+			name:      "init with full template",
+			args:      []string{"init", "--output", filepath.Join(tmpDir, "test3.yaml"), "--template", "full"},
+			wantError: false,
+			checkFile: true,
+		},
+		{
+			name:      "init with custom output path",
+			args:      []string{"init", "--output", filepath.Join(tmpDir, "custom/config.yaml")},
+			wantError: false,
+			checkFile: true,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			initCmd := NewInitCmd()
+			rootCmd.AddCommand(initCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, err := rootCmd.ExecuteC()
+
+			if tt.wantError && err == nil {
+				t.Error("expected error, got nil")
+			}
+			if !tt.wantError && err != nil {
+				t.Errorf("unexpected error: %v", err)
+			}
+
+			// Check if file was created
+			if tt.checkFile && !tt.wantError {
+				outputPath := ""
+				for i, arg := range tt.args {
+					if arg == "--output" && i+1 < len(tt.args) {
+						outputPath = tt.args[i+1]
+						break
+					}
+				}
+				if outputPath != "" {
+					if _, err := os.Stat(outputPath); os.IsNotExist(err) {
+						t.Errorf("expected file to be created at %s", outputPath)
+					}
+				}
+			}
+		})
+	}
+}
+
+func TestInitCommandFileExists(t *testing.T) {
+	tmpDir := t.TempDir()
+	existingFile := filepath.Join(tmpDir, "existing.yaml")
+
+	// Create existing file
+	if err := os.WriteFile(existingFile, []byte("existing content"), 0o644); err != nil {
+		t.Fatalf("Failed to create test file: %v", err)
+	}
+
+	rootCmd := &cobra.Command{Use: "vsr"}
+	initCmd := NewInitCmd()
+	rootCmd.AddCommand(initCmd)
+
+	rootCmd.SetArgs([]string{"init", "--output", existingFile})
+	_, err := rootCmd.ExecuteC()
+
+	if err == nil {
+		t.Error("expected error when file exists, got nil")
+	}
+}
+
+func TestInitCommandFlags(t *testing.T) {
+	cmd := NewInitCmd()
+
+	tests := []struct {
+		flagName     string
+		flagType     string
+		defaultValue string
+	}{
+		{
+			flagName:     "output",
+			flagType:     "string",
+			defaultValue: "config/config.yaml",
+		},
+		{
+			flagName:     "template",
+			flagType:     "string",
+			defaultValue: "default",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("flag_"+tt.flagName, func(t *testing.T) {
+			flag := cmd.Flags().Lookup(tt.flagName)
+			if flag == nil {
+				t.Fatalf("flag %q not found", tt.flagName)
+			}
+
+			if flag.Value.Type() != tt.flagType {
+				t.Errorf("expected flag type %q, got %q", tt.flagType, flag.Value.Type())
+			}
+
+			if flag.DefValue != tt.defaultValue {
+				t.Errorf("expected default value %q, got %q", tt.defaultValue, flag.DefValue)
+			}
+		})
+	}
+}
+
+func TestGetTemplate(t *testing.T) {
+	tests := []struct {
+		name          string
+		template      string
+		shouldBeEmpty bool
+	}{
+		{
+			name:          "default template",
+			template:      "default",
+			shouldBeEmpty: false,
+		},
+		{
+			name:          "minimal template",
+			template:      "minimal",
+			shouldBeEmpty: false,
+		},
+		{
+			name:          "full template",
+			template:      "full",
+			shouldBeEmpty: false,
+		},
+		{
+			name:          "unknown template defaults to default",
+			template:      "unknown",
+			shouldBeEmpty: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			result := getTemplate(tt.template)
+
+			if tt.shouldBeEmpty && result != "" {
+				t.Error("expected empty template")
+			}
+			if !tt.shouldBeEmpty && result == "" {
+				t.Error("expected non-empty template")
+			}
+		})
+	}
+}
+
+func TestTemplateContent(t *testing.T) {
+	tests := []struct {
+		name             string
+		template         string
+		shouldContain    []string
+		shouldNotContain []string
+	}{
+		{
+			name:     "default template contains required fields",
+			template: "default",
+			shouldContain: []string{
+				"bert_model:",
+				"vllm_endpoints:",
+				"model_config:",
+				"categories:",
+				"default_model:",
+			},
+		},
+		{
+			name:     "minimal template contains minimal fields",
+			template: "minimal",
+			shouldContain: []string{
+				"bert_model:",
+				"vllm_endpoints:",
+				"default_model:",
+			},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			content := getTemplate(tt.template)
+
+			for _, substring := range tt.shouldContain {
+				if !containsString(content, substring) {
+					t.Errorf("template should contain %q", substring)
+				}
+			}
+
+			for _, substring := range tt.shouldNotContain {
+				if containsString(content, substring) {
+					t.Errorf("template should not contain %q", substring)
+				}
+			}
+		})
+	}
+}
+
+// Helper function to check if string contains substring
+func containsString(s, substr string) bool {
+	return len(s) >= len(substr) && findSubstring(s, substr) != -1
+}
+
+// Helper function to find substring index
+func findSubstring(s, substr string) int {
+	if len(substr) == 0 {
+		return 0
+	}
+	if len(substr) > len(s) {
+		return -1
+	}
+	for i := 0; i <= len(s)-len(substr); i++ {
+		match := true
+		for j := 0; j < len(substr); j++ {
+			if s[i+j] != substr[j] {
+				match = false
+				break
+			}
+		}
+		if match {
+			return i
+		}
+	}
+	return -1
+}
diff --git a/src/semantic-router/cmd/vsr/commands/model.go b/src/semantic-router/cmd/vsr/commands/model.go
new file mode 100644
index 000000000..661666b83
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/model.go
@@ -0,0 +1,407 @@
+package commands
+
+import (
+	"fmt"
+	"strings"
+
+	"github.com/spf13/cobra"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli/model"
+)
+
+// NewModelCmd creates the model command
+func NewModelCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "model",
+		Short: "Manage router models",
+		Long: `Manage models used by the vLLM Semantic Router.
+
+This includes downloading, listing, validating, and removing models.
+
+Examples:
+  # List all models
+  vsr model list
+
+  # Show detailed info about a model
+  vsr model info lora-intent-classifier
+
+  # Validate a model
+  vsr model validate lora-intent-classifier
+
+  # Validate all models
+  vsr model validate --all
+
+  # Remove a model
+  vsr model remove pii-classifier
+
+  # Download models (currently uses Makefile)
+  vsr model download`,
+	}
+
+	cmd.AddCommand(NewModelListCmd())
+	cmd.AddCommand(NewModelInfoCmd())
+	cmd.AddCommand(NewModelValidateCmd())
+	cmd.AddCommand(NewModelRemoveCmd())
+	cmd.AddCommand(NewModelDownloadCmd())
+
+	return cmd
+}
+
+// NewModelListCmd creates the model list command
+func NewModelListCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "list",
+		Short: "List all models",
+		Long: `List all models (both downloaded and configured).
+
+Shows model ID, name, type, size, and download status.
+
+Examples:
+  # List all models
+  vsr model list
+
+  # List only downloaded models
+  vsr model list --downloaded
+
+  # List with JSON output
+  vsr model list --output json`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			downloadedOnly, _ := cmd.Flags().GetBool("downloaded")
+			outputFormat := cmd.Parent().Parent().Flag("output").Value.String()
+
+			mgr := model.NewModelManager("./models")
+			models, err := mgr.ListModels()
+			if err != nil {
+				return fmt.Errorf("failed to list models: %w", err)
+			}
+
+			// Filter by downloaded if flag set
+			if downloadedOnly {
+				filtered := []model.ModelInfo{}
+				for _, m := range models {
+					if m.Downloaded {
+						filtered = append(filtered, m)
+					}
+				}
+				models = filtered
+			}
+
+			if len(models) == 0 {
+				cli.Warning("No models found")
+				cli.Info("Download models with: make download-models")
+				return nil
+			}
+
+			// Output based on format
+			switch outputFormat {
+			case "json":
+				// JSON output
+				fmt.Println("[")
+				for i, m := range models {
+					comma := ","
+					if i == len(models)-1 {
+						comma = ""
+					}
+					fmt.Printf("  {\"id\":\"%s\",\"name\":\"%s\",\"type\":\"%s\",\"purpose\":\"%s\",\"architecture\":\"%s\",\"size\":\"%s\",\"downloaded\":%t}%s\n",
+						m.ID, m.Name, m.Type, m.Purpose, m.Architecture, model.FormatSize(m.Size), m.Downloaded, comma)
+				}
+				fmt.Println("]")
+			case "yaml":
+				// YAML output
+				fmt.Println("models:")
+				for _, m := range models {
+					fmt.Printf("  - id: %s\n", m.ID)
+					fmt.Printf("    name: %s\n", m.Name)
+					fmt.Printf("    type: %s\n", m.Type)
+					fmt.Printf("    purpose: %s\n", m.Purpose)
+					fmt.Printf("    architecture: %s\n", m.Architecture)
+					fmt.Printf("    size: %s\n", model.FormatSize(m.Size))
+					fmt.Printf("    downloaded: %t\n", m.Downloaded)
+				}
+			default:
+				// Table output (default)
+				cli.Info("╔══════════════════════════════════════════════════════════════════════════╗")
+				cli.Info("║                            Available Models                              ║")
+				cli.Info("╠══════════════════════════════════════════════════════════════════════════╣")
+				cli.Info(fmt.Sprintf("║ %-30s %-12s %-10s %-12s ║", "Model ID", "Type", "Purpose", "Size"))
+				cli.Info("╠══════════════════════════════════════════════════════════════════════════╣")
+
+				for _, m := range models {
+					status := "✓"
+					if !m.Downloaded {
+						status = "✗"
+					}
+					cli.Info(fmt.Sprintf("║ %s %-28s %-12s %-10s %-12s ║",
+						status, m.ID, m.Type, m.Purpose, model.FormatSize(m.Size)))
+				}
+
+				cli.Info("╚══════════════════════════════════════════════════════════════════════════╝")
+
+				// Summary
+				downloadedCount := 0
+				for _, m := range models {
+					if m.Downloaded {
+						downloadedCount++
+					}
+				}
+				cli.Info(fmt.Sprintf("\nTotal: %d models (%d downloaded)", len(models), downloadedCount))
+			}
+
+			return nil
+		},
+	}
+
+	cmd.Flags().Bool("downloaded", false, "Show only downloaded models")
+
+	return cmd
+}
+
+// NewModelInfoCmd creates the model info command
+func NewModelInfoCmd() *cobra.Command {
+	return &cobra.Command{
+		Use:   "info <model-id>",
+		Short: "Show detailed model information",
+		Long: `Show detailed information about a specific model.
+
+Includes size, path, type, architecture, and purpose.
+
+Examples:
+  # Show info for a model
+  vsr model info lora-intent-classifier
+
+  # Show info with JSON output
+  vsr model info pii-classifier --output json`,
+		Args: cobra.ExactArgs(1),
+		RunE: func(cmd *cobra.Command, args []string) error {
+			modelID := args[0]
+			outputFormat := cmd.Parent().Parent().Flag("output").Value.String()
+
+			mgr := model.NewModelManager("./models")
+			modelInfo, err := mgr.GetModelInfo(modelID)
+			if err != nil {
+				return fmt.Errorf("failed to get model info: %w", err)
+			}
+
+			// Output based on format
+			switch outputFormat {
+			case "json":
+				fmt.Printf("{\"id\":\"%s\",\"name\":\"%s\",\"type\":\"%s\",\"purpose\":\"%s\",\"architecture\":\"%s\",\"path\":\"%s\",\"size\":\"%s\",\"size_bytes\":%d,\"downloaded\":%t}\n",
+					modelInfo.ID, modelInfo.Name, modelInfo.Type, modelInfo.Purpose, modelInfo.Architecture, modelInfo.Path, model.FormatSize(modelInfo.Size), modelInfo.Size, modelInfo.Downloaded)
+			case "yaml":
+				fmt.Printf("id: %s\n", modelInfo.ID)
+				fmt.Printf("name: %s\n", modelInfo.Name)
+				fmt.Printf("type: %s\n", modelInfo.Type)
+				fmt.Printf("purpose: %s\n", modelInfo.Purpose)
+				fmt.Printf("architecture: %s\n", modelInfo.Architecture)
+				fmt.Printf("path: %s\n", modelInfo.Path)
+				fmt.Printf("size: %s\n", model.FormatSize(modelInfo.Size))
+				fmt.Printf("size_bytes: %d\n", modelInfo.Size)
+				fmt.Printf("downloaded: %t\n", modelInfo.Downloaded)
+			default:
+				// Table output
+				cli.Info("╔══════════════════════════════════════════════════════════════════════════╗")
+				cli.Info(fmt.Sprintf("║ Model: %-65s║", modelInfo.Name))
+				cli.Info("╠══════════════════════════════════════════════════════════════════════════╣")
+				cli.Info(fmt.Sprintf("║ %-20s %-52s║", "ID:", modelInfo.ID))
+				cli.Info(fmt.Sprintf("║ %-20s %-52s║", "Type:", modelInfo.Type))
+				cli.Info(fmt.Sprintf("║ %-20s %-52s║", "Purpose:", modelInfo.Purpose))
+				cli.Info(fmt.Sprintf("║ %-20s %-52s║", "Architecture:", modelInfo.Architecture))
+				cli.Info(fmt.Sprintf("║ %-20s %-52s║", "Size:", model.FormatSize(modelInfo.Size)))
+				cli.Info(fmt.Sprintf("║ %-20s %-52t║", "Downloaded:", modelInfo.Downloaded))
+				cli.Info("╠══════════════════════════════════════════════════════════════════════════╣")
+				cli.Info(fmt.Sprintf("║ Path: %-66s║", truncateString(modelInfo.Path, 66)))
+				cli.Info("╚══════════════════════════════════════════════════════════════════════════╝")
+			}
+
+			return nil
+		},
+	}
+}
+
+// NewModelValidateCmd creates the model validate command
+func NewModelValidateCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "validate [model-id]",
+		Short: "Validate model integrity",
+		Long: `Validate that a model is properly downloaded and contains all required files.
+
+Checks for config.json and model weight files (pytorch_model.bin or model.safetensors).
+
+Examples:
+  # Validate a specific model
+  vsr model validate lora-intent-classifier
+
+  # Validate all models
+  vsr model validate --all`,
+		Args: cobra.MaximumNArgs(1),
+		RunE: func(cmd *cobra.Command, args []string) error {
+			validateAll, _ := cmd.Flags().GetBool("all")
+			mgr := model.NewModelManager("./models")
+
+			if validateAll {
+				// Validate all models
+				cli.Info("Validating all models...")
+				results, err := mgr.ValidateAllModels()
+				if err != nil {
+					return fmt.Errorf("failed to validate models: %w", err)
+				}
+
+				hasErrors := false
+				for modelID, validationErr := range results {
+					if validationErr != nil {
+						cli.Error(fmt.Sprintf("✗ %s: %v", modelID, validationErr))
+						hasErrors = true
+					} else {
+						cli.Success(fmt.Sprintf("✓ %s: valid", modelID))
+					}
+				}
+
+				if hasErrors {
+					return fmt.Errorf("some models failed validation")
+				}
+
+				cli.Success(fmt.Sprintf("\nAll %d models are valid", len(results)))
+				return nil
+			}
+
+			// Validate specific model
+			if len(args) == 0 {
+				return fmt.Errorf("model-id required (or use --all flag)")
+			}
+
+			modelID := args[0]
+			cli.Info(fmt.Sprintf("Validating model: %s", modelID))
+
+			if err := mgr.ValidateModel(modelID); err != nil {
+				cli.Error(fmt.Sprintf("Validation failed: %v", err))
+				return err
+			}
+
+			cli.Success(fmt.Sprintf("Model '%s' is valid", modelID))
+
+			// Show what was checked
+			cli.Info("\nChecked:")
+			cli.Info("  ✓ Directory exists")
+			cli.Info("  ✓ config.json present")
+			cli.Info("  ✓ Model weights present")
+
+			return nil
+		},
+	}
+
+	cmd.Flags().Bool("all", false, "Validate all models")
+
+	return cmd
+}
+
+// NewModelRemoveCmd creates the model remove command
+func NewModelRemoveCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "remove <model-id>",
+		Short: "Remove a downloaded model",
+		Long: `Delete a model from disk to free up space.
+
+Requires confirmation unless --force flag is used.
+
+Examples:
+  # Remove a model (with confirmation)
+  vsr model remove pii-classifier
+
+  # Remove without confirmation
+  vsr model remove pii-classifier --force`,
+		Args: cobra.ExactArgs(1),
+		RunE: func(cmd *cobra.Command, args []string) error {
+			modelID := args[0]
+			force, _ := cmd.Flags().GetBool("force")
+
+			mgr := model.NewModelManager("./models")
+
+			// Get model info first
+			modelInfo, err := mgr.GetModelInfo(modelID)
+			if err != nil {
+				return fmt.Errorf("failed to get model info: %w", err)
+			}
+
+			if !modelInfo.Downloaded {
+				return fmt.Errorf("model is not downloaded: %s", modelID)
+			}
+
+			// Show what will be removed
+			cli.Warning(fmt.Sprintf("This will remove model: %s", modelInfo.Name))
+			cli.Info(fmt.Sprintf("Path: %s", modelInfo.Path))
+			cli.Info(fmt.Sprintf("Size: %s", model.FormatSize(modelInfo.Size)))
+
+			// Confirmation prompt unless force flag is set
+			if !force {
+				fmt.Print("\nAre you sure? (y/N): ")
+				var response string
+				_, _ = fmt.Scanln(&response)
+				if response != "y" && response != "Y" {
+					cli.Info("Removal cancelled")
+					return nil
+				}
+			}
+
+			// Remove the model
+			cli.Info("Removing model...")
+			if err := mgr.RemoveModel(modelID); err != nil {
+				return fmt.Errorf("failed to remove model: %w", err)
+			}
+
+			cli.Success(fmt.Sprintf("Model '%s' removed successfully", modelID))
+			cli.Info(fmt.Sprintf("Freed %s of disk space", model.FormatSize(modelInfo.Size)))
+
+			return nil
+		},
+	}
+
+	cmd.Flags().Bool("force", false, "Skip confirmation prompt")
+
+	return cmd
+}
+
+// NewModelDownloadCmd creates the model download command
+func NewModelDownloadCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "download",
+		Short: "Download models",
+		Long: `Download models for the semantic router.
+
+Currently uses the Makefile 'download-models' command.
+Future versions will support direct HuggingFace downloads.
+
+Examples:
+  # Download all configured models
+  vsr model download
+
+  # Download with verbose output
+  vsr model download --verbose`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			cli.Info("Downloading models...")
+			cli.Warning("Model download currently uses 'make download-models'")
+			cli.Info("Please run: make download-models")
+
+			// In the future, this will implement direct downloads:
+			// mgr := model.NewModelManager("./models")
+			// progress := func(downloaded, total int64) {
+			//     percentage := float64(downloaded) / float64(total) * 100
+			//     cli.Info(fmt.Sprintf("Progress: %.1f%%", percentage))
+			// }
+			// return mgr.DownloadModel(modelID, progress)
+
+			return fmt.Errorf("direct model download not yet implemented")
+		},
+	}
+
+	return cmd
+}
+
+// truncateString truncates a string to maxLen characters
+func truncateString(s string, maxLen int) string {
+	if len(s) <= maxLen {
+		// Pad with spaces
+		return s + strings.Repeat(" ", maxLen-len(s))
+	}
+	return s[:maxLen-3] + "..."
+}
diff --git a/src/semantic-router/cmd/vsr/commands/model_test.go b/src/semantic-router/cmd/vsr/commands/model_test.go
new file mode 100644
index 000000000..1e6d3cba9
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/model_test.go
@@ -0,0 +1,342 @@
+package commands
+
+import (
+	"testing"
+
+	"github.com/spf13/cobra"
+)
+
+func TestModelCommandStructure(t *testing.T) {
+	tests := []struct {
+		name            string
+		expectedUse     string
+		expectedShort   string
+		subcommandCount int
+		subcommands     []string
+	}{
+		{
+			name:            "model command has correct structure",
+			expectedUse:     "model",
+			expectedShort:   "Manage semantic router models",
+			subcommandCount: 5,
+			subcommands:     []string{"list", "info", "validate", "remove", "download"},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewModelCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+
+			if len(cmd.Commands()) != tt.subcommandCount {
+				t.Errorf("expected %d subcommands, got %d", tt.subcommandCount, len(cmd.Commands()))
+			}
+
+			// Verify subcommands exist
+			for _, subcmd := range tt.subcommands {
+				found := false
+				for _, c := range cmd.Commands() {
+					if c.Use == subcmd || c.Name() == subcmd {
+						found = true
+						break
+					}
+				}
+				if !found {
+					t.Errorf("expected subcommand %q not found", subcmd)
+				}
+			}
+		})
+	}
+}
+
+func TestModelListCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "model list default",
+			args:      []string{"model", "list"},
+			wantError: false,
+		},
+		{
+			name:      "model list with downloaded flag",
+			args:      []string{"model", "list", "--downloaded"},
+			wantError: false,
+		},
+		{
+			name:      "model list with json output",
+			args:      []string{"model", "list", "-o", "json"},
+			wantError: false,
+		},
+		{
+			name:      "model list with yaml output",
+			args:      []string{"model", "list", "-o", "yaml"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("output", "o", "table", "Output format")
+
+			modelCmd := NewModelCmd()
+			rootCmd.AddCommand(modelCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestModelInfoCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "model info with model id",
+			args:      []string{"model", "info", "test-model"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			modelCmd := NewModelCmd()
+			rootCmd.AddCommand(modelCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestModelValidateCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "model validate specific model",
+			args:      []string{"model", "validate", "test-model"},
+			wantError: false,
+		},
+		{
+			name:      "model validate all models",
+			args:      []string{"model", "validate", "--all"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			modelCmd := NewModelCmd()
+			rootCmd.AddCommand(modelCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestModelRemoveCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "model remove with confirmation",
+			args:      []string{"model", "remove", "test-model", "--force"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			modelCmd := NewModelCmd()
+			rootCmd.AddCommand(modelCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestModelDownloadCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "model download",
+			args:      []string{"model", "download"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			modelCmd := NewModelCmd()
+			rootCmd.AddCommand(modelCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestModelListCommandFlags(t *testing.T) {
+	cmd := NewModelCmd()
+
+	// Find the list subcommand
+	var listCmd *cobra.Command
+	for _, c := range cmd.Commands() {
+		if c.Name() == "list" {
+			listCmd = c
+			break
+		}
+	}
+
+	if listCmd == nil {
+		t.Fatal("list subcommand not found")
+	}
+
+	tests := []struct {
+		flagName     string
+		flagType     string
+		defaultValue string
+	}{
+		{
+			flagName:     "downloaded",
+			flagType:     "bool",
+			defaultValue: "false",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("flag_"+tt.flagName, func(t *testing.T) {
+			flag := listCmd.Flags().Lookup(tt.flagName)
+			if flag == nil {
+				t.Fatalf("flag %q not found", tt.flagName)
+			}
+
+			if flag.Value.Type() != tt.flagType {
+				t.Errorf("expected flag type %q, got %q", tt.flagType, flag.Value.Type())
+			}
+
+			if flag.DefValue != tt.defaultValue {
+				t.Errorf("expected default value %q, got %q", tt.defaultValue, flag.DefValue)
+			}
+		})
+	}
+}
+
+func TestModelValidateCommandFlags(t *testing.T) {
+	cmd := NewModelCmd()
+
+	// Find the validate subcommand
+	var validateCmd *cobra.Command
+	for _, c := range cmd.Commands() {
+		if c.Name() == "validate" {
+			validateCmd = c
+			break
+		}
+	}
+
+	if validateCmd == nil {
+		t.Fatal("validate subcommand not found")
+	}
+
+	tests := []struct {
+		flagName     string
+		flagType     string
+		defaultValue string
+	}{
+		{
+			flagName:     "all",
+			flagType:     "bool",
+			defaultValue: "false",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("flag_"+tt.flagName, func(t *testing.T) {
+			flag := validateCmd.Flags().Lookup(tt.flagName)
+			if flag == nil {
+				t.Fatalf("flag %q not found", tt.flagName)
+			}
+
+			if flag.Value.Type() != tt.flagType {
+				t.Errorf("expected flag type %q, got %q", tt.flagType, flag.Value.Type())
+			}
+
+			if flag.DefValue != tt.defaultValue {
+				t.Errorf("expected default value %q, got %q", tt.defaultValue, flag.DefValue)
+			}
+		})
+	}
+}
+
+func TestModelRemoveCommandFlags(t *testing.T) {
+	cmd := NewModelCmd()
+
+	// Find the remove subcommand
+	var removeCmd *cobra.Command
+	for _, c := range cmd.Commands() {
+		if c.Name() == "remove" {
+			removeCmd = c
+			break
+		}
+	}
+
+	if removeCmd == nil {
+		t.Fatal("remove subcommand not found")
+	}
+
+	tests := []struct {
+		flagName     string
+		flagType     string
+		defaultValue string
+	}{
+		{
+			flagName:     "force",
+			flagType:     "bool",
+			defaultValue: "false",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("flag_"+tt.flagName, func(t *testing.T) {
+			flag := removeCmd.Flags().Lookup(tt.flagName)
+			if flag == nil {
+				t.Fatalf("flag %q not found", tt.flagName)
+			}
+
+			if flag.Value.Type() != tt.flagType {
+				t.Errorf("expected flag type %q, got %q", tt.flagType, flag.Value.Type())
+			}
+
+			if flag.DefValue != tt.defaultValue {
+				t.Errorf("expected default value %q, got %q", tt.defaultValue, flag.DefValue)
+			}
+		})
+	}
+}
diff --git a/src/semantic-router/cmd/vsr/commands/signal_handling_test.go b/src/semantic-router/cmd/vsr/commands/signal_handling_test.go
new file mode 100644
index 000000000..c142df436
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/signal_handling_test.go
@@ -0,0 +1,195 @@
+package commands
+
+import (
+	"os"
+	"os/exec"
+	"os/signal"
+	"syscall"
+	"testing"
+	"time"
+)
+
+// TestSignalHandling verifies graceful shutdown on interrupt signal
+func TestSignalHandling(t *testing.T) {
+	t.Run("signal channel receives interrupt", func(t *testing.T) {
+		// Create signal channel
+		sigChan := make(chan os.Signal, 1)
+		signal.Notify(sigChan, os.Interrupt, syscall.SIGTERM)
+		defer signal.Stop(sigChan)
+
+		// Simulate sending interrupt signal
+		go func() {
+			time.Sleep(100 * time.Millisecond)
+			sigChan <- os.Interrupt
+		}()
+
+		// Wait for signal with timeout
+		select {
+		case sig := <-sigChan:
+			if sig != os.Interrupt {
+				t.Errorf("Expected os.Interrupt, got %v", sig)
+			}
+		case <-time.After(1 * time.Second):
+			t.Error("Signal should be received within 1 second")
+		}
+	})
+
+	t.Run("process cleanup on signal", func(t *testing.T) {
+		// Start a dummy process (simulating port-forward)
+		cmd := exec.Command("sleep", "10")
+		if err := cmd.Start(); err != nil {
+			t.Fatalf("Failed to start process: %v", err)
+		}
+
+		pid := cmd.Process.Pid
+
+		// Create signal channel
+		sigChan := make(chan os.Signal, 1)
+		signal.Notify(sigChan, os.Interrupt, syscall.SIGTERM)
+		defer signal.Stop(sigChan)
+
+		// Simulate interrupt after short delay
+		go func() {
+			time.Sleep(100 * time.Millisecond)
+			sigChan <- os.Interrupt
+		}()
+
+		// Wait for signal and kill process gracefully
+		done := make(chan error)
+		go func() {
+			done <- cmd.Wait()
+		}()
+
+		select {
+		case <-sigChan:
+			// Kill the process
+			if cmd.Process != nil {
+				_ = cmd.Process.Kill()
+			}
+
+			// Wait a bit for process to die
+			time.Sleep(100 * time.Millisecond)
+
+			// Verify process is dead
+			if err := cmd.Process.Signal(syscall.Signal(0)); err == nil {
+				t.Errorf("Process %d should be terminated after signal", pid)
+			}
+
+		case err := <-done:
+			t.Logf("Process exited on its own: %v", err)
+		case <-time.After(2 * time.Second):
+			t.Error("Test timed out")
+			_ = cmd.Process.Kill()
+		}
+	})
+
+	t.Run("done channel receives process exit", func(t *testing.T) {
+		// Start a process that exits quickly
+		cmd := exec.Command("sleep", "0.1")
+		if err := cmd.Start(); err != nil {
+			t.Fatalf("Failed to start process: %v", err)
+		}
+
+		done := make(chan error)
+		go func() {
+			done <- cmd.Wait()
+		}()
+
+		// Wait for either signal or done
+		sigChan := make(chan os.Signal, 1)
+		signal.Notify(sigChan, os.Interrupt, syscall.SIGTERM)
+		defer signal.Stop(sigChan)
+
+		select {
+		case <-sigChan:
+			t.Error("Should not receive signal in this test")
+		case err := <-done:
+			// Process exited normally
+			if err != nil {
+				t.Logf("Process exited with: %v", err)
+			}
+		case <-time.After(1 * time.Second):
+			t.Error("Process should exit within 1 second")
+			_ = cmd.Process.Kill()
+		}
+	})
+}
+
+// TestGracefulShutdown verifies cleanup logic
+func TestGracefulShutdown(t *testing.T) {
+	t.Run("process is killed when signal received", func(t *testing.T) {
+		cmd := exec.Command("sleep", "30")
+		if err := cmd.Start(); err != nil {
+			t.Fatalf("Failed to start process: %v", err)
+		}
+
+		pid := cmd.Process.Pid
+		t.Logf("Started process with PID: %d", pid)
+
+		// Kill the process (simulating signal handler)
+		if err := cmd.Process.Kill(); err != nil {
+			t.Errorf("Failed to kill process: %v", err)
+		}
+
+		// Wait a bit
+		time.Sleep(100 * time.Millisecond)
+
+		// Verify process is gone
+		if err := cmd.Process.Signal(syscall.Signal(0)); err == nil {
+			t.Errorf("Process %d should be dead", pid)
+		}
+	})
+
+	t.Run("select statement with signal and done channels", func(t *testing.T) {
+		// This tests the select pattern used in dashboard command
+		sigChan := make(chan os.Signal, 1)
+		done := make(chan error)
+
+		// Simulate signal received first
+		go func() {
+			time.Sleep(50 * time.Millisecond)
+			sigChan <- os.Interrupt
+		}()
+
+		var receivedSignal bool
+		select {
+		case <-sigChan:
+			receivedSignal = true
+		case <-done:
+			t.Error("Should receive signal, not done")
+		case <-time.After(1 * time.Second):
+			t.Error("Should receive signal within 1 second")
+		}
+
+		if !receivedSignal {
+			t.Error("Signal should have been received")
+		}
+	})
+}
+
+// TestSignalNotification verifies signal notification setup
+func TestSignalNotification(t *testing.T) {
+	t.Run("can create and stop signal notification", func(t *testing.T) {
+		sigChan := make(chan os.Signal, 1)
+		signal.Notify(sigChan, os.Interrupt, syscall.SIGTERM)
+
+		// Stop notification
+		signal.Stop(sigChan)
+
+		// After stopping, signals should not be delivered
+		// This is more of a safety check
+		select {
+		case <-sigChan:
+			// May or may not receive depending on timing
+		case <-time.After(100 * time.Millisecond):
+			// Expected - no signal received
+		}
+	})
+
+	t.Run("signal channel has buffer size", func(t *testing.T) {
+		sigChan := make(chan os.Signal, 1)
+		if cap(sigChan) != 1 {
+			t.Errorf("Signal channel should have capacity 1, got %d", cap(sigChan))
+		}
+	})
+}
diff --git a/src/semantic-router/cmd/vsr/commands/status.go b/src/semantic-router/cmd/vsr/commands/status.go
new file mode 100644
index 000000000..82ee936c5
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/status.go
@@ -0,0 +1,91 @@
+package commands
+
+import (
+	"github.com/spf13/cobra"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli/deployment"
+)
+
+// NewStatusCmd creates the status command
+func NewStatusCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "status",
+		Short: "Check router and components status",
+		Long: `Display status information for the router and its components.
+
+This command auto-detects all running deployments (local, docker, kubernetes, helm)
+and displays their status, components, and endpoints.
+
+Examples:
+  # Check status in default namespace
+  vsr status
+
+  # Check status in specific namespace
+  vsr status --namespace production`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			namespace, _ := cmd.Flags().GetString("namespace")
+			return deployment.CheckStatus(namespace)
+		},
+	}
+
+	cmd.Flags().String("namespace", "default", "Kubernetes namespace to check")
+
+	return cmd
+}
+
+// NewLogsCmd creates the logs command
+func NewLogsCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "logs",
+		Short: "Fetch router logs",
+		Long: `Stream or fetch logs from the router service with auto-detection of deployment type.
+
+Supports filtering by component, time range, and pattern matching.
+
+Examples:
+  # Fetch last 100 lines (auto-detect deployment)
+  vsr logs
+
+  # Follow logs in real-time
+  vsr logs --follow
+
+  # Show logs from specific deployment type
+  vsr logs --env docker
+
+  # Filter by component
+  vsr logs --component router
+
+  # Show logs from specific namespace
+  vsr logs --namespace production
+
+  # Show logs since a time
+  vsr logs --since 10m
+
+  # Filter logs by pattern
+  vsr logs --grep error
+
+  # Combine options
+  vsr logs --follow --env kubernetes --namespace prod --component router --grep "ERROR"`,
+		RunE: func(cmd *cobra.Command, args []string) error {
+			follow, _ := cmd.Flags().GetBool("follow")
+			tail, _ := cmd.Flags().GetInt("tail")
+			namespace, _ := cmd.Flags().GetString("namespace")
+			deployType, _ := cmd.Flags().GetString("env")
+			component, _ := cmd.Flags().GetString("component")
+			since, _ := cmd.Flags().GetString("since")
+			grep, _ := cmd.Flags().GetString("grep")
+
+			return deployment.FetchLogs(follow, tail, namespace, deployType, component, since, grep)
+		},
+	}
+
+	cmd.Flags().BoolP("follow", "f", false, "Follow log output")
+	cmd.Flags().IntP("tail", "n", 100, "Number of lines to show from the end")
+	cmd.Flags().String("namespace", "default", "Kubernetes namespace (for K8s/Helm deployments)")
+	cmd.Flags().String("env", "", "Deployment type: local, docker, kubernetes, helm (auto-detect if empty)")
+	cmd.Flags().String("component", "", "Filter by component name (e.g., router, envoy, grafana)")
+	cmd.Flags().String("since", "", "Show logs since duration (e.g., 10m, 1h) or timestamp")
+	cmd.Flags().String("grep", "", "Filter logs by pattern (uses grep)")
+
+	return cmd
+}
diff --git a/src/semantic-router/cmd/vsr/commands/status_test.go b/src/semantic-router/cmd/vsr/commands/status_test.go
new file mode 100644
index 000000000..cc96ef81b
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/status_test.go
@@ -0,0 +1,309 @@
+package commands
+
+import (
+	"testing"
+
+	"github.com/spf13/cobra"
+)
+
+func TestStatusCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+		hasFlags      []string
+	}{
+		{
+			name:          "status command has correct structure",
+			expectedUse:   "status",
+			expectedShort: "Check router and components status",
+			hasFlags:      []string{"namespace"},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewStatusCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+
+			// Verify flags exist
+			for _, flagName := range tt.hasFlags {
+				if cmd.Flags().Lookup(flagName) == nil {
+					t.Errorf("expected flag %q not found", flagName)
+				}
+			}
+		})
+	}
+}
+
+func TestStatusCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "status with default namespace",
+			args:      []string{"status"},
+			wantError: false,
+		},
+		{
+			name:      "status with custom namespace",
+			args:      []string{"status", "--namespace", "production"},
+			wantError: false,
+		},
+		{
+			name:      "status with short namespace flag",
+			args:      []string{"status", "--namespace=test"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			statusCmd := NewStatusCmd()
+			rootCmd.AddCommand(statusCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestStatusCommandFlags(t *testing.T) {
+	cmd := NewStatusCmd()
+
+	tests := []struct {
+		flagName     string
+		flagType     string
+		defaultValue interface{}
+	}{
+		{
+			flagName:     "namespace",
+			flagType:     "string",
+			defaultValue: "default",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("flag_"+tt.flagName, func(t *testing.T) {
+			flag := cmd.Flags().Lookup(tt.flagName)
+			if flag == nil {
+				t.Fatalf("flag %q not found", tt.flagName)
+			}
+
+			if flag.Value.Type() != tt.flagType {
+				t.Errorf("expected flag type %q, got %q", tt.flagType, flag.Value.Type())
+			}
+
+			if flag.DefValue != tt.defaultValue.(string) {
+				t.Errorf("expected default value %q, got %q", tt.defaultValue, flag.DefValue)
+			}
+		})
+	}
+}
+
+func TestLogsCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+		hasFlags      []string
+	}{
+		{
+			name:          "logs command has correct structure",
+			expectedUse:   "logs",
+			expectedShort: "Fetch router logs",
+			hasFlags:      []string{"follow", "tail", "namespace", "env", "component", "since", "grep"},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewLogsCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+
+			// Verify flags exist
+			for _, flagName := range tt.hasFlags {
+				if cmd.Flags().Lookup(flagName) == nil {
+					t.Errorf("expected flag %q not found", flagName)
+				}
+			}
+		})
+	}
+}
+
+func TestLogsCommand(t *testing.T) {
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "logs with defaults",
+			args:      []string{"logs"},
+			wantError: false,
+		},
+		{
+			name:      "logs with follow flag",
+			args:      []string{"logs", "--follow"},
+			wantError: false,
+		},
+		{
+			name:      "logs with tail count",
+			args:      []string{"logs", "--tail", "50"},
+			wantError: false,
+		},
+		{
+			name:      "logs with namespace",
+			args:      []string{"logs", "--namespace", "production"},
+			wantError: false,
+		},
+		{
+			name:      "logs with env type",
+			args:      []string{"logs", "--env", "docker"},
+			wantError: false,
+		},
+		{
+			name:      "logs with component filter",
+			args:      []string{"logs", "--component", "router"},
+			wantError: false,
+		},
+		{
+			name:      "logs with since filter",
+			args:      []string{"logs", "--since", "10m"},
+			wantError: false,
+		},
+		{
+			name:      "logs with grep filter",
+			args:      []string{"logs", "--grep", "error"},
+			wantError: false,
+		},
+		{
+			name:      "logs with multiple flags",
+			args:      []string{"logs", "--follow", "--tail", "200", "--env", "kubernetes", "--namespace", "prod", "--component", "router", "--grep", "ERROR"},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			logsCmd := NewLogsCmd()
+			rootCmd.AddCommand(logsCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestLogsCommandFlags(t *testing.T) {
+	cmd := NewLogsCmd()
+
+	tests := []struct {
+		flagName     string
+		flagType     string
+		defaultValue interface{}
+	}{
+		{
+			flagName:     "follow",
+			flagType:     "bool",
+			defaultValue: "false",
+		},
+		{
+			flagName:     "tail",
+			flagType:     "int",
+			defaultValue: "100",
+		},
+		{
+			flagName:     "namespace",
+			flagType:     "string",
+			defaultValue: "default",
+		},
+		{
+			flagName:     "env",
+			flagType:     "string",
+			defaultValue: "",
+		},
+		{
+			flagName:     "component",
+			flagType:     "string",
+			defaultValue: "",
+		},
+		{
+			flagName:     "since",
+			flagType:     "string",
+			defaultValue: "",
+		},
+		{
+			flagName:     "grep",
+			flagType:     "string",
+			defaultValue: "",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("flag_"+tt.flagName, func(t *testing.T) {
+			flag := cmd.Flags().Lookup(tt.flagName)
+			if flag == nil {
+				t.Fatalf("flag %q not found", tt.flagName)
+			}
+
+			if flag.Value.Type() != tt.flagType {
+				t.Errorf("expected flag type %q, got %q", tt.flagType, flag.Value.Type())
+			}
+
+			if flag.DefValue != tt.defaultValue.(string) {
+				t.Errorf("expected default value %q, got %q", tt.defaultValue, flag.DefValue)
+			}
+		})
+	}
+}
+
+func TestLogsCommandShortFlags(t *testing.T) {
+	cmd := NewLogsCmd()
+
+	// Test short flags
+	tests := []struct {
+		shortFlag string
+		longFlag  string
+	}{
+		{
+			shortFlag: "f",
+			longFlag:  "follow",
+		},
+		{
+			shortFlag: "n",
+			longFlag:  "tail",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("short_flag_"+tt.shortFlag, func(t *testing.T) {
+			shortFlag := cmd.Flags().ShorthandLookup(tt.shortFlag)
+			if shortFlag == nil {
+				t.Fatalf("short flag %q not found", tt.shortFlag)
+			}
+
+			if shortFlag.Name != tt.longFlag {
+				t.Errorf("expected short flag %q to map to %q, got %q", tt.shortFlag, tt.longFlag, shortFlag.Name)
+			}
+		})
+	}
+}
diff --git a/src/semantic-router/cmd/vsr/commands/test.go b/src/semantic-router/cmd/vsr/commands/test.go
new file mode 100644
index 000000000..ec80b0547
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/test.go
@@ -0,0 +1,129 @@
+package commands
+
+import (
+	"bytes"
+	"encoding/json"
+	"fmt"
+	"net/http"
+	"strings"
+	"time"
+
+	"github.com/spf13/cobra"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+)
+
+// NewTestCmd creates the test command
+func NewTestCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "test-prompt [text]",
+		Short: "Send a test prompt to the router",
+		Long: `Test the router by sending a prompt for classification.
+
+This command sends your prompt to the router's classification API and displays:
+  - Detected category
+  - Model routing decision
+  - PII detection results
+  - Jailbreak protection status
+
+Example:
+  vsr test-prompt "Solve x^2 + 5x + 6 = 0"`,
+		Args: cobra.MinimumNArgs(1),
+		RunE: func(cmd *cobra.Command, args []string) error {
+			prompt := strings.Join(args, " ")
+
+			// Input validation: 10k byte limit (approximates character limit for ASCII)
+			// NOTE: len() counts bytes, not Unicode characters. For non-ASCII text with
+			// multi-byte UTF-8 characters, this may trigger on fewer than 10k characters.
+			// Known limitation. Future enhancement: use utf8.RuneCountInString(prompt).
+			if len(prompt) > 10000 {
+				return fmt.Errorf("prompt too long (max 10000 bytes, got %d)", len(prompt))
+			}
+
+			endpoint, _ := cmd.Flags().GetString("endpoint")
+			outputFormat := cmd.Parent().Flag("output").Value.String()
+
+			result, err := callClassificationAPI(endpoint, prompt)
+			if err != nil {
+				return fmt.Errorf("failed to classify prompt: %w", err)
+			}
+
+			return displayTestResult(result, outputFormat)
+		},
+	}
+
+	cmd.Flags().String("endpoint", "http://localhost:8080", "Router API endpoint")
+
+	return cmd
+}
+
+type ClassificationResult struct {
+	Classification struct {
+		Category   string  `json:"category"`
+		Confidence float64 `json:"confidence"`
+	} `json:"classification"`
+	RecommendedModel string `json:"recommended_model"`
+}
+
+func callClassificationAPI(endpoint, prompt string) (*ClassificationResult, error) {
+	// Prepare request
+	reqBody := map[string]string{
+		"text": prompt,
+	}
+	jsonData, err := json.Marshal(reqBody)
+	if err != nil {
+		return nil, err
+	}
+
+	// Issue #4: Add HTTP timeout to prevent hanging
+	client := &http.Client{
+		Timeout: 30 * time.Second,
+	}
+
+	// Make HTTP request
+	resp, err := client.Post(
+		fmt.Sprintf("%s/api/v1/classify/intent", endpoint),
+		"application/json",
+		bytes.NewBuffer(jsonData),
+	)
+	if err != nil {
+		return nil, err
+	}
+	defer resp.Body.Close()
+
+	if resp.StatusCode != http.StatusOK {
+		return nil, fmt.Errorf("API returned status %d", resp.StatusCode)
+	}
+
+	// Parse response
+	var result ClassificationResult
+	if err := json.NewDecoder(resp.Body).Decode(&result); err != nil {
+		return nil, err
+	}
+
+	return &result, nil
+}
+
+func displayTestResult(result *ClassificationResult, format string) error {
+	switch format {
+	case "json":
+		return cli.PrintJSON(result)
+	case "yaml":
+		return cli.PrintYAML(result)
+	}
+
+	// Table format
+	fmt.Println("\nTest Results:")
+	fmt.Println("━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━")
+	fmt.Printf("Category:    %s\n", result.Classification.Category)
+	fmt.Printf("Model:       %s\n", result.RecommendedModel)
+	fmt.Printf("Confidence:  %.2f\n", result.Classification.Confidence)
+
+	// PII and Jailbreak are not part of the intent response
+	cli.Success("PII Check: Not performed in intent classification")
+	cli.Success("Jailbreak Check: Not performed in intent classification")
+
+	fmt.Println("━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━")
+
+	return nil
+}
diff --git a/src/semantic-router/cmd/vsr/commands/test_test.go b/src/semantic-router/cmd/vsr/commands/test_test.go
new file mode 100644
index 000000000..a5fd1e423
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/test_test.go
@@ -0,0 +1,256 @@
+package commands
+
+import (
+	"encoding/json"
+	"net/http"
+	"net/http/httptest"
+	"testing"
+
+	"github.com/spf13/cobra"
+)
+
+func TestTestCommandStructure(t *testing.T) {
+	tests := []struct {
+		name          string
+		expectedUse   string
+		expectedShort string
+		hasFlags      []string
+	}{
+		{
+			name:          "test-prompt command has correct structure",
+			expectedUse:   "test-prompt [text]",
+			expectedShort: "Send a test prompt to the router",
+			hasFlags:      []string{"endpoint"},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewTestCmd()
+
+			if cmd.Use != tt.expectedUse {
+				t.Errorf("expected Use %q, got %q", tt.expectedUse, cmd.Use)
+			}
+
+			if cmd.Short != tt.expectedShort {
+				t.Errorf("expected Short %q, got %q", tt.expectedShort, cmd.Short)
+			}
+
+			// Verify flags exist
+			for _, flagName := range tt.hasFlags {
+				if cmd.Flags().Lookup(flagName) == nil {
+					t.Errorf("expected flag %q not found", flagName)
+				}
+			}
+		})
+	}
+}
+
+func TestTestCommandFlags(t *testing.T) {
+	cmd := NewTestCmd()
+
+	tests := []struct {
+		flagName     string
+		flagType     string
+		defaultValue string
+	}{
+		{
+			flagName:     "endpoint",
+			flagType:     "string",
+			defaultValue: "http://localhost:8080",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run("flag_"+tt.flagName, func(t *testing.T) {
+			flag := cmd.Flags().Lookup(tt.flagName)
+			if flag == nil {
+				t.Fatalf("flag %q not found", tt.flagName)
+			}
+
+			if flag.Value.Type() != tt.flagType {
+				t.Errorf("expected flag type %q, got %q", tt.flagType, flag.Value.Type())
+			}
+
+			if flag.DefValue != tt.defaultValue {
+				t.Errorf("expected default value %q, got %q", tt.defaultValue, flag.DefValue)
+			}
+		})
+	}
+}
+
+func TestCallClassificationAPI(t *testing.T) {
+	tests := []struct {
+		name           string
+		prompt         string
+		mockResponse   ClassificationResult
+		mockStatusCode int
+		wantError      bool
+	}{
+		{
+			name:   "successful classification",
+			prompt: "test prompt",
+			mockResponse: ClassificationResult{
+				Classification: struct {
+					Category   string  `json:"category"`
+					Confidence float64 `json:"confidence"`
+				}{
+					Category:   "math",
+					Confidence: 0.95,
+				},
+				RecommendedModel: "test-model",
+			},
+			mockStatusCode: http.StatusOK,
+			wantError:      false,
+		},
+		{
+			name:           "API error",
+			prompt:         "test prompt",
+			mockResponse:   ClassificationResult{},
+			mockStatusCode: http.StatusInternalServerError,
+			wantError:      true,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			// Create mock server
+			server := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
+				if r.URL.Path != "/api/v1/classify/intent" {
+					t.Errorf("unexpected path: %s", r.URL.Path)
+				}
+
+				w.WriteHeader(tt.mockStatusCode)
+				if tt.mockStatusCode == http.StatusOK {
+					_ = json.NewEncoder(w).Encode(tt.mockResponse)
+				}
+			}))
+			defer server.Close()
+
+			result, err := callClassificationAPI(server.URL, tt.prompt)
+
+			if tt.wantError && err == nil {
+				t.Error("expected error, got nil")
+			}
+			if !tt.wantError && err != nil {
+				t.Errorf("unexpected error: %v", err)
+			}
+
+			if !tt.wantError && result != nil {
+				if result.Classification.Category != tt.mockResponse.Classification.Category {
+					t.Errorf("expected category %q, got %q", tt.mockResponse.Classification.Category, result.Classification.Category)
+				}
+			}
+		})
+	}
+}
+
+func TestDisplayTestResult(t *testing.T) {
+	result := &ClassificationResult{
+		Classification: struct {
+			Category   string  `json:"category"`
+			Confidence float64 `json:"confidence"`
+		}{
+			Category:   "math",
+			Confidence: 0.95,
+		},
+		RecommendedModel: "test-model",
+	}
+
+	tests := []struct {
+		name      string
+		format    string
+		wantError bool
+	}{
+		{
+			name:      "table format",
+			format:    "table",
+			wantError: false,
+		},
+		{
+			name:      "json format",
+			format:    "json",
+			wantError: false,
+		},
+		{
+			name:      "yaml format",
+			format:    "yaml",
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			err := displayTestResult(result, tt.format)
+
+			if tt.wantError && err == nil {
+				t.Error("expected error, got nil")
+			}
+			if !tt.wantError && err != nil {
+				t.Errorf("unexpected error: %v", err)
+			}
+		})
+	}
+}
+
+func TestTestCommand(t *testing.T) {
+	// Create mock server
+	server := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
+		result := ClassificationResult{
+			Classification: struct {
+				Category   string  `json:"category"`
+				Confidence float64 `json:"confidence"`
+			}{
+				Category:   "test",
+				Confidence: 0.9,
+			},
+			RecommendedModel: "test-model",
+		}
+		w.WriteHeader(http.StatusOK)
+		_ = json.NewEncoder(w).Encode(result)
+	}))
+	defer server.Close()
+
+	tests := []struct {
+		name      string
+		args      []string
+		wantError bool
+	}{
+		{
+			name:      "test with prompt",
+			args:      []string{"test-prompt", "test prompt", "--endpoint", server.URL},
+			wantError: false,
+		},
+		{
+			name:      "test with multiple word prompt",
+			args:      []string{"test-prompt", "solve", "x^2", "+", "5x", "+", "6", "--endpoint", server.URL},
+			wantError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("output", "o", "table", "Output format")
+
+			testCmd := NewTestCmd()
+			rootCmd.AddCommand(testCmd)
+
+			rootCmd.SetArgs(tt.args)
+			_, _ = rootCmd.ExecuteC() // Ignore error, just testing command structure
+		})
+	}
+}
+
+func TestTestCommandRequiresArgs(t *testing.T) {
+	rootCmd := &cobra.Command{Use: "vsr"}
+	testCmd := NewTestCmd()
+	rootCmd.AddCommand(testCmd)
+
+	rootCmd.SetArgs([]string{"test-prompt"})
+	_, err := rootCmd.ExecuteC()
+
+	if err == nil {
+		t.Error("expected error when no prompt provided, got nil")
+	}
+}
diff --git a/src/semantic-router/cmd/vsr/commands/upgrade.go b/src/semantic-router/cmd/vsr/commands/upgrade.go
new file mode 100644
index 000000000..6f118cbfc
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/upgrade.go
@@ -0,0 +1,100 @@
+package commands
+
+import (
+	"fmt"
+	"time"
+
+	"github.com/spf13/cobra"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli/deployment"
+)
+
+// NewUpgradeCmd creates the upgrade command
+func NewUpgradeCmd() *cobra.Command {
+	cmd := &cobra.Command{
+		Use:   "upgrade [local|docker|kubernetes|helm]",
+		Short: "Upgrade router deployment to latest version",
+		Long: `Upgrade the vLLM Semantic Router deployment to the latest version.
+
+This command performs a rolling upgrade with minimal downtime:
+  - local:      Rebuild binary and gracefully restart
+  - docker:     Pull latest images and recreate containers
+  - kubernetes: Apply updated manifests and rolling restart
+  - helm:       Upgrade Helm release with latest chart
+
+Examples:
+  # Upgrade local deployment
+  vsr upgrade local
+
+  # Upgrade Docker deployment
+  vsr upgrade docker
+
+  # Upgrade Docker with observability
+  vsr upgrade docker --with-observability
+
+  # Upgrade Kubernetes deployment
+  vsr upgrade kubernetes
+
+  # Upgrade Kubernetes in specific namespace with wait
+  vsr upgrade kubernetes --namespace production --wait
+
+  # Force upgrade without confirmation
+  vsr upgrade docker --force
+
+  # Upgrade with custom timeout
+  vsr upgrade kubernetes --timeout 10m`,
+		Args: cobra.ExactArgs(1),
+		RunE: func(cmd *cobra.Command, args []string) error {
+			env := args[0]
+			configPath := cmd.Parent().Flag("config").Value.String()
+			withObs, _ := cmd.Flags().GetBool("with-observability")
+			namespace, _ := cmd.Flags().GetString("namespace")
+			releaseName, _ := cmd.Flags().GetString("release-name")
+			force, _ := cmd.Flags().GetBool("force")
+			wait, _ := cmd.Flags().GetBool("wait")
+			timeoutStr, _ := cmd.Flags().GetString("timeout")
+
+			// Parse timeout
+			timeout, err := time.ParseDuration(timeoutStr)
+			if err != nil {
+				return fmt.Errorf("invalid timeout format: %s (use format like '5m', '300s')", timeoutStr)
+			}
+
+			// Confirmation prompt unless force flag is set
+			if !force {
+				cli.Warning(fmt.Sprintf("This will upgrade the %s deployment", env))
+				cli.Info("The router will be temporarily unavailable during the upgrade")
+				fmt.Print("Continue? (y/N): ")
+				var response string
+				_, _ = fmt.Scanln(&response)
+				if response != "y" && response != "Y" {
+					cli.Info("Upgrade cancelled")
+					return nil
+				}
+			}
+
+			switch env {
+			case "local":
+				return deployment.UpgradeLocal(configPath)
+			case "docker":
+				return deployment.UpgradeDocker(configPath, withObs)
+			case "kubernetes":
+				return deployment.UpgradeKubernetes(configPath, namespace, int(timeout.Seconds()), wait)
+			case "helm":
+				return deployment.UpgradeHelmRelease(configPath, namespace, releaseName, int(timeout.Seconds()))
+			default:
+				return fmt.Errorf("unknown environment: %s", env)
+			}
+		},
+	}
+
+	cmd.Flags().Bool("with-observability", true, "Include observability stack (Docker only)")
+	cmd.Flags().String("namespace", "default", "Kubernetes namespace (Kubernetes/Helm only)")
+	cmd.Flags().String("release-name", "", "Helm release name (default: semantic-router)")
+	cmd.Flags().Bool("force", false, "Skip confirmation prompt")
+	cmd.Flags().Bool("wait", false, "Wait for upgrade to complete (Kubernetes/Helm only)")
+	cmd.Flags().String("timeout", "5m", "Timeout for upgrade operation (e.g., '5m', '300s')")
+
+	return cmd
+}
diff --git a/src/semantic-router/cmd/vsr/commands/upgrade_test.go b/src/semantic-router/cmd/vsr/commands/upgrade_test.go
new file mode 100644
index 000000000..08674056e
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/commands/upgrade_test.go
@@ -0,0 +1,265 @@
+package commands
+
+import (
+	"bytes"
+	"strings"
+	"testing"
+
+	"github.com/spf13/cobra"
+)
+
+func TestUpgradeCommand(t *testing.T) {
+	tests := []struct {
+		name        string
+		args        []string
+		expectError bool
+		errorMsg    string
+	}{
+		{
+			name:        "upgrade without environment",
+			args:        []string{},
+			expectError: true,
+			errorMsg:    "accepts 1 arg(s)",
+		},
+		{
+			name:        "upgrade local",
+			args:        []string{"local"},
+			expectError: false,
+		},
+		{
+			name:        "upgrade docker",
+			args:        []string{"docker"},
+			expectError: false,
+		},
+		{
+			name:        "upgrade kubernetes",
+			args:        []string{"kubernetes"},
+			expectError: false,
+		},
+		{
+			name:        "upgrade helm",
+			args:        []string{"helm"},
+			expectError: false,
+		},
+		{
+			name:        "upgrade with too many args",
+			args:        []string{"docker", "extra"},
+			expectError: true,
+			errorMsg:    "accepts 1 arg(s)",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewUpgradeCmd()
+
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config.yaml", "config file")
+			rootCmd.AddCommand(cmd)
+
+			rootCmd.SetArgs(append([]string{"upgrade"}, tt.args...))
+
+			buf := new(bytes.Buffer)
+			rootCmd.SetOut(buf)
+			rootCmd.SetErr(buf)
+
+			err := rootCmd.Execute()
+
+			if tt.expectError {
+				if err == nil {
+					t.Errorf("Expected error but got none")
+				} else if tt.errorMsg != "" && !strings.Contains(err.Error(), tt.errorMsg) {
+					t.Errorf("Expected error containing %q, got %q", tt.errorMsg, err.Error())
+				}
+			}
+		})
+	}
+}
+
+func TestUpgradeCommandFlags(t *testing.T) {
+	tests := []struct {
+		name          string
+		args          []string
+		expectedFlags map[string]string
+	}{
+		{
+			name: "default flags",
+			args: []string{"docker"},
+			expectedFlags: map[string]string{
+				"with-observability": "true",
+				"namespace":          "default",
+				"force":              "false",
+				"wait":               "false",
+				"timeout":            "5m",
+			},
+		},
+		{
+			name: "with force flag",
+			args: []string{"docker", "--force"},
+			expectedFlags: map[string]string{
+				"force": "true",
+			},
+		},
+		{
+			name: "with wait flag",
+			args: []string{"kubernetes", "--wait"},
+			expectedFlags: map[string]string{
+				"wait": "true",
+			},
+		},
+		{
+			name: "with custom timeout",
+			args: []string{"kubernetes", "--timeout", "10m"},
+			expectedFlags: map[string]string{
+				"timeout": "10m",
+			},
+		},
+		{
+			name: "with custom namespace",
+			args: []string{"kubernetes", "--namespace", "production"},
+			expectedFlags: map[string]string{
+				"namespace": "production",
+			},
+		},
+		{
+			name: "without observability",
+			args: []string{"docker", "--with-observability=false"},
+			expectedFlags: map[string]string{
+				"with-observability": "false",
+			},
+		},
+		{
+			name: "kubernetes with all options",
+			args: []string{"kubernetes", "--namespace", "prod", "--wait", "--timeout", "15m", "--force"},
+			expectedFlags: map[string]string{
+				"namespace": "prod",
+				"wait":      "true",
+				"timeout":   "15m",
+				"force":     "true",
+			},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewUpgradeCmd()
+
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config.yaml", "config file")
+			rootCmd.AddCommand(cmd)
+
+			fullArgs := append([]string{"upgrade"}, tt.args...)
+			rootCmd.SetArgs(fullArgs)
+
+			// Parse command
+			_, err := rootCmd.ExecuteC()
+			_ = err // Ignore execution errors
+
+			for flagName, expectedValue := range tt.expectedFlags {
+				flag := cmd.Flags().Lookup(flagName)
+				if flag == nil {
+					t.Errorf("Flag %q not found", flagName)
+					continue
+				}
+				if flag.Value.String() != expectedValue {
+					t.Errorf("Flag %q: expected %q, got %q", flagName, expectedValue, flag.Value.String())
+				}
+			}
+		})
+	}
+}
+
+func TestUpgradeCommandTimeoutParsing(t *testing.T) {
+	tests := []struct {
+		name        string
+		timeout     string
+		expectError bool
+	}{
+		{
+			name:        "valid timeout - minutes",
+			timeout:     "5m",
+			expectError: false,
+		},
+		{
+			name:        "valid timeout - seconds",
+			timeout:     "300s",
+			expectError: false,
+		},
+		{
+			name:        "valid timeout - hours",
+			timeout:     "1h",
+			expectError: false,
+		},
+		{
+			name:        "invalid timeout - no unit",
+			timeout:     "300",
+			expectError: true,
+		},
+		{
+			name:        "invalid timeout - wrong unit",
+			timeout:     "5x",
+			expectError: true,
+		},
+		{
+			name:        "invalid timeout - empty",
+			timeout:     "",
+			expectError: true,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			cmd := NewUpgradeCmd()
+
+			rootCmd := &cobra.Command{Use: "vsr"}
+			rootCmd.PersistentFlags().StringP("config", "c", "config.yaml", "config file")
+			rootCmd.AddCommand(cmd)
+
+			// Force flag to skip confirmation
+			args := []string{"upgrade", "docker", "--force", "--timeout", tt.timeout}
+			rootCmd.SetArgs(args)
+
+			buf := new(bytes.Buffer)
+			rootCmd.SetOut(buf)
+			rootCmd.SetErr(buf)
+
+			err := rootCmd.Execute()
+
+			if tt.expectError {
+				if err == nil {
+					t.Errorf("Expected error for timeout %q but got none", tt.timeout)
+				} else if !strings.Contains(err.Error(), "invalid timeout") {
+					t.Errorf("Expected 'invalid timeout' error, got: %v", err)
+				}
+			}
+		})
+	}
+}
+
+func TestUpgradeCommandHelp(t *testing.T) {
+	cmd := NewUpgradeCmd()
+
+	if cmd.Use != "upgrade [local|docker|kubernetes|helm]" {
+		t.Errorf("Expected Use to include environment options, got: %s", cmd.Use)
+	}
+
+	if cmd.Short == "" {
+		t.Error("Short description should not be empty")
+	}
+
+	if cmd.Long == "" {
+		t.Error("Long description should not be empty")
+	}
+
+	// Check that Long contains examples
+	if !strings.Contains(cmd.Long, "Examples:") {
+		t.Error("Long description should contain examples")
+	}
+
+	// Check that all environments are mentioned
+	for _, env := range []string{"local", "docker", "kubernetes", "helm"} {
+		if !strings.Contains(cmd.Long, env) {
+			t.Errorf("Long description should mention %s environment", env)
+		}
+	}
+}
diff --git a/src/semantic-router/cmd/vsr/config/config.yaml b/src/semantic-router/cmd/vsr/config/config.yaml
new file mode 100644
index 000000000..cd319a815
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/config/config.yaml
@@ -0,0 +1,132 @@
+# vLLM Semantic Router Configuration
+
+# BERT model for semantic similarity
+bert_model:
+  model_id: sentence-transformers/all-MiniLM-L12-v2
+  threshold: 0.6
+  use_cpu: true
+
+# vLLM endpoints - your backend models
+vllm_endpoints:
+  - name: "endpoint1"
+    address: "127.0.0.1"
+    port: 11434
+    weight: 1
+
+# Model configuration
+model_config:
+  "your-model":
+    preferred_endpoints: ["endpoint1"]
+    pricing:
+      currency: "USD"
+      prompt_per_1m: 0.50
+      completion_per_1m: 1.50
+
+# Categories (Metadata)
+  categories:
+    - name: math
+      description: "Mathematics related queries"
+      model_scores:
+        - model: "gpt-4"
+          score: 0.95
+        - model: "claude-3"
+          score: 0.90
+        - model: "llama-70b"
+          score: 0.85
+        - model: "your-model"
+          score: 0.80
+    - name: coding
+      description: "Programming and code generation"
+      model_scores:
+        - model: "gpt-4"
+          score: 0.92
+        - model: "claude-3"
+          score: 0.94
+        - model: "llama-70b"
+          score: 0.88
+        - model: "your-model"
+          score: 0.75
+    - name: business
+      description: "Business analysis and strategy queries"
+      model_scores:
+        - model: "gpt-4"
+          score: 0.90
+        - model: "claude-3"
+          score: 0.92
+        - model: "llama-70b"
+          score: 0.78
+        - model: "your-model"
+          score: 0.85
+    - name: creative
+      description: "Creative writing and content generation"
+      model_scores:
+        - model: "gpt-4"
+          score: 0.88
+        - model: "claude-3"
+          score: 0.95
+        - model: "llama-70b"
+          score: 0.82
+        - model: "your-model"
+          score: 0.70
+    - name: general
+      description: "General knowledge and conversation"
+      model_scores:
+        - model: "gpt-4"
+          score: 0.85
+        - model: "claude-3"
+          score: 0.85
+        - model: "llama-70b"
+          score: 0.80
+        - model: "your-model"
+          score: 0.80
+
+# Routing Rules
+  keyword_rules:
+    - name: math_keywords
+      operator: "OR"
+      keywords: ["math", "calculus", "algebra"]
+
+# Routing Decisions
+  decisions:
+    - name: math_decision
+      description: "Route math queries to model"
+      priority: 10
+      rules:
+        operator: "AND"
+        conditions:
+          - type: "keyword"
+            name: "math_keywords"
+      modelRefs:
+        - model: your-model
+          use_reasoning: true
+
+  default_model: your-model
+
+# Classification models
+  classifier:
+    category_model:
+      use_modernbert: true
+      threshold: 0.6
+      use_cpu: true
+      category_mapping_path: "src/semantic-router/cmd/vsr/config/category_mapping.json"
+    pii_model:
+      model_id: "models/pii_classifier_modernbert-base_presidio_token_model"
+      use_modernbert: true
+      threshold: 0.7
+      use_cpu: true
+
+# Security features (optional)
+prompt_guard:
+  enabled: false
+  use_modernbert: true
+  threshold: 0.7
+  use_cpu: true
+
+# Semantic caching (optional)
+semantic_cache:
+  enabled: false
+  backend_type: "memory"
+  similarity_threshold: 0.8
+  max_entries: 1000
+  ttl_seconds: 3600
+  eviction_policy: "fifo"
diff --git a/src/semantic-router/cmd/vsr/main.go b/src/semantic-router/cmd/vsr/main.go
new file mode 100644
index 000000000..f4d6b8364
--- /dev/null
+++ b/src/semantic-router/cmd/vsr/main.go
@@ -0,0 +1,74 @@
+package main
+
+import (
+	"fmt"
+	"os"
+
+	"github.com/spf13/cobra"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/cmd/vsr/commands"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/observability/logging"
+)
+
+var (
+	// Version information (set by build flags)
+	version   = "dev"
+	gitCommit = "unknown"
+	buildDate = "unknown"
+)
+
+func main() {
+	// Initialize logging
+	if _, err := logging.InitLoggerFromEnv(); err != nil {
+		fmt.Fprintf(os.Stderr, "Failed to initialize logger: %v\n", err)
+	}
+
+	rootCmd := &cobra.Command{
+		Use:   "vsr",
+		Short: "vLLM Semantic Router Control CLI",
+		Long: `vsr is a command-line tool for managing the vLLM Semantic Router.
+
+It provides a unified interface for installing, configuring, deploying, and
+managing the router across different environments (local, Docker, Kubernetes).
+
+Common workflows:
+  vsr init                    # Initialize a new configuration
+  vsr config validate         # Validate your configuration
+  vsr deploy docker           # Deploy using Docker Compose
+  vsr status                  # Check router status
+  vsr test-prompt "test"      # Send a test prompt
+
+For detailed help on any command, use:
+  vsr <command> --help`,
+		Version: fmt.Sprintf("%s (commit: %s, built: %s)", version, gitCommit, buildDate),
+	}
+
+	// Global flags
+	rootCmd.PersistentFlags().StringP("config", "c", "config/config.yaml", "Path to configuration file")
+	rootCmd.PersistentFlags().BoolP("verbose", "v", false, "Enable verbose output")
+	rootCmd.PersistentFlags().StringP("output", "o", "table", "Output format: table, json, yaml")
+
+	// Add subcommands
+	rootCmd.AddCommand(commands.NewConfigCmd())
+	rootCmd.AddCommand(commands.NewGetCmd())
+	rootCmd.AddCommand(commands.NewDeployCmd())
+	rootCmd.AddCommand(commands.NewUndeployCmd())
+	rootCmd.AddCommand(commands.NewUpgradeCmd())
+	rootCmd.AddCommand(commands.NewStatusCmd())
+	rootCmd.AddCommand(commands.NewLogsCmd())
+	rootCmd.AddCommand(commands.NewModelCmd())
+	rootCmd.AddCommand(commands.NewDebugCmd())
+	rootCmd.AddCommand(commands.NewHealthCmd())
+	rootCmd.AddCommand(commands.NewDiagnoseCmd())
+	rootCmd.AddCommand(commands.NewDashboardCmd())
+	rootCmd.AddCommand(commands.NewMetricsCmd())
+	rootCmd.AddCommand(commands.NewTestCmd())
+	rootCmd.AddCommand(commands.NewInstallCmd())
+	rootCmd.AddCommand(commands.NewInitCmd())
+	rootCmd.AddCommand(commands.NewCompletionCmd())
+
+	// Execute
+	if err := rootCmd.Execute(); err != nil {
+		os.Exit(1)
+	}
+}
diff --git a/src/semantic-router/go.mod b/src/semantic-router/go.mod
index 5fa0217ae..4c5f48d1b 100644
--- a/src/semantic-router/go.mod
+++ b/src/semantic-router/go.mod
@@ -13,15 +13,18 @@ replace (
 
 require (
 	github.com/envoyproxy/go-control-plane/envoy v1.32.4
-	github.com/fsnotify/fsnotify v1.7.0
+	github.com/fatih/color v1.18.0
+	github.com/fsnotify/fsnotify v1.9.0
 	github.com/mark3labs/mcp-go v0.42.0-beta.1
 	github.com/milvus-io/milvus-sdk-go/v2 v2.4.2
+	github.com/olekukonko/tablewriter v0.0.5
 	github.com/onsi/ginkgo/v2 v2.23.4
 	github.com/onsi/gomega v1.38.0
 	github.com/openai/openai-go v1.12.0
 	github.com/prometheus/client_golang v1.23.0
 	github.com/prometheus/client_model v0.6.2
 	github.com/redis/go-redis/v9 v9.17.0
+	github.com/spf13/cobra v1.9.1
 	github.com/stretchr/testify v1.11.1
 	github.com/vllm-project/semantic-router/candle-binding v0.0.0-00010101000000-000000000000
 	go.opentelemetry.io/otel v1.38.0
@@ -36,7 +39,7 @@ require (
 	gopkg.in/yaml.v3 v3.0.1
 	k8s.io/apimachinery v0.34.2
 	k8s.io/client-go v0.34.2
-	sigs.k8s.io/controller-runtime v0.19.4
+	sigs.k8s.io/controller-runtime v0.22.4
 	sigs.k8s.io/yaml v1.6.0
 )
 
@@ -54,7 +57,7 @@ require (
 	github.com/dgryski/go-rendezvous v0.0.0-20200823014737-9f7001d12a5f // indirect
 	github.com/emicklei/go-restful/v3 v3.12.2 // indirect
 	github.com/envoyproxy/protoc-gen-validate v1.2.1 // indirect
-	github.com/evanphx/json-patch/v5 v5.9.0 // indirect
+	github.com/evanphx/json-patch/v5 v5.9.11 // indirect
 	github.com/fxamacker/cbor/v2 v2.9.0 // indirect
 	github.com/getsentry/sentry-go v0.12.0 // indirect
 	github.com/go-logr/logr v1.4.3 // indirect
@@ -71,6 +74,7 @@ require (
 	github.com/google/uuid v1.6.0 // indirect
 	github.com/grpc-ecosystem/go-grpc-middleware v1.3.0 // indirect
 	github.com/grpc-ecosystem/grpc-gateway/v2 v2.27.2 // indirect
+	github.com/inconshreveable/mousetrap v1.1.0 // indirect
 	github.com/invopop/jsonschema v0.13.0 // indirect
 	github.com/josharian/intern v1.0.0 // indirect
 	github.com/json-iterator/go v1.1.12 // indirect
@@ -78,6 +82,9 @@ require (
 	github.com/kr/text v0.2.0 // indirect
 	github.com/kylelemons/godebug v1.1.0 // indirect
 	github.com/mailru/easyjson v0.7.7 // indirect
+	github.com/mattn/go-colorable v0.1.13 // indirect
+	github.com/mattn/go-isatty v0.0.20 // indirect
+	github.com/mattn/go-runewidth v0.0.9 // indirect
 	github.com/milvus-io/milvus-proto/go-api/v2 v2.4.10-0.20240819025435-512e3b98866a // indirect
 	github.com/modern-go/concurrent v0.0.0-20180306012644-bacd9c7ef1dd // indirect
 	github.com/modern-go/reflect2 v1.0.3-0.20250322232337-35a7c28c31ee // indirect
@@ -105,7 +112,6 @@ require (
 	go.uber.org/multierr v1.11.0 // indirect
 	go.yaml.in/yaml/v2 v2.4.2 // indirect
 	go.yaml.in/yaml/v3 v3.0.4 // indirect
-	golang.org/x/exp v0.0.0-20230515195305-f3d0a9c9a5cc // indirect
 	golang.org/x/net v0.43.0 // indirect
 	golang.org/x/oauth2 v0.30.0 // indirect
 	golang.org/x/sync v0.16.0 // indirect
diff --git a/src/semantic-router/go.sum b/src/semantic-router/go.sum
index c41e06bfd..f8aa5c19c 100644
--- a/src/semantic-router/go.sum
+++ b/src/semantic-router/go.sum
@@ -40,6 +40,7 @@ github.com/coreos/etcd v3.3.10+incompatible/go.mod h1:uF7uidLiAD3TWHmW31ZFd/JWoc
 github.com/coreos/go-etcd v2.0.0+incompatible/go.mod h1:Jez6KQU2B/sWsbdaef3ED8NzMklzPG4d5KIOhIy30Tk=
 github.com/coreos/go-semver v0.2.0/go.mod h1:nnelYz7RCh+5ahJtPPxZlU+153eP4D4r3EedlOD2RNk=
 github.com/cpuguy83/go-md2man v1.0.10/go.mod h1:SmD6nW6nTyfqj6ABTjUi3V3JVMnlJmwcJI5acqYI6dE=
+github.com/cpuguy83/go-md2man/v2 v2.0.6/go.mod h1:oOW0eioCTA6cOiMLiUPZOpcVxMig6NIQQ7OS05n1F4g=
 github.com/creack/pty v1.1.9/go.mod h1:oKZEueFk5CKHvIhNR5MUki03XCEU+Q6VDXinZuGJ33E=
 github.com/davecgh/go-spew v1.1.0/go.mod h1:J7Y8YcW2NihsgmVo/mv3lAwl/skON4iLHjSsI+c5H38=
 github.com/davecgh/go-spew v1.1.1/go.mod h1:J7Y8YcW2NihsgmVo/mv3lAwl/skON4iLHjSsI+c5H38=
@@ -65,15 +66,17 @@ github.com/envoyproxy/protoc-gen-validate v1.2.1/go.mod h1:d/C80l/jxXLdfEIhX1W2T
 github.com/etcd-io/bbolt v1.3.3/go.mod h1:ZF2nL25h33cCyBtcyWeZ2/I3HQOfTP+0PIEvHjkjCrw=
 github.com/evanphx/json-patch v0.5.2 h1:xVCHIVMUu1wtM/VkR9jVZ45N3FhZfYMMYGorLCR8P3k=
 github.com/evanphx/json-patch v0.5.2/go.mod h1:ZWS5hhDbVDyob71nXKNL0+PWn6ToqBHMikGIFbs31qQ=
-github.com/evanphx/json-patch/v5 v5.9.0 h1:kcBlZQbplgElYIlo/n1hJbls2z/1awpXxpRi0/FOJfg=
-github.com/evanphx/json-patch/v5 v5.9.0/go.mod h1:VNkHZ/282BpEyt/tObQO8s5CMPmYYq14uClGH4abBuQ=
+github.com/evanphx/json-patch/v5 v5.9.11 h1:/8HVnzMq13/3x9TPvjG08wUGqBTmZBsCWzjTM0wiaDU=
+github.com/evanphx/json-patch/v5 v5.9.11/go.mod h1:3j+LviiESTElxA4p3EMKAB9HXj3/XEtnUf6OZxqIQTM=
 github.com/fasthttp-contrib/websocket v0.0.0-20160511215533-1f3b11f56072/go.mod h1:duJ4Jxv5lDcvg4QuQr0oowTf7dz4/CR8NtyCooz9HL8=
+github.com/fatih/color v1.18.0 h1:S8gINlzdQ840/4pfAwic/ZE0djQEH3wM94VfqLTZcOM=
+github.com/fatih/color v1.18.0/go.mod h1:4FelSpRwEGDpQ12mAdzqdOukCy4u8WUtOY6lkT/6HfU=
 github.com/fatih/structs v1.1.0/go.mod h1:9NiDSp5zOcgEDl+j00MP/WkGVPOlPRLejGD8Ga6PJ7M=
 github.com/frankban/quicktest v1.14.6 h1:7Xjx+VpznH+oBnejlPUj8oUpdxnVs4f8XU8WnHkI4W8=
 github.com/frankban/quicktest v1.14.6/go.mod h1:4ptaffx2x8+WTWXmUCuVU6aPUX1/Mz7zb5vbUoiM6w0=
 github.com/fsnotify/fsnotify v1.4.7/go.mod h1:jwhsz4b93w/PPRr/qN1Yymfu8t87LnFCMoQvtojpjFo=
-github.com/fsnotify/fsnotify v1.7.0 h1:8JEhPFa5W2WU7YfeZzPNqzMP6Lwt7L2715Ggo0nosvA=
-github.com/fsnotify/fsnotify v1.7.0/go.mod h1:40Bi/Hjc2AVfZrqy+aj+yEI+/bRxZnMJyTJwOpGvigM=
+github.com/fsnotify/fsnotify v1.9.0 h1:2Ml+OJNzbYCTzsxtv8vKSFD9PbJjmhYF14k/jKC7S9k=
+github.com/fsnotify/fsnotify v1.9.0/go.mod h1:8jBTzvmWwFyi3Pb8djgCCO5IBqzKJ/Jwo8TRcHyHii0=
 github.com/fxamacker/cbor/v2 v2.9.0 h1:NpKPmjDBgUfBms6tr6JZkTHtfFGcMKsw3eGcmD/sapM=
 github.com/fxamacker/cbor/v2 v2.9.0/go.mod h1:vM4b+DJCtHn+zz7h3FFp/hDAI9WNWCsZj23V5ytsSxQ=
 github.com/gavv/httpexpect v2.0.0+incompatible/go.mod h1:x+9tiU1YnrOvnB725RkpoLv1M62hOWzwo5OXotisrKc=
@@ -135,6 +138,8 @@ github.com/golang/protobuf v1.5.2/go.mod h1:XVQd3VNwM+JqD3oG2Ue2ip4fOMUkwXdXDdiu
 github.com/golang/protobuf v1.5.4 h1:i7eJL8qZTpSEXOPTxNKhASYpMn+8e5Q6AdndVa1dWek=
 github.com/golang/protobuf v1.5.4/go.mod h1:lnTiLA8Wa4RWRcIUkrtSVa5nRhsEGBg48fD6rSs7xps=
 github.com/gomodule/redigo v1.7.1-0.20190724094224-574c33c3df38/go.mod h1:B4C85qUVwatsJoIUNIfCRsp7qO0iAmpGFZ4EELWSbC4=
+github.com/google/btree v1.1.3 h1:CVpQJjYgC4VbzxeGVHfvZrv1ctoYCAI8vbl07Fcxlyg=
+github.com/google/btree v1.1.3/go.mod h1:qOPhT0dTNdNzV6Z/lhRX0YXUafgPLFUh+gZMl761Gm4=
 github.com/google/gnostic-models v0.7.0 h1:qwTtogB15McXDaNqTZdzPJRHvaVJlAl+HVQnLmJEJxo=
 github.com/google/gnostic-models v0.7.0/go.mod h1:whL5G0m6dmc5cPxKc5bdKdEN3UjI7OUGxBlw57miDrQ=
 github.com/google/go-cmp v0.2.0/go.mod h1:oXzfMopK8JAjlY9xF4vHSVASa0yLyX7SntLO5aqRK0M=
@@ -166,6 +171,8 @@ github.com/hpcloud/tail v1.0.0/go.mod h1:ab1qPbhIpdTxEkNHXyeSf5vhxWSCs/tWer42PpO
 github.com/hydrogen18/memlistener v0.0.0-20200120041712-dcc25e7acd91/go.mod h1:qEIFzExnS6016fRpRfxrExeVn2gbClQA99gQhnIcdhE=
 github.com/imkira/go-interpol v1.1.0/go.mod h1:z0h2/2T3XF8kyEPpRgJ3kmNv+C43p+I/CoI+jC3w2iA=
 github.com/inconshreveable/mousetrap v1.0.0/go.mod h1:PxqpIevigyE2G7u3NXJIT2ANytuPF1OarO4DADm73n8=
+github.com/inconshreveable/mousetrap v1.1.0 h1:wN+x4NVGpMsO7ErUn/mUI3vEoE6Jt13X2s0bqwp9tc8=
+github.com/inconshreveable/mousetrap v1.1.0/go.mod h1:vpF70FUmC8bwa3OWnCshd2FqLfsEA9PFc4w1p2J65bw=
 github.com/invopop/jsonschema v0.13.0 h1:KvpoAJWEjR3uD9Kbm2HWJmqsEaHt8lBUpd0qHcIi21E=
 github.com/invopop/jsonschema v0.13.0/go.mod h1:ffZ5Km5SWWRAIN6wbDXItl95euhFz2uON45H2qjYt+0=
 github.com/iris-contrib/blackfriday v2.0.0+incompatible/go.mod h1:UzZ2bDEoaSGPbkg6SAB4att1aAwTmVIx/5gCVqeyUdI=
@@ -216,11 +223,18 @@ github.com/mark3labs/mcp-go v0.42.0-beta.1/go.mod h1:T7tUa2jO6MavG+3P25Oy/jR7iCe
 github.com/mattn/go-colorable v0.1.2/go.mod h1:U0ppj6V5qS13XJ6of8GYAs25YV2eR4EVcfRqFIhoBtE=
 github.com/mattn/go-colorable v0.1.8/go.mod h1:u6P/XSegPjTcexA+o6vUJrdnUu04hMope9wVRipJSqc=
 github.com/mattn/go-colorable v0.1.11/go.mod h1:u5H1YNBxpqRaxsYJYSkiCWKzEfiAb1Gb520KVy5xxl4=
+github.com/mattn/go-colorable v0.1.13 h1:fFA4WZxdEF4tXPZVKMLwD8oUnCTTo08duU7wxecdEvA=
+github.com/mattn/go-colorable v0.1.13/go.mod h1:7S9/ev0klgBDR4GtXTXX8a3vIGJpMovkB8vQcUbaXHg=
 github.com/mattn/go-isatty v0.0.7/go.mod h1:Iq45c/XA43vh69/j3iqttzPXn0bhXyGjM0Hdxcsrc5s=
 github.com/mattn/go-isatty v0.0.8/go.mod h1:Iq45c/XA43vh69/j3iqttzPXn0bhXyGjM0Hdxcsrc5s=
 github.com/mattn/go-isatty v0.0.9/go.mod h1:YNRxwqDuOph6SZLI9vUUz6OYw3QyUt7WiY2yME+cCiQ=
 github.com/mattn/go-isatty v0.0.12/go.mod h1:cbi8OIDigv2wuxKPP5vlRcQ1OAZbq2CE4Kysco4FUpU=
 github.com/mattn/go-isatty v0.0.14/go.mod h1:7GGIvUiUoEMVVmxf/4nioHXj79iQHKdU27kJ6hsGG94=
+github.com/mattn/go-isatty v0.0.16/go.mod h1:kYGgaQfpe5nmfYZH+SKPsOc2e4SrIfOl2e/yFXSvRLM=
+github.com/mattn/go-isatty v0.0.20 h1:xfD0iDuEKnDkl03q4limB+vH+GxLEtL/jb4xVJSWWEY=
+github.com/mattn/go-isatty v0.0.20/go.mod h1:W+V8PltTTMOvKvAeJH7IuucS94S2C6jfK/D7dTCTo3Y=
+github.com/mattn/go-runewidth v0.0.9 h1:Lm995f3rfxdpd6TSmuVCHVb/QhupuXlYr8sCI/QdE+0=
+github.com/mattn/go-runewidth v0.0.9/go.mod h1:H031xJmbD/WCDINGzjvQ9THkh0rPKHF+m2gUSrubnMI=
 github.com/mattn/goveralls v0.0.2/go.mod h1:8d1ZMHsd7fW6IRPKQh46F2WRpyib5/X4FOpevwGNQEw=
 github.com/mediocregopher/radix/v3 v3.4.2/go.mod h1:8FL3F6UQRXHXIBSPUs5h0RybMF8i4n7wVopoX3x7Bv8=
 github.com/microcosm-cc/bluemonday v1.0.2/go.mod h1:iVP4YcDBq+n/5fb23BhYFvIMq/leAFZyRl6bYmGDlGc=
@@ -245,6 +259,8 @@ github.com/nats-io/jwt v0.3.0/go.mod h1:fRYCDE99xlTsqUzISS1Bi75UBJ6ljOJQOAAu5Vgl
 github.com/nats-io/nats.go v1.9.1/go.mod h1:ZjDU1L/7fJ09jvUSRVBR2e7+RnLiiIQyqyzEE/Zbp4w=
 github.com/nats-io/nkeys v0.1.0/go.mod h1:xpnFELMwJABBLVhffcfd1MZx6VsNRFpEugbxziKVo7w=
 github.com/nats-io/nuid v1.0.1/go.mod h1:19wcPz3Ph3q0Jbyiqsd0kePYG7A95tJPxeL+1OSON2c=
+github.com/olekukonko/tablewriter v0.0.5 h1:P2Ga83D34wi1o9J6Wh1mRuqd4mF/x/lgBS7N7AbDhec=
+github.com/olekukonko/tablewriter v0.0.5/go.mod h1:hPp6KlRPjbx+hW8ykQs1w3UBbZlj6HuIJcUGPhkA7kY=
 github.com/onsi/ginkgo v1.6.0/go.mod h1:lLunBs/Ym6LB5Z9jYTR76FiuTmxDTDusOGeTQH+WWjE=
 github.com/onsi/ginkgo v1.10.3/go.mod h1:lLunBs/Ym6LB5Z9jYTR76FiuTmxDTDusOGeTQH+WWjE=
 github.com/onsi/ginkgo/v2 v2.23.4 h1:ktYTpKJAVZnDT4VjxSbiBenUjmlL/5QkBEocaWXiQus=
@@ -286,6 +302,7 @@ github.com/rogpeppe/go-internal v1.9.0/go.mod h1:WtVeX8xhTBvf0smdhujwtBcq4Qrzq/f
 github.com/rogpeppe/go-internal v1.13.1 h1:KvO1DLK/DRN07sQ1LQKScxyZJuNnedQ5/wKSR38lUII=
 github.com/rogpeppe/go-internal v1.13.1/go.mod h1:uMEvuHeurkdAXX61udpOXGD/AzZDWNMNyH2VO9fmH0o=
 github.com/russross/blackfriday v1.5.2/go.mod h1:JO/DiYxRf+HjHt06OyowR9PTA263kcR/rfWxYHBV53g=
+github.com/russross/blackfriday/v2 v2.1.0/go.mod h1:+Rmxgy9KzJVeS9/2gXHxylqXiyQDYRxCVz55jmeOWTM=
 github.com/ryanuber/columnize v2.1.0+incompatible/go.mod h1:sm1tb6uqfes/u+d4ooFouqFdy9/2g9QGwK3SQygK0Ts=
 github.com/schollz/closestmatch v2.1.0+incompatible/go.mod h1:RtP1ddjLong6gTkbtmuhtR2uUrrJOpYzYRvbcPAid+g=
 github.com/sergi/go-diff v1.0.0/go.mod h1:0CfEIISq7TuYL3j771MWULgwwjU+GofnZX9QAmXWZgo=
@@ -298,6 +315,8 @@ github.com/spf13/cast v1.3.0/go.mod h1:Qx5cxh0v+4UWYiBimWS+eyWzqEqokIECu5etghLkU
 github.com/spf13/cast v1.7.1 h1:cuNEagBQEHWN1FnbGEjCXL2szYEXqfJPbP2HNUaca9Y=
 github.com/spf13/cast v1.7.1/go.mod h1:ancEpBxwJDODSW/UG4rDrAqiKolqNNh2DX3mk86cAdo=
 github.com/spf13/cobra v0.0.5/go.mod h1:3K3wKZymM7VvHMDS9+Akkh4K60UwM26emMESw8tLCHU=
+github.com/spf13/cobra v1.9.1 h1:CXSaggrXdbHK9CF+8ywj8Amf7PBRmPCOJugH954Nnlo=
+github.com/spf13/cobra v1.9.1/go.mod h1:nDyEzZ8ogv936Cinf6g1RU9MRY64Ir93oCnqb9wxYW0=
 github.com/spf13/jwalterweatherman v1.0.0/go.mod h1:cQK4TGJAtQXfYWX+Ddv3mKDzgVb68N+wFjFa4jdeBTo=
 github.com/spf13/pflag v1.0.3/go.mod h1:DYY7MBk1bdzusC3SYhjObp+wFpr4gzcvqqNjLnInEg4=
 github.com/spf13/pflag v1.0.6 h1:jFzHGLGAlb3ruxLB8MhbI6A8+AQX/2eW4qeyNZXNp2o=
@@ -400,8 +419,6 @@ golang.org/x/crypto v0.0.0-20200622213623-75b288015ac9/go.mod h1:LzIPMQfyMNhhGPh
 golang.org/x/crypto v0.0.0-20210322153248-0c34fe9e7dc2/go.mod h1:T9bdIzuCu7OtxOm1hfPfRQxPLYneinmdGuTeoZ9dtd4=
 golang.org/x/crypto v0.0.0-20210921155107-089bfa567519/go.mod h1:GvvjBRRGRdwPK5ydBHafDWAxML/pGHZbMvKqRZ5+Abc=
 golang.org/x/exp v0.0.0-20190121172915-509febef88a4/go.mod h1:CJ0aWSM057203Lf6IL+f9T1iT9GByDxfZKAQTCR3kQA=
-golang.org/x/exp v0.0.0-20230515195305-f3d0a9c9a5cc h1:mCRnTeVUjcrhlRmO0VK8a6k6Rrf6TF9htwo2pJVSjIU=
-golang.org/x/exp v0.0.0-20230515195305-f3d0a9c9a5cc/go.mod h1:V1LtkGg67GoY2N1AnLN78QLrzxkLyJw7RJb1gzOOz9w=
 golang.org/x/lint v0.0.0-20181026193005-c67002cb31c3/go.mod h1:UVdnD1Gm6xHRNCYTkRU2/jEulfH38KcIWyp/GAMgvoE=
 golang.org/x/lint v0.0.0-20190227174305-5b3e6a55c961/go.mod h1:wehouNa3lNwaWXcvxsM5YxQ5yQlVC4a0KAMCusXpPoU=
 golang.org/x/lint v0.0.0-20190313153728-d0100b6bd8b3/go.mod h1:6SW0HCj/g11FgYtHlgUYUwCkIfeOF89ocIRzGO/8vkc=
@@ -463,6 +480,8 @@ golang.org/x/sys v0.0.0-20210630005230-0f9fa26af87c/go.mod h1:oPkhp1MJrh7nUepCBc
 golang.org/x/sys v0.0.0-20210927094055-39ccf1dd6fa6/go.mod h1:oPkhp1MJrh7nUepCBck5+mAzfO9JrbApNNgaTdGDITg=
 golang.org/x/sys v0.0.0-20211007075335-d3039528d8ac/go.mod h1:oPkhp1MJrh7nUepCBck5+mAzfO9JrbApNNgaTdGDITg=
 golang.org/x/sys v0.0.0-20220209214540-3681064d5158/go.mod h1:oPkhp1MJrh7nUepCBck5+mAzfO9JrbApNNgaTdGDITg=
+golang.org/x/sys v0.0.0-20220811171246-fbc7d0a398ab/go.mod h1:oPkhp1MJrh7nUepCBck5+mAzfO9JrbApNNgaTdGDITg=
+golang.org/x/sys v0.6.0/go.mod h1:oPkhp1MJrh7nUepCBck5+mAzfO9JrbApNNgaTdGDITg=
 golang.org/x/sys v0.37.0 h1:fdNQudmxPjkdUTPnLn5mdQv7Zwvbvpaxqs831goi9kQ=
 golang.org/x/sys v0.37.0/go.mod h1:OgkHotnGiDImocRcuBABYBEXf8A9a87e/uXjp9XT3ks=
 golang.org/x/term v0.0.0-20201126162022-7de9c90e9dd1/go.mod h1:bj7SfCRtBDWHUb9snDiAeCFNEtKQo2Wmx5Cou7ajbmo=
@@ -567,8 +586,8 @@ honnef.co/go/tools v0.0.0-20190102054323-c2f93a96b099/go.mod h1:rf3lG4BRIbNafJWh
 honnef.co/go/tools v0.0.0-20190523083050-ea95bdfd59fc/go.mod h1:rf3lG4BRIbNafJWhAfAdb/ePZxsR/4RtNHQocxwk9r4=
 k8s.io/api v0.34.2 h1:fsSUNZhV+bnL6Aqrp6O7lMTy6o5x2C4XLjnh//8SLYY=
 k8s.io/api v0.34.2/go.mod h1:MMBPaWlED2a8w4RSeanD76f7opUoypY8TFYkSM+3XHw=
-k8s.io/apiextensions-apiserver v0.31.0 h1:fZgCVhGwsclj3qCw1buVXCV6khjRzKC5eCFt24kyLSk=
-k8s.io/apiextensions-apiserver v0.31.0/go.mod h1:b9aMDEYaEe5sdK+1T0KU78ApR/5ZVp4i56VacZYEHxk=
+k8s.io/apiextensions-apiserver v0.34.1 h1:NNPBva8FNAPt1iSVwIE0FsdrVriRXMsaWFMqJbII2CI=
+k8s.io/apiextensions-apiserver v0.34.1/go.mod h1:hP9Rld3zF5Ay2Of3BeEpLAToP+l4s5UlxiHfqRaRcMc=
 k8s.io/apimachinery v0.34.2 h1:zQ12Uk3eMHPxrsbUJgNF8bTauTVR2WgqJsTmwTE/NW4=
 k8s.io/apimachinery v0.34.2/go.mod h1:/GwIlEcWuTX9zKIg2mbw0LRFIsXwrfoVxn+ef0X13lw=
 k8s.io/client-go v0.34.2 h1:Co6XiknN+uUZqiddlfAjT68184/37PS4QAzYvQvDR8M=
@@ -579,8 +598,8 @@ k8s.io/kube-openapi v0.0.0-20250710124328-f3f2b991d03b h1:MloQ9/bdJyIu9lb1PzujOP
 k8s.io/kube-openapi v0.0.0-20250710124328-f3f2b991d03b/go.mod h1:UZ2yyWbFTpuhSbFhv24aGNOdoRdJZgsIObGBUaYVsts=
 k8s.io/utils v0.0.0-20250604170112-4c0f3b243397 h1:hwvWFiBzdWw1FhfY1FooPn3kzWuJ8tmbZBHi4zVsl1Y=
 k8s.io/utils v0.0.0-20250604170112-4c0f3b243397/go.mod h1:OLgZIPagt7ERELqWJFomSt595RzquPNLL48iOWgYOg0=
-sigs.k8s.io/controller-runtime v0.19.4 h1:SUmheabttt0nx8uJtoII4oIP27BVVvAKFvdvGFwV/Qo=
-sigs.k8s.io/controller-runtime v0.19.4/go.mod h1:iRmWllt8IlaLjvTTDLhRBXIEtkCK6hwVBJJsYS9Ajf4=
+sigs.k8s.io/controller-runtime v0.22.4 h1:GEjV7KV3TY8e+tJ2LCTxUTanW4z/FmNB7l327UfMq9A=
+sigs.k8s.io/controller-runtime v0.22.4/go.mod h1:+QX1XUpTXN4mLoblf4tqr5CQcyHPAki2HLXqQMY6vh8=
 sigs.k8s.io/json v0.0.0-20241014173422-cfa47c3a1cc8 h1:gBQPwqORJ8d8/YNZWEjoZs7npUVDpVXUUOFfW6CgAqE=
 sigs.k8s.io/json v0.0.0-20241014173422-cfa47c3a1cc8/go.mod h1:mdzfpAEoE6DHQEN0uh9ZbOCuHbLK5wOm7dK4ctXE9Tg=
 sigs.k8s.io/randfill v1.0.0 h1:JfjMILfT8A6RbawdsK2JXGBR5AQVfd+9TbzrlneTyrU=
diff --git a/src/semantic-router/pkg/cli/debug/checker.go b/src/semantic-router/pkg/cli/debug/checker.go
new file mode 100644
index 000000000..67008237e
--- /dev/null
+++ b/src/semantic-router/pkg/cli/debug/checker.go
@@ -0,0 +1,572 @@
+package debug
+
+import (
+	"fmt"
+	"net"
+	"net/http"
+	"os"
+	"os/exec"
+	"path/filepath"
+	"runtime"
+	"strings"
+	"syscall"
+	"time"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/config"
+)
+
+// CheckResult represents the result of a check
+type CheckResult struct {
+	Name     string
+	Status   string // "pass", "warn", "fail"
+	Message  string
+	Details  []string
+	Severity string // "critical", "warning", "info"
+}
+
+// DiagnosticReport contains all diagnostic information
+type DiagnosticReport struct {
+	Timestamp       time.Time
+	SystemInfo      SystemInfo
+	Prerequisites   []CheckResult
+	Configuration   []CheckResult
+	ModelStatus     []CheckResult
+	Resources       []CheckResult
+	Connectivity    []CheckResult
+	Recommendations []string
+}
+
+// SystemInfo contains system information
+type SystemInfo struct {
+	OS           string
+	Architecture string
+	GoVersion    string
+	Hostname     string
+	WorkingDir   string
+}
+
+// CheckPrerequisites checks all required tools
+func CheckPrerequisites() []CheckResult {
+	results := []CheckResult{}
+
+	// Check Go
+	if version := runtime.Version(); version != "" {
+		results = append(results, CheckResult{
+			Name:     "Go",
+			Status:   "pass",
+			Message:  fmt.Sprintf("Found: %s", version),
+			Severity: "info",
+		})
+	} else {
+		results = append(results, CheckResult{
+			Name:     "Go",
+			Status:   "fail",
+			Message:  "Go not found",
+			Severity: "critical",
+		})
+	}
+
+	// Check kubectl
+	results = append(results, checkCommand("kubectl", "kubectl version --client --short", false))
+
+	// Check docker
+	results = append(results, checkCommand("docker", "docker --version", false))
+
+	// Check docker-compose
+	dockerComposeResult := checkCommand("docker-compose", "docker-compose --version", false)
+	if dockerComposeResult.Status != "pass" {
+		// Try docker compose (v2)
+		dockerComposeResult = checkCommand("docker-compose", "docker compose version", false)
+	}
+	results = append(results, dockerComposeResult)
+
+	// Check helm
+	results = append(results, checkCommand("helm", "helm version --short", false))
+
+	// Check make
+	results = append(results, checkCommand("make", "make --version", false))
+
+	// Check git
+	results = append(results, checkCommand("git", "git --version", false))
+
+	return results
+}
+
+// checkCommand checks if a command exists and runs successfully
+func checkCommand(name, command string, critical bool) CheckResult {
+	parts := strings.Fields(command)
+	//nolint:gosec // G204: Command is from internal prerequisite checks, not user input
+	cmd := exec.Command(parts[0], parts[1:]...)
+
+	output, err := cmd.CombinedOutput()
+	if err != nil {
+		severity := "warning"
+		if critical {
+			severity = "critical"
+		}
+		return CheckResult{
+			Name:     name,
+			Status:   "fail",
+			Message:  fmt.Sprintf("Not found or not working: %v", err),
+			Severity: severity,
+		}
+	}
+
+	// Extract version from output
+	outputStr := strings.TrimSpace(string(output))
+	lines := strings.Split(outputStr, "\n")
+	version := lines[0]
+	if len(version) > 100 {
+		version = version[:100] + "..."
+	}
+
+	return CheckResult{
+		Name:     name,
+		Status:   "pass",
+		Message:  version,
+		Severity: "info",
+	}
+}
+
+// CheckConfiguration validates the configuration file
+func CheckConfiguration(configPath string) []CheckResult {
+	results := []CheckResult{}
+
+	// Check if config file exists
+	if _, err := os.Stat(configPath); os.IsNotExist(err) {
+		results = append(results, CheckResult{
+			Name:     "Config File",
+			Status:   "fail",
+			Message:  fmt.Sprintf("Configuration file not found: %s", configPath),
+			Severity: "critical",
+		})
+		return results
+	}
+
+	results = append(results, CheckResult{
+		Name:     "Config File",
+		Status:   "pass",
+		Message:  fmt.Sprintf("Found: %s", configPath),
+		Severity: "info",
+	})
+
+	// Try to parse config
+	cfg, err := config.Parse(configPath)
+	if err != nil {
+		results = append(results, CheckResult{
+			Name:     "Config Parse",
+			Status:   "fail",
+			Message:  fmt.Sprintf("Failed to parse: %v", err),
+			Severity: "critical",
+			Details:  []string{"Check YAML syntax", "Verify all required fields are present"},
+		})
+		return results
+	}
+
+	results = append(results, CheckResult{
+		Name:     "Config Parse",
+		Status:   "pass",
+		Message:  "Configuration parsed successfully",
+		Severity: "info",
+	})
+
+	// Validate config
+	if err := cli.ValidateConfig(cfg); err != nil {
+		results = append(results, CheckResult{
+			Name:     "Config Validation",
+			Status:   "fail",
+			Message:  fmt.Sprintf("Validation failed: %v", err),
+			Severity: "critical",
+			Details:  strings.Split(err.Error(), "\n"),
+		})
+	} else {
+		results = append(results, CheckResult{
+			Name:     "Config Validation",
+			Status:   "pass",
+			Message:  "Configuration is valid",
+			Severity: "info",
+		})
+	}
+
+	return results
+}
+
+// CheckModelStatus checks model availability
+func CheckModelStatus(modelsDir string) []CheckResult {
+	results := []CheckResult{}
+
+	// Check if models directory exists
+	if _, err := os.Stat(modelsDir); os.IsNotExist(err) {
+		results = append(results, CheckResult{
+			Name:     "Models Directory",
+			Status:   "fail",
+			Message:  fmt.Sprintf("Models directory not found: %s", modelsDir),
+			Severity: "critical",
+			Details:  []string{"Run: make download-models", "Or create the directory manually"},
+		})
+		return results
+	}
+
+	results = append(results, CheckResult{
+		Name:     "Models Directory",
+		Status:   "pass",
+		Message:  fmt.Sprintf("Found: %s", modelsDir),
+		Severity: "info",
+	})
+
+	// Count model files
+	modelCount := 0
+	_ = filepath.Walk(modelsDir, func(path string, info os.FileInfo, err error) error {
+		if err == nil && !info.IsDir() {
+			if strings.HasSuffix(path, ".bin") || strings.HasSuffix(path, ".safetensors") {
+				modelCount++
+			}
+		}
+		return nil
+	})
+
+	if modelCount == 0 {
+		results = append(results, CheckResult{
+			Name:     "Model Files",
+			Status:   "warn",
+			Message:  "No model files found",
+			Severity: "warning",
+			Details:  []string{"Models may not be downloaded", "Run: make download-models"},
+		})
+	} else {
+		results = append(results, CheckResult{
+			Name:     "Model Files",
+			Status:   "pass",
+			Message:  fmt.Sprintf("Found %d model file(s)", modelCount),
+			Severity: "info",
+		})
+	}
+
+	return results
+}
+
+// CheckResources checks system resources
+func CheckResources() []CheckResult {
+	results := []CheckResult{}
+
+	// Check disk space
+	cwd, _ := os.Getwd()
+	var stat syscall.Statfs_t
+	_ = syscall.Statfs(cwd, &stat)
+
+	// Available space in bytes
+	//nolint:gosec // G115: Block size is always positive, conversion is safe
+	availableSpace := stat.Bavail * uint64(stat.Bsize)
+	//nolint:gosec // G115: Block size is always positive, conversion is safe
+	totalSpace := stat.Blocks * uint64(stat.Bsize)
+	usedSpace := totalSpace - availableSpace
+	usedPercent := float64(usedSpace) / float64(totalSpace) * 100
+
+	diskStatus := "pass"
+	diskSeverity := "info"
+	if usedPercent > 90 {
+		diskStatus = "warn"
+		diskSeverity = "warning"
+	} else if usedPercent > 95 {
+		diskStatus = "fail"
+		diskSeverity = "critical"
+	}
+
+	results = append(results, CheckResult{
+		Name:     "Disk Space",
+		Status:   diskStatus,
+		Message:  fmt.Sprintf("%.1f%% used (%.2f GB available)", usedPercent, float64(availableSpace)/1024/1024/1024),
+		Severity: diskSeverity,
+	})
+
+	// Check common ports
+	commonPorts := []int{8080, 8801, 8700, 3000, 9090}
+	usedPorts := []int{}
+
+	for _, port := range commonPorts {
+		if !isPortAvailable(port) {
+			usedPorts = append(usedPorts, port)
+		}
+	}
+
+	if len(usedPorts) > 0 {
+		results = append(results, CheckResult{
+			Name:     "Port Availability",
+			Status:   "warn",
+			Message:  fmt.Sprintf("%d port(s) in use: %v", len(usedPorts), usedPorts),
+			Severity: "warning",
+			Details:  []string{"These ports are commonly used by the router", "Check: netstat -tulpn | grep <port>"},
+		})
+	} else {
+		results = append(results, CheckResult{
+			Name:     "Port Availability",
+			Status:   "pass",
+			Message:  "All common ports available",
+			Severity: "info",
+		})
+	}
+
+	return results
+}
+
+// CheckConnectivity checks network connectivity
+func CheckConnectivity(endpoints []string) []CheckResult {
+	results := []CheckResult{}
+
+	if len(endpoints) == 0 {
+		endpoints = []string{
+			"http://localhost:8080/health",
+			"http://localhost:8080/metrics",
+		}
+	}
+
+	for _, endpoint := range endpoints {
+		result := checkEndpoint(endpoint)
+		results = append(results, result)
+	}
+
+	return results
+}
+
+// checkEndpoint checks if an endpoint is reachable
+func checkEndpoint(endpoint string) CheckResult {
+	client := &http.Client{
+		Timeout: 5 * time.Second,
+	}
+
+	resp, err := client.Get(endpoint)
+	if err != nil {
+		return CheckResult{
+			Name:     endpoint,
+			Status:   "fail",
+			Message:  fmt.Sprintf("Not reachable: %v", err),
+			Severity: "warning",
+		}
+	}
+	defer resp.Body.Close()
+
+	if resp.StatusCode >= 200 && resp.StatusCode < 300 {
+		return CheckResult{
+			Name:     endpoint,
+			Status:   "pass",
+			Message:  fmt.Sprintf("Reachable (HTTP %d)", resp.StatusCode),
+			Severity: "info",
+		}
+	}
+
+	return CheckResult{
+		Name:     endpoint,
+		Status:   "warn",
+		Message:  fmt.Sprintf("Reachable but returned HTTP %d", resp.StatusCode),
+		Severity: "warning",
+	}
+}
+
+// isPortAvailable checks if a port is available
+func isPortAvailable(port int) bool {
+	address := fmt.Sprintf("localhost:%d", port)
+	conn, err := net.DialTimeout("tcp", address, 1*time.Second)
+	if err != nil {
+		return true // Port is available (connection failed)
+	}
+	conn.Close()
+	return false // Port is in use
+}
+
+// GetSystemInfo returns system information
+func GetSystemInfo() SystemInfo {
+	hostname, _ := os.Hostname()
+	cwd, _ := os.Getwd()
+
+	return SystemInfo{
+		OS:           runtime.GOOS,
+		Architecture: runtime.GOARCH,
+		GoVersion:    runtime.Version(),
+		Hostname:     hostname,
+		WorkingDir:   cwd,
+	}
+}
+
+// GenerateRecommendations generates recommendations based on check results
+func GenerateRecommendations(report *DiagnosticReport) []string {
+	recommendations := []string{}
+
+	// Check for failed prerequisites
+	for _, result := range report.Prerequisites {
+		if result.Status == "fail" {
+			switch result.Name {
+			case "kubectl":
+				recommendations = append(recommendations, "Install kubectl: https://kubernetes.io/docs/tasks/tools/")
+			case "docker":
+				recommendations = append(recommendations, "Install Docker: https://docs.docker.com/get-docker/")
+			case "docker-compose":
+				recommendations = append(recommendations, "Install Docker Compose: https://docs.docker.com/compose/install/")
+			case "helm":
+				recommendations = append(recommendations, "Install Helm: https://helm.sh/docs/intro/install/")
+			case "make":
+				recommendations = append(recommendations, "Install make: apt-get install build-essential (Ubuntu) or brew install make (macOS)")
+			}
+		}
+	}
+
+	// Check for config issues
+	for _, result := range report.Configuration {
+		if result.Status == "fail" && result.Name == "Config File" {
+			recommendations = append(recommendations, "Initialize configuration: vsr init")
+		}
+	}
+
+	// Check for model issues
+	for _, result := range report.ModelStatus {
+		if result.Status == "fail" || result.Status == "warn" {
+			recommendations = append(recommendations, "Download models: make download-models")
+			break
+		}
+	}
+
+	// Check for resource issues
+	for _, result := range report.Resources {
+		if result.Name == "Disk Space" && result.Status != "pass" {
+			recommendations = append(recommendations, "Free up disk space or clean up unused models: vsr model remove <model-id>")
+		}
+		if result.Name == "Port Availability" && result.Status == "warn" {
+			recommendations = append(recommendations, "Stop services using required ports or configure different ports in config.yaml")
+		}
+	}
+
+	// If everything passes
+	if len(recommendations) == 0 {
+		recommendations = append(recommendations, "All checks passed! You're ready to deploy.")
+		recommendations = append(recommendations, "Deploy with: vsr deploy [local|docker|kubernetes|helm]")
+	}
+
+	return recommendations
+}
+
+// RunFullDiagnostics runs all diagnostic checks
+func RunFullDiagnostics(configPath, modelsDir string) *DiagnosticReport {
+	report := &DiagnosticReport{
+		Timestamp:  time.Now(),
+		SystemInfo: GetSystemInfo(),
+	}
+
+	cli.Info("Running comprehensive diagnostics...")
+	cli.Info("")
+
+	// Prerequisites
+	cli.Info("Checking prerequisites...")
+	report.Prerequisites = CheckPrerequisites()
+
+	// Configuration
+	cli.Info("Checking configuration...")
+	report.Configuration = CheckConfiguration(configPath)
+
+	// Models
+	cli.Info("Checking models...")
+	report.ModelStatus = CheckModelStatus(modelsDir)
+
+	// Resources
+	cli.Info("Checking system resources...")
+	report.Resources = CheckResources()
+
+	// Connectivity
+	cli.Info("Checking connectivity...")
+	report.Connectivity = CheckConnectivity(nil)
+
+	// Generate recommendations
+	report.Recommendations = GenerateRecommendations(report)
+
+	return report
+}
+
+// DisplayReport displays a diagnostic report
+func DisplayReport(report *DiagnosticReport) {
+	cli.Info("\n╔════════════════════════════════════════════════════════════════╗")
+	cli.Info("║                    Diagnostic Report                          ║")
+	cli.Info("╚════════════════════════════════════════════════════════════════╝")
+
+	// System Info
+	cli.Info("\n📋 System Information:")
+	cli.Info(fmt.Sprintf("  OS: %s (%s)", report.SystemInfo.OS, report.SystemInfo.Architecture))
+	cli.Info(fmt.Sprintf("  Go: %s", report.SystemInfo.GoVersion))
+	cli.Info(fmt.Sprintf("  Hostname: %s", report.SystemInfo.Hostname))
+	cli.Info(fmt.Sprintf("  Working Directory: %s", report.SystemInfo.WorkingDir))
+	cli.Info(fmt.Sprintf("  Timestamp: %s", report.Timestamp.Format(time.RFC3339)))
+
+	// Display each category
+	displayCheckCategory("Prerequisites", report.Prerequisites)
+	displayCheckCategory("Configuration", report.Configuration)
+	displayCheckCategory("Models", report.ModelStatus)
+	displayCheckCategory("Resources", report.Resources)
+	displayCheckCategory("Connectivity", report.Connectivity)
+
+	// Recommendations
+	if len(report.Recommendations) > 0 {
+		cli.Info("\n💡 Recommendations:")
+		for i, rec := range report.Recommendations {
+			cli.Info(fmt.Sprintf("  %d. %s", i+1, rec))
+		}
+	}
+
+	// Summary
+	totalChecks := len(report.Prerequisites) + len(report.Configuration) +
+		len(report.ModelStatus) + len(report.Resources) + len(report.Connectivity)
+	passedChecks := 0
+	failedChecks := 0
+	warningChecks := 0
+
+	for _, results := range [][]CheckResult{
+		report.Prerequisites,
+		report.Configuration,
+		report.ModelStatus,
+		report.Resources,
+		report.Connectivity,
+	} {
+		for _, result := range results {
+			switch result.Status {
+			case "pass":
+				passedChecks++
+			case "fail":
+				failedChecks++
+			case "warn":
+				warningChecks++
+			}
+		}
+	}
+
+	cli.Info(fmt.Sprintf("\n📊 Summary: %d checks (%d passed, %d warnings, %d failed)",
+		totalChecks, passedChecks, warningChecks, failedChecks))
+}
+
+// displayCheckCategory displays a category of checks
+func displayCheckCategory(category string, results []CheckResult) {
+	if len(results) == 0 {
+		return
+	}
+
+	cli.Info(fmt.Sprintf("\n🔍 %s:", category))
+	for _, result := range results {
+		symbol := getStatusSymbol(result.Status)
+		cli.Info(fmt.Sprintf("  %s %-25s %s", symbol, result.Name, result.Message))
+		if len(result.Details) > 0 {
+			for _, detail := range result.Details {
+				cli.Info(fmt.Sprintf("      → %s", detail))
+			}
+		}
+	}
+}
+
+// getStatusSymbol returns a symbol for the status
+func getStatusSymbol(status string) string {
+	switch status {
+	case "pass":
+		return "✓"
+	case "fail":
+		return "✗"
+	case "warn":
+		return "⚠"
+	default:
+		return "•"
+	}
+}
diff --git a/src/semantic-router/pkg/cli/debug/checker_test.go b/src/semantic-router/pkg/cli/debug/checker_test.go
new file mode 100644
index 000000000..fcac08f28
--- /dev/null
+++ b/src/semantic-router/pkg/cli/debug/checker_test.go
@@ -0,0 +1,337 @@
+package debug
+
+import (
+	"os"
+	"path/filepath"
+	"testing"
+)
+
+func TestGetSystemInfo(t *testing.T) {
+	info := GetSystemInfo()
+
+	if info.OS == "" {
+		t.Error("Expected OS to be set")
+	}
+
+	if info.Architecture == "" {
+		t.Error("Expected Architecture to be set")
+	}
+
+	if info.GoVersion == "" {
+		t.Error("Expected GoVersion to be set")
+	}
+}
+
+func TestCheckPrerequisites(t *testing.T) {
+	results := CheckPrerequisites()
+
+	// Should have at least some results
+	if len(results) == 0 {
+		t.Error("Expected at least some prerequisite checks")
+	}
+
+	// Check that Go is always present (since we're running in Go)
+	hasGo := false
+	for _, result := range results {
+		if result.Name == "Go" {
+			hasGo = true
+			if result.Status != "pass" {
+				t.Error("Expected Go to pass (since we're running in Go)")
+			}
+		}
+	}
+
+	if !hasGo {
+		t.Error("Expected Go to be in prerequisite checks")
+	}
+}
+
+func TestCheckConfiguration(t *testing.T) {
+	t.Run("nonexistent config file", func(t *testing.T) {
+		results := CheckConfiguration("/nonexistent/config.yaml")
+
+		// Should have at least one result
+		if len(results) == 0 {
+			t.Error("Expected at least one result")
+		}
+
+		// First result should be about missing file
+		if results[0].Status != "fail" {
+			t.Error("Expected fail status for nonexistent config")
+		}
+	})
+
+	t.Run("invalid config file", func(t *testing.T) {
+		// Create a temp invalid config
+		tmpDir := t.TempDir()
+		configPath := filepath.Join(tmpDir, "config.yaml")
+		_ = os.WriteFile(configPath, []byte("invalid: yaml: content: ["), 0o644)
+
+		results := CheckConfiguration(configPath)
+
+		// Should detect parse failure
+		hasParseError := false
+		for _, result := range results {
+			if result.Name == "Config Parse" && result.Status == "fail" {
+				hasParseError = true
+			}
+		}
+
+		if !hasParseError {
+			t.Error("Expected parse error for invalid config")
+		}
+	})
+}
+
+func TestCheckModelStatus(t *testing.T) {
+	t.Run("nonexistent models directory", func(t *testing.T) {
+		results := CheckModelStatus("/nonexistent/models")
+
+		// Should have at least one result
+		if len(results) == 0 {
+			t.Error("Expected at least one result")
+		}
+
+		// Should fail
+		if results[0].Status != "fail" {
+			t.Error("Expected fail status for nonexistent models dir")
+		}
+	})
+
+	t.Run("empty models directory", func(t *testing.T) {
+		tmpDir := t.TempDir()
+
+		results := CheckModelStatus(tmpDir)
+
+		// Should have results
+		if len(results) < 2 {
+			t.Error("Expected at least 2 results (directory + model files)")
+		}
+
+		// Directory should pass
+		if results[0].Status != "pass" {
+			t.Error("Expected pass status for existing directory")
+		}
+
+		// Model files check should warn
+		if results[1].Status != "warn" {
+			t.Error("Expected warn status for no model files")
+		}
+	})
+}
+
+func TestCheckResources(t *testing.T) {
+	results := CheckResources()
+
+	// Should have at least disk space check
+	if len(results) == 0 {
+		t.Error("Expected at least one resource check")
+	}
+
+	hasDiskCheck := false
+	for _, result := range results {
+		if result.Name == "Disk Space" {
+			hasDiskCheck = true
+			// Disk check should have a message
+			if result.Message == "" {
+				t.Error("Expected disk space message")
+			}
+		}
+	}
+
+	if !hasDiskCheck {
+		t.Error("Expected disk space check")
+	}
+}
+
+func TestCheckConnectivity(t *testing.T) {
+	t.Run("invalid endpoint", func(t *testing.T) {
+		results := CheckConnectivity([]string{"http://invalid-endpoint-12345:9999"})
+
+		// Should have one result
+		if len(results) != 1 {
+			t.Errorf("Expected 1 result, got %d", len(results))
+		}
+
+		// Should fail
+		if results[0].Status != "fail" {
+			t.Error("Expected fail status for invalid endpoint")
+		}
+	})
+
+	t.Run("default endpoints", func(t *testing.T) {
+		results := CheckConnectivity(nil)
+
+		// Should check default endpoints
+		if len(results) == 0 {
+			t.Error("Expected at least one default endpoint check")
+		}
+	})
+}
+
+func TestIsPortAvailable(t *testing.T) {
+	// Test with a very high port that's unlikely to be in use
+	highPort := 54321
+
+	// Should be available (or we can't test accurately)
+	result := isPortAvailable(highPort)
+
+	// Just verify the function runs without error
+	_ = result
+}
+
+func TestGetStatusSymbol(t *testing.T) {
+	tests := []struct {
+		status   string
+		expected string
+	}{
+		{"pass", "✓"},
+		{"fail", "✗"},
+		{"warn", "⚠"},
+		{"unknown", "•"},
+	}
+
+	for _, tt := range tests {
+		result := getStatusSymbol(tt.status)
+		if result != tt.expected {
+			t.Errorf("getStatusSymbol(%s) = %s, expected %s", tt.status, result, tt.expected)
+		}
+	}
+}
+
+func TestGenerateRecommendations(t *testing.T) {
+	t.Run("all pass", func(t *testing.T) {
+		report := &DiagnosticReport{
+			Prerequisites: []CheckResult{
+				{Name: "Go", Status: "pass"},
+			},
+			Configuration: []CheckResult{
+				{Name: "Config File", Status: "pass"},
+			},
+			ModelStatus: []CheckResult{
+				{Name: "Models", Status: "pass"},
+			},
+			Resources: []CheckResult{
+				{Name: "Disk Space", Status: "pass"},
+			},
+		}
+
+		recommendations := GenerateRecommendations(report)
+
+		// Should have at least one recommendation (deploy)
+		if len(recommendations) == 0 {
+			t.Error("Expected at least one recommendation")
+		}
+	})
+
+	t.Run("kubectl missing", func(t *testing.T) {
+		report := &DiagnosticReport{
+			Prerequisites: []CheckResult{
+				{Name: "kubectl", Status: "fail"},
+			},
+		}
+
+		recommendations := GenerateRecommendations(report)
+
+		// Should recommend installing kubectl
+		hasKubectlRec := false
+		for _, rec := range recommendations {
+			if containsIgnoreCase(rec, "kubectl") {
+				hasKubectlRec = true
+			}
+		}
+
+		if !hasKubectlRec {
+			t.Error("Expected recommendation to install kubectl")
+		}
+	})
+
+	t.Run("models missing", func(t *testing.T) {
+		report := &DiagnosticReport{
+			ModelStatus: []CheckResult{
+				{Name: "Model Files", Status: "warn"},
+			},
+		}
+
+		recommendations := GenerateRecommendations(report)
+
+		// Should recommend downloading models
+		hasModelRec := false
+		for _, rec := range recommendations {
+			if containsIgnoreCase(rec, "model") {
+				hasModelRec = true
+			}
+		}
+
+		if !hasModelRec {
+			t.Error("Expected recommendation to download models")
+		}
+	})
+}
+
+func TestRunFullDiagnostics(t *testing.T) {
+	t.Run("with nonexistent paths", func(t *testing.T) {
+		report := RunFullDiagnostics("/nonexistent/config.yaml", "/nonexistent/models")
+
+		// Should have a report
+		if report == nil {
+			t.Error("Expected non-nil report")
+		}
+
+		// Should have timestamp
+		if report.Timestamp.IsZero() {
+			t.Error("Expected timestamp to be set")
+		}
+
+		// Should have system info
+		if report.SystemInfo.OS == "" {
+			t.Error("Expected system info to be set")
+		}
+
+		// Should have some checks
+		totalChecks := len(report.Prerequisites) + len(report.Configuration) +
+			len(report.ModelStatus) + len(report.Resources) + len(report.Connectivity)
+
+		if totalChecks == 0 {
+			t.Error("Expected at least some checks to be run")
+		}
+	})
+}
+
+// Helper function
+func containsIgnoreCase(s, substr string) bool {
+	s = toLower(s)
+	substr = toLower(substr)
+	return contains(s, substr)
+}
+
+func toLower(s string) string {
+	result := []rune{}
+	for _, r := range s {
+		if r >= 'A' && r <= 'Z' {
+			result = append(result, r+32)
+		} else {
+			result = append(result, r)
+		}
+	}
+	return string(result)
+}
+
+func contains(s, substr string) bool {
+	return findIndex(s, substr) >= 0
+}
+
+func findIndex(s, substr string) int {
+	if len(substr) == 0 {
+		return 0
+	}
+	if len(substr) > len(s) {
+		return -1
+	}
+	for i := 0; i <= len(s)-len(substr); i++ {
+		if s[i:i+len(substr)] == substr {
+			return i
+		}
+	}
+	return -1
+}
diff --git a/src/semantic-router/pkg/cli/deployment/deployment.go b/src/semantic-router/pkg/cli/deployment/deployment.go
new file mode 100644
index 000000000..2001bb715
--- /dev/null
+++ b/src/semantic-router/pkg/cli/deployment/deployment.go
@@ -0,0 +1,1282 @@
+package deployment
+
+import (
+	"fmt"
+	"os"
+	"os/exec"
+	"path/filepath"
+	"strconv"
+	"strings"
+	"syscall"
+	"time"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/config"
+)
+
+// getPIDFilePath returns the cross-platform PID file path
+// NOTE: Uses os.Getuid() which returns -1 on Windows for all users.
+// Focus is on Linux/macOS. For future Windows support, consider using
+// user.Current().Username or os.UserHomeDir() for true cross-platform
+// user isolation.
+func getPIDFilePath() string {
+	return filepath.Join(os.TempDir(), fmt.Sprintf("vsr-local-deployment-%d.pid", os.Getuid()))
+}
+
+// getLogFilePath returns the cross-platform log file path
+// NOTE: See getPIDFilePath for Windows limitation regarding os.Getuid().
+func getLogFilePath() string {
+	return filepath.Join(os.TempDir(), fmt.Sprintf("vsr-local-deployment-%d.log", os.Getuid()))
+}
+
+// isProcessRunning checks if a process with the given PID is still running
+func isProcessRunning(pid int) bool {
+	process, err := os.FindProcess(pid)
+	if err != nil {
+		return false
+	}
+	// Send signal 0 to check if process exists (doesn't actually signal)
+	// This is Unix-specific but works on Linux/macOS
+	err = process.Signal(syscall.Signal(0))
+	return err == nil
+}
+
+// stopProcess stops a process gracefully (SIGTERM then SIGKILL if needed)
+func stopProcess(pid int) error {
+	process, err := os.FindProcess(pid)
+	if err != nil {
+		return fmt.Errorf("process not found: %w", err)
+	}
+
+	// Send SIGTERM for graceful shutdown
+	if err := process.Signal(syscall.SIGTERM); err != nil {
+		return fmt.Errorf("failed to send SIGTERM: %w", err)
+	}
+
+	// Wait up to 10 seconds for graceful shutdown
+	for i := 0; i < 10; i++ {
+		time.Sleep(1 * time.Second)
+		if err := process.Signal(syscall.Signal(0)); err != nil {
+			// Process is gone
+			return nil
+		}
+	}
+
+	// Still running, send SIGKILL
+	if err := process.Kill(); err != nil {
+		return fmt.Errorf("failed to kill process: %w", err)
+	}
+
+	time.Sleep(1 * time.Second)
+	return nil
+}
+
+// DeploymentStatus represents the status of a deployment
+type DeploymentStatus struct {
+	Type        string
+	IsRunning   bool
+	ReleaseName string
+	Namespace   string
+	Components  []ComponentStatus
+	Endpoints   []string
+	Uptime      string
+}
+
+// ComponentStatus represents the status of a component
+type ComponentStatus struct {
+	Name    string
+	Status  string
+	Message string
+}
+
+// DeployLocal deploys the router as a local process
+// If force is true, any existing router process will be stopped before starting a new one.
+func DeployLocal(configPath string, force bool) error {
+	cli.Info("Deploying router locally...")
+
+	// Check if binary exists
+	binPath := "bin/router"
+	if _, err := os.Stat(binPath); os.IsNotExist(err) {
+		cli.Warning("Router binary not found. Building...")
+		if err := buildRouter(); err != nil {
+			return fmt.Errorf("failed to build router: %w", err)
+		}
+	}
+
+	// Get absolute config path
+	absConfigPath, err := filepath.Abs(configPath)
+	if err != nil {
+		return fmt.Errorf("failed to resolve config path: %w", err)
+	}
+
+	cli.Info(fmt.Sprintf("Starting router with config: %s", absConfigPath))
+
+	// Get cross-platform file paths
+	pidFilePath := getPIDFilePath()
+	logFilePath := getLogFilePath()
+
+	// Check if router is already running
+	if _, errShadow := os.Stat(pidFilePath); errShadow == nil {
+		// PID file exists, check if process is actually running
+		pidBytes, readErr := os.ReadFile(pidFilePath)
+		if readErr == nil {
+			if pid, atoiErr := strconv.Atoi(strings.TrimSpace(string(pidBytes))); atoiErr == nil {
+				if isProcessRunning(pid) {
+					// Process is running
+					if !force {
+						// Error by default
+						cli.Warning(fmt.Sprintf("Router is already running (PID: %d)", pid))
+						cli.Info(fmt.Sprintf("PID file: %s", pidFilePath))
+						cli.Info(fmt.Sprintf("Log file: %s", logFilePath))
+						cli.Info("")
+						cli.Info("Options:")
+						cli.Info("  1. Stop existing: vsr undeploy local")
+						cli.Info("  2. Force replace: vsr deploy local --force")
+						return fmt.Errorf("router already running (use --force to replace)")
+					}
+
+					// --force flag provided, stop existing process
+					cli.Warning(fmt.Sprintf("Stopping existing router (PID: %d)...", pid))
+					if stopErr := stopProcess(pid); stopErr != nil {
+						cli.Warning(fmt.Sprintf("Failed to stop existing process: %v", stopErr))
+						cli.Info("Continuing anyway (process may have already stopped)")
+					} else {
+						cli.Success("Existing router stopped")
+					}
+
+					// Clean up PID file
+					os.Remove(pidFilePath)
+				} else {
+					// PID file exists but process is not running (stale file)
+					cli.Warning(fmt.Sprintf("Found stale PID file (process %d not running)", pid))
+					cli.Info("Cleaning up stale PID file...")
+					os.Remove(pidFilePath)
+				}
+			}
+		}
+	}
+
+	// Open log file for output (Issue #5: restrictive permissions 0600)
+	logFile, err := os.OpenFile(logFilePath, os.O_CREATE|os.O_WRONLY|os.O_APPEND, 0o600)
+	if err != nil {
+		return fmt.Errorf("failed to create log file: %w", err)
+	}
+	defer logFile.Close()
+
+	// Start router process
+	cmd := exec.Command(binPath, "--config", absConfigPath)
+	cmd.Stdout = logFile
+	cmd.Stderr = logFile
+
+	// Set library path for candle binding (Rust FFI library)
+	// Get current working directory to construct the library path
+	cwd, err := os.Getwd()
+	if err != nil {
+		return fmt.Errorf("failed to get working directory: %w", err)
+	}
+	candleLibPath := filepath.Join(cwd, "candle-binding", "target", "release")
+
+	// Inherit current environment and add/update library path
+	// Linux uses LD_LIBRARY_PATH, macOS uses DYLD_LIBRARY_PATH
+	cmd.Env = os.Environ()
+
+	// Set LD_LIBRARY_PATH for Linux
+	existingLDPath := os.Getenv("LD_LIBRARY_PATH")
+	if existingLDPath != "" {
+		cmd.Env = append(cmd.Env, fmt.Sprintf("LD_LIBRARY_PATH=%s:%s", candleLibPath, existingLDPath))
+	} else {
+		cmd.Env = append(cmd.Env, fmt.Sprintf("LD_LIBRARY_PATH=%s", candleLibPath))
+	}
+
+	// Set DYLD_LIBRARY_PATH for macOS
+	existingDYLDPath := os.Getenv("DYLD_LIBRARY_PATH")
+	if existingDYLDPath != "" {
+		cmd.Env = append(cmd.Env, fmt.Sprintf("DYLD_LIBRARY_PATH=%s:%s", candleLibPath, existingDYLDPath))
+	} else {
+		cmd.Env = append(cmd.Env, fmt.Sprintf("DYLD_LIBRARY_PATH=%s", candleLibPath))
+	}
+
+	if err := cmd.Start(); err != nil {
+		return fmt.Errorf("failed to start router: %w", err)
+	}
+
+	// Store PID for later management (Issue #1: kill process if PID file write fails)
+	pid := cmd.Process.Pid
+	if err := os.WriteFile(pidFilePath, []byte(fmt.Sprintf("%d", pid)), 0o600); err != nil {
+		// Kill process if we can't track it
+		_ = cmd.Process.Kill()
+		return fmt.Errorf("failed to write PID file: %w", err)
+	}
+
+	cli.Success(fmt.Sprintf("Router started (PID: %d)", pid))
+	cli.Info(fmt.Sprintf("PID file: %s", pidFilePath))
+	cli.Info(fmt.Sprintf("Log file: %s", logFilePath))
+	cli.Info("To stop: vsr undeploy local")
+
+	return nil // Don't wait, run in background
+}
+
+// DeployDocker deploys using Docker Compose
+func DeployDocker(configPath string, withObservability bool) error {
+	cli.Info("Deploying router with Docker Compose...")
+
+	// Validate the configuration first
+	cfg, err := config.Parse(configPath)
+	if err != nil {
+		return fmt.Errorf("failed to parse config: %w", err)
+	}
+	if err := cli.ValidateConfig(cfg); err != nil {
+		return fmt.Errorf("configuration validation failed: %w", err)
+	}
+
+	// Check if docker-compose exists
+	if !commandExists("docker-compose") && !commandExists("docker compose") {
+		return fmt.Errorf("docker-compose not found. Please install Docker Compose")
+	}
+
+	// Download models first
+	cli.Info("Downloading models...")
+	cmd := exec.Command("make", "download-models")
+	cmd.Stdout = os.Stdout
+	cmd.Stderr = os.Stderr
+	if err := cmd.Run(); err != nil {
+		return fmt.Errorf("failed to download models: %w", err)
+	}
+
+	// Determine compose file path
+	composeFile := "deploy/docker-compose/docker-compose.yml"
+	if _, err := os.Stat(composeFile); os.IsNotExist(err) {
+		return fmt.Errorf("docker-compose file not found: %s", composeFile)
+	}
+
+	// Run docker-compose up
+	var upCmd *exec.Cmd
+	if commandExists("docker-compose") {
+		upCmd = exec.Command("docker-compose", "-f", composeFile, "up", "-d")
+	} else {
+		upCmd = exec.Command("docker", "compose", "-f", composeFile, "up", "-d")
+	}
+
+	upCmd.Stdout = os.Stdout
+	upCmd.Stderr = os.Stderr
+
+	if err := upCmd.Run(); err != nil {
+		return fmt.Errorf("failed to deploy with docker-compose: %w", err)
+	}
+
+	cli.Success("Router deployed with Docker Compose")
+	cli.Info("Check status with: vsr status")
+	cli.Info("View logs with: vsr logs")
+
+	return nil
+}
+
+// DeployKubernetes deploys to Kubernetes
+func DeployKubernetes(configPath, namespace string, withObservability bool) error {
+	cli.Info("Deploying router to Kubernetes...")
+
+	// Pre-deployment checks
+	cli.Info("Running pre-deployment checks...")
+
+	// 1. Check if kubectl exists
+	if !commandExists("kubectl") {
+		cli.Error("kubectl not found")
+		cli.Info("Install kubectl: https://kubernetes.io/docs/tasks/tools/")
+		return fmt.Errorf("kubectl not found")
+	}
+
+	// 2. Check cluster connectivity
+	cli.Info("Checking cluster connectivity...")
+	clusterInfoCmd := exec.Command("kubectl", "cluster-info")
+	if err := clusterInfoCmd.Run(); err != nil {
+		cli.Error("Unable to connect to Kubernetes cluster")
+		cli.Info("Check your kubeconfig: kubectl config view")
+		cli.Info("List available contexts: kubectl config get-contexts")
+		return fmt.Errorf("no connection to Kubernetes cluster")
+	}
+	cli.Success("Cluster connection verified")
+
+	// 3. Check/create namespace
+	cli.Info(fmt.Sprintf("Checking namespace '%s'...", namespace))
+	nsCheckCmd := exec.Command("kubectl", "get", "namespace", namespace)
+	if err := nsCheckCmd.Run(); err != nil {
+		// Namespace doesn't exist, create it
+		cli.Info(fmt.Sprintf("Creating namespace '%s'...", namespace))
+		nsCreateCmd := exec.Command("kubectl", "create", "namespace", namespace)
+		nsCreateCmd.Stdout = os.Stdout
+		nsCreateCmd.Stderr = os.Stderr
+		if err := nsCreateCmd.Run(); err != nil {
+			cli.Warning(fmt.Sprintf("Failed to create namespace: %v", err))
+			cli.Info("You may need to create it manually: kubectl create namespace " + namespace)
+		} else {
+			cli.Success("Namespace created")
+		}
+	} else {
+		cli.Success("Namespace exists")
+	}
+
+	// 4. Check permissions
+	cli.Info("Checking permissions...")
+	permCheckCmd := exec.Command("kubectl", "auth", "can-i", "create", "pods", "-n", namespace)
+	if err := permCheckCmd.Run(); err != nil {
+		cli.Warning("You may not have sufficient permissions")
+		cli.Info("Check RBAC: kubectl auth can-i create pods -n " + namespace)
+		cli.Info("You may need cluster-admin privileges for deployment")
+	} else {
+		cli.Success("Permissions verified")
+	}
+
+	// Apply manifests
+	cli.Info("Applying Kubernetes manifests...")
+	manifestDir := "deploy/kubernetes"
+	if _, err := os.Stat(manifestDir); os.IsNotExist(err) {
+		return fmt.Errorf("kubernetes manifests not found: %s", manifestDir)
+	}
+
+	cmd := exec.Command("kubectl", "apply", "-f", manifestDir, "-n", namespace)
+	cmd.Stdout = os.Stdout
+	cmd.Stderr = os.Stderr
+
+	if err := cmd.Run(); err != nil {
+		return fmt.Errorf("failed to apply kubernetes manifests: %w", err)
+	}
+
+	cli.Success("Manifests applied successfully")
+
+	// Post-deployment validation
+	cli.Info("Waiting for pods to be ready...")
+	timeout := 300 // 5 minutes
+	ready := false
+
+	for i := 0; i < timeout; i += 5 {
+		time.Sleep(5 * time.Second)
+
+		// Check pod status
+		podsCmd := exec.Command("kubectl", "get", "pods", "-n", namespace, "-l", "app=semantic-router", "--no-headers")
+		output, err := podsCmd.Output()
+		if err != nil {
+			cli.Info(fmt.Sprintf("Waiting for pods... (%ds/%ds)", i+5, timeout))
+			continue
+		}
+
+		if len(output) == 0 {
+			cli.Info(fmt.Sprintf("Waiting for pods to be created... (%ds/%ds)", i+5, timeout))
+			continue
+		}
+
+		// Count ready pods
+		lines := splitLines(string(output))
+		totalPods := len(lines)
+		readyPods := 0
+
+		for _, line := range lines {
+			if len(line) > 0 {
+				// Simple check: if line contains "Running" and "1/1" or "2/2", etc.
+				// This is a basic heuristic
+				if containsString(line, "Running") {
+					readyPods++
+				}
+			}
+		}
+
+		if readyPods > 0 && readyPods == totalPods {
+			ready = true
+			cli.Success(fmt.Sprintf("All %d pod(s) are ready", readyPods))
+			break
+		}
+
+		if i%10 == 0 {
+			cli.Info(fmt.Sprintf("Waiting for pods... (%d/%d ready, %ds/%ds)", readyPods, totalPods, i+5, timeout))
+		}
+	}
+
+	if !ready {
+		cli.Warning("Timeout waiting for pods to be ready")
+		cli.Info("Check pod status: kubectl get pods -n " + namespace)
+		cli.Info("Check pod logs: kubectl logs -n " + namespace + " -l app=semantic-router")
+		return fmt.Errorf("pods did not become ready within timeout")
+	}
+
+	// Check deployment rollout status
+	cli.Info("Checking deployment rollout status...")
+	rolloutCmd := exec.Command("kubectl", "rollout", "status", "deployment/semantic-router", "-n", namespace, "--timeout=60s")
+	rolloutCmd.Stdout = os.Stdout
+	rolloutCmd.Stderr = os.Stderr
+	if err := rolloutCmd.Run(); err != nil {
+		cli.Warning("Deployment rollout check failed (pods may still be starting)")
+	}
+
+	// Verify service endpoints
+	cli.Info("Verifying service endpoints...")
+	svcCmd := exec.Command("kubectl", "get", "svc", "-n", namespace, "-l", "app=semantic-router")
+	svcCmd.Stdout = os.Stdout
+	svcCmd.Stderr = os.Stderr
+	if err := svcCmd.Run(); err != nil {
+		cli.Warning("Could not verify service endpoints")
+	}
+
+	cli.Success(fmt.Sprintf("Router deployed successfully to Kubernetes namespace: %s", namespace))
+	cli.Info("\nNext steps:")
+	cli.Info("  Check status: kubectl get pods -n " + namespace)
+	cli.Info("  View logs: kubectl logs -n " + namespace + " -l app=semantic-router")
+	cli.Info("  Port forward: kubectl port-forward -n " + namespace + " svc/semantic-router 8080:8080")
+
+	return nil
+}
+
+// UndeployLocal stops the local router process
+func UndeployLocal() error {
+	cli.Info("Stopping local router...")
+
+	// Get cross-platform file paths
+	pidFilePath := getPIDFilePath()
+	logFilePath := getLogFilePath()
+
+	// Check if PID file exists
+	if _, err := os.Stat(pidFilePath); os.IsNotExist(err) {
+		cli.Warning("No PID file found. Router may not be running.")
+		cli.Info("Use: ps aux | grep router")
+		return nil
+	}
+
+	// Read PID from file
+	pidBytes, err := os.ReadFile(pidFilePath)
+	if err != nil {
+		return fmt.Errorf("failed to read PID file: %w", err)
+	}
+
+	pid, err := strconv.Atoi(string(pidBytes))
+	if err != nil {
+		return fmt.Errorf("invalid PID in file: %w", err)
+	}
+
+	// Find the process
+	process, err := os.FindProcess(pid)
+	if err != nil {
+		cli.Warning(fmt.Sprintf("Process %d not found (may have already stopped)", pid))
+		// Clean up PID file anyway
+		os.Remove(pidFilePath)
+		return nil
+	}
+
+	// Send SIGTERM for graceful shutdown
+	cli.Info(fmt.Sprintf("Sending SIGTERM to process %d...", pid))
+	if err := process.Signal(syscall.SIGTERM); err != nil {
+		// Process might already be dead
+		cli.Warning(fmt.Sprintf("Failed to send SIGTERM: %v", err))
+	}
+
+	// Wait for up to 10 seconds for graceful shutdown
+	stopped := false
+	for i := 0; i < 10; i++ {
+		time.Sleep(1 * time.Second)
+		// Try to signal with 0 to check if process exists
+		if err := process.Signal(syscall.Signal(0)); err != nil {
+			// Process is gone
+			stopped = true
+			break
+		}
+		cli.Info(fmt.Sprintf("Waiting for graceful shutdown... (%d/10s)", i+1))
+	}
+
+	// If still running, send SIGKILL
+	if !stopped {
+		cli.Warning("Process did not stop gracefully, sending SIGKILL...")
+		if err := process.Kill(); err != nil {
+			cli.Warning(fmt.Sprintf("Failed to kill process: %v", err))
+		}
+		time.Sleep(1 * time.Second)
+	}
+
+	// Clean up PID file
+	if err := os.Remove(pidFilePath); err != nil {
+		cli.Warning(fmt.Sprintf("Failed to remove PID file: %v", err))
+	}
+
+	// Optionally clean up log file (keep it for now for debugging)
+	// os.Remove(logFilePath)
+
+	cli.Success("Router stopped successfully")
+	cli.Info(fmt.Sprintf("Log file available at: %s", logFilePath))
+	return nil
+}
+
+// UndeployDocker removes Docker Compose deployment
+func UndeployDocker(removeVolumes bool) error {
+	cli.Info("Removing Docker Compose deployment...")
+
+	composeFile := "deploy/docker-compose/docker-compose.yml"
+
+	// Check if docker-compose file exists
+	if _, err := os.Stat(composeFile); os.IsNotExist(err) {
+		return fmt.Errorf("docker-compose file not found: %s", composeFile)
+	}
+
+	// Get list of containers before stopping
+	cli.Info("Identifying running containers...")
+	containersBefore, _ := getDockerContainers("semantic-router")
+
+	// Build docker-compose down command
+	var args []string
+	if commandExists("docker-compose") {
+		args = []string{"-f", composeFile, "down"}
+	} else {
+		args = []string{"compose", "-f", composeFile, "down"}
+	}
+
+	// Add --volumes flag if requested
+	if removeVolumes {
+		args = append(args, "--volumes")
+		cli.Info("Will remove volumes...")
+	}
+
+	// Execute docker-compose down
+	var cmd *exec.Cmd
+	if commandExists("docker-compose") {
+		cmd = exec.Command("docker-compose", args...)
+	} else {
+		cmd = exec.Command("docker", args...)
+	}
+
+	cmd.Stdout = os.Stdout
+	cmd.Stderr = os.Stderr
+
+	if err := cmd.Run(); err != nil {
+		return fmt.Errorf("failed to undeploy: %w", err)
+	}
+
+	// Wait for containers to fully stop (max 30 seconds)
+	cli.Info("Waiting for containers to stop...")
+	stopped := false
+	for i := 0; i < 30; i++ {
+		time.Sleep(1 * time.Second)
+		containers, _ := getDockerContainers("semantic-router")
+		if len(containers) == 0 {
+			stopped = true
+			break
+		}
+		if i%5 == 4 { // Show progress every 5 seconds
+			cli.Info(fmt.Sprintf("Still stopping... (%d/30s, %d containers remaining)", i+1, len(containers)))
+		}
+	}
+
+	if !stopped {
+		cli.Warning("Some containers may still be stopping")
+	}
+
+	// Verify cleanup
+	containersAfter, _ := getDockerContainers("semantic-router")
+	if len(containersAfter) > 0 {
+		cli.Warning(fmt.Sprintf("Warning: %d container(s) still running", len(containersAfter)))
+		for _, container := range containersAfter {
+			cli.Warning(fmt.Sprintf("  - %s", container))
+		}
+	} else {
+		cli.Success("All containers stopped successfully")
+	}
+
+	// Show cleanup summary
+	if len(containersBefore) > 0 {
+		cli.Info(fmt.Sprintf("Cleaned up %d container(s):", len(containersBefore)))
+		for _, container := range containersBefore {
+			cli.Info(fmt.Sprintf("  ✓ %s", container))
+		}
+	}
+
+	if removeVolumes {
+		cli.Success("Router undeployed (volumes removed)")
+	} else {
+		cli.Success("Router undeployed (volumes preserved)")
+		cli.Info("To remove volumes, use: vsr undeploy docker --volumes")
+	}
+
+	return nil
+}
+
+// UndeployKubernetes removes Kubernetes deployment
+func UndeployKubernetes(namespace string, wait bool) error {
+	cli.Info("Removing Kubernetes deployment...")
+
+	// Check if kubectl exists
+	if !commandExists("kubectl") {
+		return fmt.Errorf("kubectl not found. Please install kubectl")
+	}
+
+	// Check if namespace exists
+	checkCmd := exec.Command("kubectl", "get", "namespace", namespace)
+	if err := checkCmd.Run(); err != nil {
+		cli.Warning(fmt.Sprintf("Namespace '%s' not found or not accessible", namespace))
+		return nil
+	}
+
+	manifestDir := "deploy/kubernetes"
+	if _, err := os.Stat(manifestDir); os.IsNotExist(err) {
+		cli.Warning(fmt.Sprintf("Manifest directory not found: %s", manifestDir))
+		cli.Info("Attempting to delete by label...")
+		// Try deleting by common labels
+		labelCmd := exec.Command("kubectl", "delete", "all", "-l", "app=semantic-router", "-n", namespace)
+		labelCmd.Stdout = os.Stdout
+		labelCmd.Stderr = os.Stderr
+		if err := labelCmd.Run(); err != nil {
+			return fmt.Errorf("failed to delete resources: %w", err)
+		}
+	} else {
+		// Delete using manifest files
+		cmd := exec.Command("kubectl", "delete", "-f", manifestDir, "-n", namespace)
+		cmd.Stdout = os.Stdout
+		cmd.Stderr = os.Stderr
+
+		if err := cmd.Run(); err != nil {
+			cli.Warning(fmt.Sprintf("Some resources may not have been deleted: %v", err))
+			// Don't return error, continue to wait/verify
+		}
+	}
+
+	// Wait for pods to terminate if requested
+	if wait {
+		cli.Info("Waiting for pods to terminate...")
+		timeout := 5 * 60 // 5 minutes in seconds
+		stopped := false
+
+		for i := 0; i < timeout; i += 2 {
+			time.Sleep(2 * time.Second)
+
+			// Check for pods
+			checkCmd := exec.Command("kubectl", "get", "pods", "-n", namespace, "-l", "app=semantic-router", "--no-headers")
+			output, err := checkCmd.Output()
+
+			if err != nil || len(output) == 0 {
+				// No pods found or error (likely no resources)
+				stopped = true
+				break
+			}
+
+			// Count remaining pods
+			podCount := len(splitLines(string(output)))
+			if podCount == 0 {
+				stopped = true
+				break
+			}
+
+			// Show progress every 10 seconds
+			if i%10 == 0 {
+				cli.Info(fmt.Sprintf("Waiting for pods to terminate... (%ds/%ds, %d pods remaining)", i, timeout, podCount))
+			}
+		}
+
+		if !stopped {
+			cli.Warning("Timeout waiting for all pods to terminate")
+			cli.Info("Some resources may still be terminating in the background")
+		} else {
+			cli.Success("All pods terminated successfully")
+		}
+	}
+
+	// Verify cleanup
+	verifyCmd := exec.Command("kubectl", "get", "all", "-n", namespace, "-l", "app=semantic-router", "--no-headers")
+	output, err := verifyCmd.Output()
+	if err == nil && len(output) > 0 {
+		remainingResources := len(splitLines(string(output)))
+		if remainingResources > 0 {
+			cli.Warning(fmt.Sprintf("Warning: %d resource(s) may still exist", remainingResources))
+			cli.Info("Check with: kubectl get all -n " + namespace + " -l app=semantic-router")
+		}
+	}
+
+	cli.Success(fmt.Sprintf("Router undeployed from Kubernetes namespace: %s", namespace))
+	if !wait {
+		cli.Info("Resources may still be terminating in the background")
+		cli.Info("Use --wait flag to wait for complete cleanup")
+	}
+
+	return nil
+}
+
+// CheckStatus checks the status of all deployments
+func CheckStatus(namespace string) error {
+	cli.Info("Checking router status...")
+
+	foundAny := false
+
+	// Check local deployment
+	localStatus := DetectLocalDeployment()
+	if localStatus.IsRunning {
+		foundAny = true
+		displayDeploymentStatus(localStatus)
+	}
+
+	// Check Docker deployment
+	dockerStatus := DetectDockerDeployment()
+	if dockerStatus.IsRunning {
+		foundAny = true
+		displayDeploymentStatus(dockerStatus)
+	}
+
+	// Check Kubernetes deployment
+	k8sStatus := DetectKubernetesDeployment(namespace)
+	if k8sStatus.IsRunning {
+		foundAny = true
+		displayDeploymentStatus(k8sStatus)
+	}
+
+	// Check Helm deployment
+	helmStatus := DetectHelmDeployment(namespace)
+	if helmStatus.IsRunning {
+		foundAny = true
+		displayDeploymentStatus(helmStatus)
+	}
+
+	if !foundAny {
+		cli.Warning("No router deployments found")
+		cli.Info("Deploy the router with: vsr deploy [local|docker|kubernetes|helm]")
+	}
+
+	return nil
+}
+
+// DetectLocalDeployment checks for local deployment
+func DetectLocalDeployment() *DeploymentStatus {
+	status := &DeploymentStatus{
+		Type:      "local",
+		IsRunning: false,
+	}
+
+	// Get cross-platform file paths
+	pidFilePath := getPIDFilePath()
+	logFilePath := getLogFilePath()
+
+	// Check if PID file exists
+	if _, err := os.Stat(pidFilePath); err == nil {
+		pidBytes, err := os.ReadFile(pidFilePath)
+		if err == nil {
+			pid, err := strconv.Atoi(string(pidBytes))
+			if err == nil {
+				// Check if process is running
+				process, err := os.FindProcess(pid)
+				if err == nil {
+					// Try to signal the process
+					if err := process.Signal(syscall.Signal(0)); err == nil {
+						status.IsRunning = true
+						status.Components = []ComponentStatus{
+							{
+								Name:    "router",
+								Status:  "running",
+								Message: fmt.Sprintf("PID: %d", pid),
+							},
+						}
+						status.Endpoints = []string{
+							"Check logs: " + logFilePath,
+						}
+					}
+				}
+			}
+		}
+	}
+
+	return status
+}
+
+// DetectDockerDeployment checks for Docker Compose deployment
+func DetectDockerDeployment() *DeploymentStatus {
+	status := &DeploymentStatus{
+		Type:      "docker-compose",
+		IsRunning: false,
+	}
+
+	if !isDockerRunning() {
+		return status
+	}
+
+	// Get Docker containers
+	containers, err := getDockerContainers("semantic-router")
+	if err != nil || len(containers) == 0 {
+		return status
+	}
+
+	status.IsRunning = true
+	status.Components = []ComponentStatus{}
+	status.Endpoints = []string{
+		"Router API: http://localhost:8080",
+		"Envoy Proxy: http://localhost:8801",
+		"Dashboard: http://localhost:8700",
+		"Grafana: http://localhost:3000",
+	}
+
+	// Get detailed status for each container
+	for _, container := range containers {
+		inspectCmd := exec.Command("docker", "inspect", "--format", "{{.State.Status}}", container)
+		output, err := inspectCmd.Output()
+		containerStatus := "unknown"
+		if err == nil {
+			containerStatus = strings.TrimSpace(string(output))
+		}
+
+		status.Components = append(status.Components, ComponentStatus{
+			Name:    container,
+			Status:  containerStatus,
+			Message: "",
+		})
+	}
+
+	return status
+}
+
+// DetectKubernetesDeployment checks for Kubernetes deployment
+func DetectKubernetesDeployment(namespace string) *DeploymentStatus {
+	status := &DeploymentStatus{
+		Type:      "kubernetes",
+		IsRunning: false,
+		Namespace: namespace,
+	}
+
+	if !commandExists("kubectl") {
+		return status
+	}
+
+	// Check for pods
+	cmd := exec.Command("kubectl", "get", "pods", "-n", namespace, "-l", "app=semantic-router", "--no-headers")
+	output, err := cmd.Output()
+	if err != nil || len(output) == 0 {
+		return status
+	}
+
+	lines := splitLines(string(output))
+	if len(lines) == 0 {
+		return status
+	}
+
+	status.IsRunning = true
+	status.Components = []ComponentStatus{}
+
+	for _, line := range lines {
+		if line == "" {
+			continue
+		}
+		fields := strings.Fields(line)
+		if len(fields) >= 3 {
+			podName := fields[0]
+			podStatus := fields[2]
+			status.Components = append(status.Components, ComponentStatus{
+				Name:    podName,
+				Status:  podStatus,
+				Message: "",
+			})
+		}
+	}
+
+	// Get service info
+	svcCmd := exec.Command("kubectl", "get", "svc", "-n", namespace, "-l", "app=semantic-router", "--no-headers")
+	svcOutput, err := svcCmd.Output()
+	if err == nil && len(svcOutput) > 0 {
+		status.Endpoints = []string{
+			fmt.Sprintf("Check services: kubectl get svc -n %s", namespace),
+		}
+	}
+
+	return status
+}
+
+// displayDeploymentStatus displays the status of a deployment
+func displayDeploymentStatus(status *DeploymentStatus) {
+	cli.Info("\n╔═══════════════════════════════════════╗")
+	cli.Info(fmt.Sprintf("║ Deployment: %-26s║", status.Type))
+	cli.Info("╚═══════════════════════════════════════╝")
+
+	if status.Namespace != "" {
+		cli.Info(fmt.Sprintf("Namespace: %s", status.Namespace))
+	}
+
+	if status.ReleaseName != "" {
+		cli.Info(fmt.Sprintf("Release: %s", status.ReleaseName))
+	}
+
+	// Show components
+	if len(status.Components) > 0 {
+		cli.Info("\nComponents:")
+		for _, comp := range status.Components {
+			statusSymbol := "✓"
+			if comp.Status != "running" && comp.Status != "Running" {
+				statusSymbol = "⚠"
+			}
+			msg := comp.Message
+			if msg != "" {
+				cli.Info(fmt.Sprintf("  %s %-30s %-15s %s", statusSymbol, comp.Name, comp.Status, msg))
+			} else {
+				cli.Info(fmt.Sprintf("  %s %-30s %s", statusSymbol, comp.Name, comp.Status))
+			}
+		}
+	}
+
+	// Show endpoints
+	if len(status.Endpoints) > 0 {
+		cli.Info("\nEndpoints:")
+		for _, endpoint := range status.Endpoints {
+			cli.Info(fmt.Sprintf("  %s", endpoint))
+		}
+	}
+
+	fmt.Println() // Extra newline for spacing
+}
+
+// FetchLogs fetches logs from the router with auto-detection
+func FetchLogs(follow bool, tail int, namespace, deployType, component string, since string, grep string) error {
+	cli.Info("Fetching router logs...")
+
+	// Auto-detect deployment type if not specified
+	if deployType == "" {
+		deployType = detectDeploymentType(namespace)
+		if deployType == "" {
+			cli.Warning("Could not detect router deployment")
+			cli.Info("Specify deployment type with: vsr logs --env [local|docker|kubernetes|helm]")
+			return fmt.Errorf("no router deployment found")
+		}
+		cli.Info(fmt.Sprintf("Detected deployment type: %s", deployType))
+	}
+
+	// Fetch logs based on deployment type
+	switch deployType {
+	case "local":
+		return fetchLocalLogs(follow, tail, since, grep)
+	case "docker":
+		return fetchDockerLogsEnhanced(follow, tail, component, since, grep)
+	case "kubernetes":
+		return fetchKubernetesLogs(follow, tail, namespace, component, since, grep)
+	case "helm":
+		return fetchHelmLogs(follow, tail, namespace, component, since, grep)
+	default:
+		return fmt.Errorf("unsupported deployment type: %s", deployType)
+	}
+}
+
+// detectDeploymentType detects the deployment type
+func detectDeploymentType(namespace string) string {
+	// Check in order of specificity
+	if DetectHelmDeployment(namespace).IsRunning {
+		return "helm"
+	}
+	if DetectKubernetesDeployment(namespace).IsRunning {
+		return "kubernetes"
+	}
+	if DetectDockerDeployment().IsRunning {
+		return "docker"
+	}
+	if DetectLocalDeployment().IsRunning {
+		return "local"
+	}
+	return ""
+}
+
+// fetchLocalLogs fetches logs from local deployment
+func fetchLocalLogs(follow bool, tail int, since string, grep string) error {
+	logFilePath := getLogFilePath()
+
+	if _, err := os.Stat(logFilePath); os.IsNotExist(err) {
+		return fmt.Errorf("log file not found: %s", logFilePath)
+	}
+
+	if follow {
+		// Use tail -f for following logs
+		args := []string{"-f"}
+		if tail > 0 {
+			args = append(args, "-n", fmt.Sprintf("%d", tail))
+		}
+		args = append(args, logFilePath)
+
+		cmd := exec.Command("tail", args...)
+
+		if grep != "" {
+			// Pipe through grep if pattern specified
+			grepCmd := exec.Command("grep", "--color=always", grep)
+			grepCmd.Stdin, _ = cmd.StdoutPipe()
+			grepCmd.Stdout = os.Stdout
+			grepCmd.Stderr = os.Stderr
+
+			if err := cmd.Start(); err != nil {
+				return fmt.Errorf("failed to start tail: %w", err)
+			}
+			if err := grepCmd.Run(); err != nil {
+				_ = cmd.Process.Kill()
+				return fmt.Errorf("grep failed: %w", err)
+			}
+			return cmd.Wait()
+		}
+
+		cmd.Stdout = os.Stdout
+		cmd.Stderr = os.Stderr
+		return cmd.Run()
+	}
+
+	// Non-following mode - just cat with tail
+	args := []string{}
+	if tail > 0 {
+		args = append(args, "-n", fmt.Sprintf("%d", tail))
+	}
+	args = append(args, logFilePath)
+
+	cmd := exec.Command("tail", args...)
+
+	if grep != "" {
+		grepCmd := exec.Command("grep", "--color=always", grep)
+		grepCmd.Stdin, _ = cmd.StdoutPipe()
+		grepCmd.Stdout = os.Stdout
+		grepCmd.Stderr = os.Stderr
+
+		if err := cmd.Start(); err != nil {
+			return fmt.Errorf("failed to start tail: %w", err)
+		}
+		if err := grepCmd.Run(); err != nil {
+			_ = cmd.Process.Kill()
+			return fmt.Errorf("grep failed: %w", err)
+		}
+		return cmd.Wait()
+	}
+
+	cmd.Stdout = os.Stdout
+	cmd.Stderr = os.Stderr
+	return cmd.Run()
+}
+
+// fetchDockerLogsEnhanced fetches logs from Docker Compose deployment
+func fetchDockerLogsEnhanced(follow bool, tail int, component string, since string, grep string) error {
+	if !isDockerRunning() {
+		return fmt.Errorf("docker is not running")
+	}
+
+	// Get list of containers
+	containers, err := getDockerContainers("semantic-router")
+	if err != nil || len(containers) == 0 {
+		return fmt.Errorf("no semantic-router containers found")
+	}
+
+	// Filter by component if specified
+	targetContainers := containers
+	if component != "" && component != "all" {
+		targetContainers = []string{}
+		for _, container := range containers {
+			if containsString(container, component) {
+				targetContainers = append(targetContainers, container)
+			}
+		}
+		if len(targetContainers) == 0 {
+			return fmt.Errorf("no containers found matching component: %s", component)
+		}
+	}
+
+	// Build docker logs command
+	for _, container := range targetContainers {
+		cli.Info(fmt.Sprintf("=== Logs from: %s ===", container))
+
+		args := []string{"logs"}
+		if follow {
+			args = append(args, "-f")
+		}
+		if tail > 0 {
+			args = append(args, "--tail", fmt.Sprintf("%d", tail))
+		}
+		if since != "" {
+			args = append(args, "--since", since)
+		}
+		args = append(args, container)
+
+		cmd := exec.Command("docker", args...)
+
+		if grep != "" {
+			grepCmd := exec.Command("grep", "--color=always", grep)
+			grepCmd.Stdin, _ = cmd.StdoutPipe()
+			grepCmd.Stdout = os.Stdout
+			grepCmd.Stderr = os.Stderr
+
+			if err := cmd.Start(); err != nil {
+				return fmt.Errorf("failed to start docker logs: %w", err)
+			}
+			if err := grepCmd.Run(); err != nil {
+				_ = cmd.Process.Kill()
+				return fmt.Errorf("grep failed: %w", err)
+			}
+			if err := cmd.Wait(); err != nil {
+				return err
+			}
+		} else {
+			cmd.Stdout = os.Stdout
+			cmd.Stderr = os.Stderr
+			if err := cmd.Run(); err != nil {
+				return fmt.Errorf("failed to fetch logs from %s: %w", container, err)
+			}
+		}
+
+		fmt.Println() // Add spacing between containers
+	}
+
+	return nil
+}
+
+// fetchKubernetesLogs fetches logs from Kubernetes deployment
+func fetchKubernetesLogs(follow bool, tail int, namespace string, component string, since string, grep string) error {
+	if !commandExists("kubectl") {
+		return fmt.Errorf("kubectl not found")
+	}
+
+	// Build label selector
+	labelSelector := "app=semantic-router"
+	if component != "" && component != "all" {
+		labelSelector = fmt.Sprintf("app=semantic-router,component=%s", component)
+	}
+
+	// Build kubectl logs command
+	args := []string{"logs", "-n", namespace, "-l", labelSelector}
+	if follow {
+		args = append(args, "-f")
+	}
+	if tail > 0 {
+		args = append(args, "--tail", fmt.Sprintf("%d", tail))
+	}
+	if since != "" {
+		args = append(args, "--since", since)
+	}
+	args = append(args, "--all-containers=true", "--prefix=true")
+
+	cmd := exec.Command("kubectl", args...)
+
+	if grep != "" {
+		grepCmd := exec.Command("grep", "--color=always", grep)
+		grepCmd.Stdin, _ = cmd.StdoutPipe()
+		grepCmd.Stdout = os.Stdout
+		grepCmd.Stderr = os.Stderr
+
+		if err := cmd.Start(); err != nil {
+			return fmt.Errorf("failed to start kubectl logs: %w", err)
+		}
+		if err := grepCmd.Run(); err != nil {
+			_ = cmd.Process.Kill()
+			return fmt.Errorf("grep failed: %w", err)
+		}
+		return cmd.Wait()
+	}
+
+	cmd.Stdout = os.Stdout
+	cmd.Stderr = os.Stderr
+	return cmd.Run()
+}
+
+// fetchHelmLogs fetches logs from Helm deployment
+func fetchHelmLogs(follow bool, tail int, namespace string, component string, since string, grep string) error {
+	// Helm deployments use Kubernetes, so we can reuse the K8s log fetching
+	// but with different label selector
+	if !commandExists("kubectl") {
+		return fmt.Errorf("kubectl not found")
+	}
+
+	// Get release name
+	helmStatus := DetectHelmDeployment(namespace)
+	if !helmStatus.IsRunning {
+		return fmt.Errorf("no helm deployment found in namespace: %s", namespace)
+	}
+
+	// Build label selector for Helm
+	labelSelector := fmt.Sprintf("app.kubernetes.io/instance=%s", helmStatus.ReleaseName)
+	if component != "" && component != "all" {
+		labelSelector = fmt.Sprintf("%s,app.kubernetes.io/component=%s", labelSelector, component)
+	}
+
+	// Build kubectl logs command
+	args := []string{"logs", "-n", namespace, "-l", labelSelector}
+	if follow {
+		args = append(args, "-f")
+	}
+	if tail > 0 {
+		args = append(args, "--tail", fmt.Sprintf("%d", tail))
+	}
+	if since != "" {
+		args = append(args, "--since", since)
+	}
+	args = append(args, "--all-containers=true", "--prefix=true")
+
+	cmd := exec.Command("kubectl", args...)
+
+	if grep != "" {
+		grepCmd := exec.Command("grep", "--color=always", grep)
+		grepCmd.Stdin, _ = cmd.StdoutPipe()
+		grepCmd.Stdout = os.Stdout
+		grepCmd.Stderr = os.Stderr
+
+		if err := cmd.Start(); err != nil {
+			return fmt.Errorf("failed to start kubectl logs: %w", err)
+		}
+		if err := grepCmd.Run(); err != nil {
+			_ = cmd.Process.Kill()
+			return fmt.Errorf("grep failed: %w", err)
+		}
+		return cmd.Wait()
+	}
+
+	cmd.Stdout = os.Stdout
+	cmd.Stderr = os.Stderr
+	return cmd.Run()
+}
+
+// Helper functions
+
+func buildRouter() error {
+	cmd := exec.Command("make", "build")
+	cmd.Stdout = os.Stdout
+	cmd.Stderr = os.Stderr
+	return cmd.Run()
+}
+
+func commandExists(cmd string) bool {
+	_, err := exec.LookPath(cmd)
+	return err == nil
+}
+
+func isDockerRunning() bool {
+	cmd := exec.Command("docker", "ps")
+	return cmd.Run() == nil
+}
+
+func getDockerContainers(nameFilter string) ([]string, error) {
+	//nolint:gosec // G204: nameFilter is from internal use, not user input
+	cmd := exec.Command("docker", "ps", "--filter", fmt.Sprintf("name=%s", nameFilter), "--format", "{{.Names}}")
+	output, err := cmd.Output()
+	if err != nil {
+		return nil, err
+	}
+
+	containers := []string{}
+	if len(output) > 0 {
+		lines := string(output)
+		for _, line := range splitLines(lines) {
+			if line != "" {
+				containers = append(containers, line)
+			}
+		}
+	}
+	return containers, nil
+}
+
+func splitLines(s string) []string {
+	var lines []string
+	start := 0
+	for i, c := range s {
+		if c == '\n' {
+			lines = append(lines, s[start:i])
+			start = i + 1
+		}
+	}
+	if start < len(s) {
+		lines = append(lines, s[start:])
+	}
+	return lines
+}
+
+func containsString(s, substr string) bool {
+	return len(s) >= len(substr) && (s == substr || len(s) > len(substr) && findSubstring(s, substr))
+}
+
+func findSubstring(s, substr string) bool {
+	for i := 0; i <= len(s)-len(substr); i++ {
+		if s[i:i+len(substr)] == substr {
+			return true
+		}
+	}
+	return false
+}
diff --git a/src/semantic-router/pkg/cli/deployment/deployment_test.go b/src/semantic-router/pkg/cli/deployment/deployment_test.go
new file mode 100644
index 000000000..bf28c3587
--- /dev/null
+++ b/src/semantic-router/pkg/cli/deployment/deployment_test.go
@@ -0,0 +1,426 @@
+package deployment
+
+import (
+	"os"
+	"path/filepath"
+	"testing"
+)
+
+func TestCommandExists(t *testing.T) {
+	tests := []struct {
+		name     string
+		command  string
+		expected bool
+	}{
+		{
+			name:     "existing command - ls",
+			command:  "ls",
+			expected: true,
+		},
+		{
+			name:     "existing command - echo",
+			command:  "echo",
+			expected: true,
+		},
+		{
+			name:     "non-existing command",
+			command:  "nonexistentcommand12345",
+			expected: false,
+		},
+		{
+			name:     "kubectl may or may not exist",
+			command:  "kubectl",
+			expected: commandExists("kubectl"), // whatever the actual state is
+		},
+		{
+			name:     "docker may or may not exist",
+			command:  "docker",
+			expected: commandExists("docker"),
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			result := commandExists(tt.command)
+			if result != tt.expected {
+				t.Errorf("commandExists(%q) = %v, expected %v", tt.command, result, tt.expected)
+			}
+		})
+	}
+}
+
+func TestSplitLines(t *testing.T) {
+	tests := []struct {
+		name     string
+		input    string
+		expected []string
+	}{
+		{
+			name:     "empty string",
+			input:    "",
+			expected: []string{},
+		},
+		{
+			name:     "single line",
+			input:    "hello",
+			expected: []string{"hello"},
+		},
+		{
+			name:     "two lines",
+			input:    "hello\nworld",
+			expected: []string{"hello", "world"},
+		},
+		{
+			name:     "three lines",
+			input:    "line1\nline2\nline3",
+			expected: []string{"line1", "line2", "line3"},
+		},
+		{
+			name:     "lines with trailing newline",
+			input:    "line1\nline2\n",
+			expected: []string{"line1", "line2"},
+		},
+		{
+			name:     "lines with empty lines",
+			input:    "line1\n\nline3",
+			expected: []string{"line1", "", "line3"},
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			result := splitLines(tt.input)
+			if len(result) != len(tt.expected) {
+				t.Errorf("splitLines(%q) returned %d lines, expected %d", tt.input, len(result), len(tt.expected))
+				return
+			}
+			for i, line := range result {
+				if line != tt.expected[i] {
+					t.Errorf("splitLines(%q)[%d] = %q, expected %q", tt.input, i, line, tt.expected[i])
+				}
+			}
+		})
+	}
+}
+
+func TestContainsString(t *testing.T) {
+	tests := []struct {
+		name     string
+		s        string
+		substr   string
+		expected bool
+	}{
+		{
+			name:     "substring found",
+			s:        "hello world",
+			substr:   "world",
+			expected: true,
+		},
+		{
+			name:     "substring not found",
+			s:        "hello world",
+			substr:   "foo",
+			expected: false,
+		},
+		{
+			name:     "substring at beginning",
+			s:        "hello world",
+			substr:   "hello",
+			expected: true,
+		},
+		{
+			name:     "substring at end",
+			s:        "hello world",
+			substr:   "world",
+			expected: true,
+		},
+		{
+			name:     "exact match",
+			s:        "hello",
+			substr:   "hello",
+			expected: true,
+		},
+		{
+			name:     "empty substring",
+			s:        "hello",
+			substr:   "",
+			expected: true,
+		},
+		{
+			name:     "substring longer than string",
+			s:        "hi",
+			substr:   "hello",
+			expected: false,
+		},
+		{
+			name:     "case sensitive",
+			s:        "Hello World",
+			substr:   "hello",
+			expected: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			result := containsString(tt.s, tt.substr)
+			if result != tt.expected {
+				t.Errorf("containsString(%q, %q) = %v, expected %v", tt.s, tt.substr, result, tt.expected)
+			}
+		})
+	}
+}
+
+func TestFindSubstring(t *testing.T) {
+	tests := []struct {
+		name     string
+		s        string
+		substr   string
+		expected bool
+	}{
+		{
+			name:     "substring found",
+			s:        "hello world",
+			substr:   "world",
+			expected: true,
+		},
+		{
+			name:     "substring not found",
+			s:        "hello world",
+			substr:   "foo",
+			expected: false,
+		},
+		{
+			name:     "substring at beginning",
+			s:        "hello world",
+			substr:   "hello",
+			expected: true,
+		},
+		{
+			name:     "multiple occurrences",
+			s:        "hello hello",
+			substr:   "hello",
+			expected: true,
+		},
+		{
+			name:     "overlapping patterns",
+			s:        "aaaa",
+			substr:   "aa",
+			expected: true,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			result := findSubstring(tt.s, tt.substr)
+			if result != tt.expected {
+				t.Errorf("findSubstring(%q, %q) = %v, expected %v", tt.s, tt.substr, result, tt.expected)
+			}
+		})
+	}
+}
+
+func TestGetDockerContainers(t *testing.T) {
+	// Skip if docker is not available
+	if !commandExists("docker") {
+		t.Skip("Docker not available, skipping test")
+	}
+
+	tests := []struct {
+		name        string
+		nameFilter  string
+		expectError bool
+	}{
+		{
+			name:        "filter by semantic-router",
+			nameFilter:  "semantic-router",
+			expectError: false,
+		},
+		{
+			name:        "filter by nonexistent name",
+			nameFilter:  "nonexistentcontainer12345",
+			expectError: false,
+		},
+		{
+			name:        "empty filter",
+			nameFilter:  "",
+			expectError: false,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			containers, err := getDockerContainers(tt.nameFilter)
+
+			if tt.expectError {
+				if err == nil {
+					t.Error("Expected error but got none")
+				}
+			} else {
+				if err != nil {
+					t.Errorf("Unexpected error: %v", err)
+				}
+				// containers should be a slice (may be empty)
+				if containers == nil {
+					t.Error("Expected non-nil slice")
+				}
+			}
+		})
+	}
+}
+
+func TestIsDockerRunning(t *testing.T) {
+	// This test checks if the function works, not necessarily if Docker is running
+	result := isDockerRunning()
+
+	// Result should be boolean (no error to check)
+	// Just verify the function returns without panicking
+	t.Logf("isDockerRunning() returned: %v", result)
+
+	// If docker command exists, the result should match commandExists
+	if commandExists("docker") {
+		// Docker command exists, so isDockerRunning should at least try to run
+		// The result depends on whether Docker daemon is actually running
+		t.Logf("Docker command exists, isDockerRunning returned: %v", result)
+	} else if result {
+		// If docker command doesn't exist, isDockerRunning should return false
+		t.Error("isDockerRunning() should return false when docker command doesn't exist")
+	}
+}
+
+func TestPIDFileOperations(t *testing.T) {
+	// Test PID file path functions
+	pidFilePath := getPIDFilePath()
+	logFilePath := getLogFilePath()
+
+	if pidFilePath == "" {
+		t.Error("pidFilePath should not be empty")
+	}
+
+	if logFilePath == "" {
+		t.Error("logFilePath should not be empty")
+	}
+
+	// Verify paths are absolute
+	if !filepath.IsAbs(pidFilePath) {
+		t.Errorf("pidFilePath should be absolute, got: %s", pidFilePath)
+	}
+
+	if !filepath.IsAbs(logFilePath) {
+		t.Errorf("logFilePath should be absolute, got: %s", logFilePath)
+	}
+}
+
+func TestDeployLocalPIDFileCreation(t *testing.T) {
+	// This is an integration test that would require actually running DeployLocal
+	// For now, we just verify the functions return valid paths
+	t.Run("verify PID file path", func(t *testing.T) {
+		pidFilePath := getPIDFilePath()
+		if pidFilePath == "" {
+			t.Error("getPIDFilePath() returned empty string")
+		}
+		if !filepath.IsAbs(pidFilePath) {
+			t.Errorf("getPIDFilePath() should return absolute path, got: %s", pidFilePath)
+		}
+	})
+
+	t.Run("verify log file path", func(t *testing.T) {
+		logFilePath := getLogFilePath()
+		if logFilePath == "" {
+			t.Error("getLogFilePath() returned empty string")
+		}
+		if !filepath.IsAbs(logFilePath) {
+			t.Errorf("getLogFilePath() should return absolute path, got: %s", logFilePath)
+		}
+	})
+}
+
+func TestUndeployLocalWithNoPIDFile(t *testing.T) {
+	// Ensure PID file doesn't exist
+	pidFilePath := getPIDFilePath()
+	os.Remove(pidFilePath)
+
+	// Call UndeployLocal - it should handle missing PID file gracefully
+	err := UndeployLocal()
+	// Should not return error for missing PID file
+	if err != nil {
+		t.Errorf("UndeployLocal should handle missing PID file gracefully, got error: %v", err)
+	}
+}
+
+func TestBuildRouter(t *testing.T) {
+	// Skip if make is not available
+	if !commandExists("make") {
+		t.Skip("make not available, skipping test")
+	}
+
+	// This is a smoke test - we don't actually want to build in unit tests
+	// Just verify the function exists and can be called
+	t.Run("buildRouter function accessible", func(t *testing.T) {
+		// We can't easily test buildRouter without side effects
+		// The function exists and is called by DeployLocal
+		// This test just documents that it's available
+		t.Log("buildRouter function is accessible via DeployLocal")
+	})
+}
+
+// Mock tests for deployment functions (without actual execution)
+
+func TestDeployDockerValidation(t *testing.T) {
+	t.Run("missing docker-compose file", func(t *testing.T) {
+		// Create a temporary config file
+		tmpDir := t.TempDir()
+		configPath := filepath.Join(tmpDir, "config.yaml")
+
+		// Write minimal config
+		configContent := `
+bert_model:
+  model_id: "test-model"
+  threshold: 0.8
+vllm_endpoints:
+  - name: "test"
+    address: "127.0.0.1"
+    port: 8000
+model_config:
+  test-model:
+    pricing:
+      prompt: 0.01
+      completion: 0.02
+default_model: "test-model"
+`
+		if err := os.WriteFile(configPath, []byte(configContent), 0o644); err != nil {
+			t.Fatal(err)
+		}
+
+		// DeployDocker should fail if docker-compose file doesn't exist
+		// (This would need to be in a different directory without the actual docker-compose.yml)
+		// For now, just verify the function signature
+		t.Skip("Skipping actual deployment test")
+	})
+}
+
+func TestUndeployDockerVolumeFlag(t *testing.T) {
+	t.Run("removeVolumes parameter", func(t *testing.T) {
+		// Test that the function accepts the removeVolumes parameter
+		// We can't test actual execution without Docker running
+		// Just verify the signature works
+
+		// Skip actual execution
+		t.Skip("Skipping actual undeploy test")
+
+		// This would fail without Docker, but shows parameter usage:
+		// err := UndeployDocker(false)
+		// err := UndeployDocker(true)
+	})
+}
+
+func TestUndeployKubernetesWaitFlag(t *testing.T) {
+	t.Run("wait parameter", func(t *testing.T) {
+		// Test that the function accepts the wait parameter
+		// Skip actual execution
+		t.Skip("Skipping actual undeploy test")
+
+		// This would fail without kubectl, but shows parameter usage:
+		// err := UndeployKubernetes("default", false)
+		// err := UndeployKubernetes("default", true)
+	})
+}
diff --git a/src/semantic-router/pkg/cli/deployment/helm.go b/src/semantic-router/pkg/cli/deployment/helm.go
new file mode 100644
index 000000000..a8b234b8e
--- /dev/null
+++ b/src/semantic-router/pkg/cli/deployment/helm.go
@@ -0,0 +1,365 @@
+package deployment
+
+import (
+	"fmt"
+	"os"
+	"os/exec"
+	"path/filepath"
+	"strings"
+	"time"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+)
+
+const (
+	defaultHelmRelease = "semantic-router"
+	defaultHelmChart   = "deploy/helm/semantic-router"
+)
+
+// DeployHelm deploys using Helm chart
+func DeployHelm(configPath, namespace string, releaseName string, withObs bool, setValues []string) error {
+	cli.Info("Deploying router with Helm...")
+
+	// Pre-deployment checks
+	cli.Info("Running pre-deployment checks...")
+
+	// 1. Check if helm exists
+	if !commandExists("helm") {
+		cli.Error("helm not found")
+		cli.Info("Install Helm: https://helm.sh/docs/intro/install/")
+		return fmt.Errorf("helm not found")
+	}
+
+	// 2. Check if kubectl exists (Helm needs it)
+	if !commandExists("kubectl") {
+		cli.Error("kubectl not found")
+		cli.Info("Install kubectl: https://kubernetes.io/docs/tasks/tools/")
+		return fmt.Errorf("kubectl not found")
+	}
+
+	// 3. Check cluster connectivity
+	cli.Info("Checking cluster connectivity...")
+	clusterInfoCmd := exec.Command("kubectl", "cluster-info")
+	if err := clusterInfoCmd.Run(); err != nil {
+		cli.Error("Unable to connect to Kubernetes cluster")
+		cli.Info("Check your kubeconfig: kubectl config view")
+		return fmt.Errorf("no connection to Kubernetes cluster")
+	}
+	cli.Success("Cluster connection verified")
+
+	// 4. Check/create namespace
+	cli.Info(fmt.Sprintf("Checking namespace '%s'...", namespace))
+	nsCheckCmd := exec.Command("kubectl", "get", "namespace", namespace)
+	if err := nsCheckCmd.Run(); err != nil {
+		cli.Info(fmt.Sprintf("Creating namespace '%s'...", namespace))
+		nsCreateCmd := exec.Command("kubectl", "create", "namespace", namespace)
+		if err := nsCreateCmd.Run(); err != nil {
+			cli.Warning(fmt.Sprintf("Failed to create namespace: %v", err))
+		} else {
+			cli.Success("Namespace created")
+		}
+	} else {
+		cli.Success("Namespace exists")
+	}
+
+	// 5. Verify chart exists
+	chartPath := defaultHelmChart
+	if !filepath.IsAbs(chartPath) {
+		absChart, err := filepath.Abs(chartPath)
+		if err == nil {
+			chartPath = absChart
+		}
+	}
+
+	if _, err := os.Stat(chartPath); os.IsNotExist(err) {
+		return fmt.Errorf("helm chart not found: %s", chartPath)
+	}
+
+	// Set release name
+	if releaseName == "" {
+		releaseName = defaultHelmRelease
+	}
+
+	// Check if release already exists
+	checkCmd := exec.Command("helm", "list", "-n", namespace, "-q")
+	output, _ := checkCmd.Output()
+	releases := strings.Split(strings.TrimSpace(string(output)), "\n")
+	releaseExists := false
+	for _, r := range releases {
+		if r == releaseName {
+			releaseExists = true
+			break
+		}
+	}
+
+	// Build helm command
+	var cmd *exec.Cmd
+	var action string
+
+	if releaseExists {
+		cli.Info(fmt.Sprintf("Release '%s' already exists, upgrading...", releaseName))
+		action = "upgrade"
+		cmd = exec.Command("helm", "upgrade", releaseName, chartPath, "-n", namespace, "--wait")
+	} else {
+		cli.Info("Installing Helm release...")
+		action = "install"
+		cmd = exec.Command("helm", "install", releaseName, chartPath, "-n", namespace, "--wait", "--create-namespace")
+	}
+
+	// Add config file override if provided
+	if configPath != "" {
+		absConfigPath, err := filepath.Abs(configPath)
+		if err == nil {
+			// Check if config file exists
+			if _, err := os.Stat(absConfigPath); err == nil {
+				// Note: The Helm chart would need to support config file override
+				// For now, we'll note that config should be embedded in values
+				cli.Info(fmt.Sprintf("Note: Using chart default config (custom config at %s)", absConfigPath))
+			}
+		}
+	}
+
+	// Add custom --set values
+	for _, setValue := range setValues {
+		cmd.Args = append(cmd.Args, "--set", setValue)
+	}
+
+	// Set observability
+	if !withObs {
+		cmd.Args = append(cmd.Args, "--set", "config.observability.tracing.enabled=false")
+	}
+
+	// Set timeout
+	cmd.Args = append(cmd.Args, "--timeout", "10m")
+
+	cli.Info(fmt.Sprintf("Running: %s", strings.Join(cmd.Args, " ")))
+
+	cmd.Stdout = os.Stdout
+	cmd.Stderr = os.Stderr
+
+	if err := cmd.Run(); err != nil {
+		return fmt.Errorf("helm %s failed: %w", action, err)
+	}
+
+	cli.Success(fmt.Sprintf("Helm release '%s' %sd successfully", releaseName, action))
+
+	// Get service information
+	cli.Info("Fetching service information...")
+	svcCmd := exec.Command("kubectl", "get", "svc", "-n", namespace, "-l", "app.kubernetes.io/name=semantic-router")
+	svcCmd.Stdout = os.Stdout
+	svcCmd.Stderr = os.Stderr
+	_ = svcCmd.Run()
+
+	cli.Info("\nNext steps:")
+	cli.Info(fmt.Sprintf("  Check status: helm status %s -n %s", releaseName, namespace))
+	cli.Info(fmt.Sprintf("  Check pods: kubectl get pods -n %s -l app.kubernetes.io/name=semantic-router", namespace))
+	cli.Info(fmt.Sprintf("  View logs: kubectl logs -n %s -l app.kubernetes.io/name=semantic-router", namespace))
+	cli.Info(fmt.Sprintf("  Port forward: kubectl port-forward -n %s svc/%s 8080:8080", namespace, releaseName))
+
+	return nil
+}
+
+// UndeployHelm removes Helm release
+func UndeployHelm(namespace, releaseName string, wait bool) error {
+	cli.Info("Removing Helm release...")
+
+	// Check if helm exists
+	if !commandExists("helm") {
+		return fmt.Errorf("helm not found")
+	}
+
+	// Set release name
+	if releaseName == "" {
+		releaseName = defaultHelmRelease
+	}
+
+	// Check if release exists
+	checkCmd := exec.Command("helm", "list", "-n", namespace, "-q")
+	output, err := checkCmd.Output()
+	if err != nil {
+		return fmt.Errorf("failed to list releases: %w", err)
+	}
+
+	releases := strings.Split(strings.TrimSpace(string(output)), "\n")
+	releaseExists := false
+	for _, r := range releases {
+		if r == releaseName {
+			releaseExists = true
+			break
+		}
+	}
+
+	if !releaseExists {
+		cli.Warning(fmt.Sprintf("Release '%s' not found in namespace '%s'", releaseName, namespace))
+		return nil
+	}
+
+	// Uninstall release
+	cli.Info(fmt.Sprintf("Uninstalling release '%s'...", releaseName))
+	cmd := exec.Command("helm", "uninstall", releaseName, "-n", namespace)
+
+	if wait {
+		cmd.Args = append(cmd.Args, "--wait")
+		cli.Info("Waiting for resources to be deleted...")
+	}
+
+	cmd.Stdout = os.Stdout
+	cmd.Stderr = os.Stderr
+
+	if err := cmd.Run(); err != nil {
+		return fmt.Errorf("helm uninstall failed: %w", err)
+	}
+
+	// Wait for pods to terminate if requested
+	if wait {
+		cli.Info("Verifying cleanup...")
+		timeout := 300 // 5 minutes
+		cleaned := false
+
+		for i := 0; i < timeout; i += 5 {
+			time.Sleep(5 * time.Second)
+
+			// Check for pods
+			//nolint:gosec // G204: releaseName and namespace are from internal config
+			checkCmd := exec.Command("kubectl", "get", "pods", "-n", namespace, "-l", "app.kubernetes.io/instance="+releaseName, "--no-headers")
+			output, err := checkCmd.Output()
+
+			if err != nil || len(output) == 0 {
+				cleaned = true
+				break
+			}
+
+			podCount := len(splitLines(string(output)))
+			if podCount == 0 {
+				cleaned = true
+				break
+			}
+
+			if i%10 == 0 {
+				cli.Info(fmt.Sprintf("Waiting for cleanup... (%ds/%ds, %d pods remaining)", i+5, timeout, podCount))
+			}
+		}
+
+		if !cleaned {
+			cli.Warning("Some resources may still be terminating")
+		} else {
+			cli.Success("All resources cleaned up")
+		}
+	}
+
+	cli.Success(fmt.Sprintf("Helm release '%s' uninstalled", releaseName))
+	return nil
+}
+
+// UpgradeHelmRelease upgrades an existing Helm release
+func UpgradeHelmRelease(configPath, namespace, releaseName string, timeout int) error {
+	cli.Info("Upgrading Helm release...")
+
+	// Check if helm exists
+	if !commandExists("helm") {
+		return fmt.Errorf("helm not found. Please install Helm: https://helm.sh/docs/intro/install/")
+	}
+
+	// Set release name
+	if releaseName == "" {
+		releaseName = defaultHelmRelease
+	}
+
+	// Check if release exists
+	checkCmd := exec.Command("helm", "list", "-n", namespace, "-q")
+	output, err := checkCmd.Output()
+	if err != nil {
+		return fmt.Errorf("failed to list releases: %w", err)
+	}
+
+	releases := strings.Split(strings.TrimSpace(string(output)), "\n")
+	releaseExists := false
+	for _, r := range releases {
+		if r == releaseName {
+			releaseExists = true
+			break
+		}
+	}
+
+	if !releaseExists {
+		cli.Warning(fmt.Sprintf("Release '%s' not found in namespace '%s'", releaseName, namespace))
+		cli.Info("Use 'vsr deploy helm' to create a new deployment")
+		return nil
+	}
+
+	// Verify chart exists
+	chartPath := defaultHelmChart
+	if !filepath.IsAbs(chartPath) {
+		absChart, err := filepath.Abs(chartPath)
+		if err == nil {
+			chartPath = absChart
+		}
+	}
+
+	if _, err := os.Stat(chartPath); os.IsNotExist(err) {
+		return fmt.Errorf("helm chart not found: %s", chartPath)
+	}
+
+	// Build upgrade command
+	cli.Info(fmt.Sprintf("Upgrading release '%s'...", releaseName))
+	cmd := exec.Command("helm", "upgrade", releaseName, chartPath, "-n", namespace, "--wait")
+
+	// Set timeout
+	if timeout > 0 {
+		cmd.Args = append(cmd.Args, "--timeout", fmt.Sprintf("%ds", timeout))
+	} else {
+		cmd.Args = append(cmd.Args, "--timeout", "5m")
+	}
+
+	cmd.Stdout = os.Stdout
+	cmd.Stderr = os.Stderr
+
+	if err := cmd.Run(); err != nil {
+		return fmt.Errorf("helm upgrade failed: %w", err)
+	}
+
+	cli.Success(fmt.Sprintf("Helm release '%s' upgraded successfully", releaseName))
+
+	// Check rollout status
+	cli.Info("Checking deployment status...")
+	//nolint:gosec // G204: releaseName and namespace are from internal config
+	rolloutCmd := exec.Command("kubectl", "rollout", "status", "deployment/"+releaseName, "-n", namespace, "--timeout=60s")
+	rolloutCmd.Stdout = os.Stdout
+	rolloutCmd.Stderr = os.Stderr
+	if err := rolloutCmd.Run(); err != nil {
+		cli.Warning("Deployment rollout status check failed")
+	}
+
+	cli.Info(fmt.Sprintf("Check status: helm status %s -n %s", releaseName, namespace))
+	return nil
+}
+
+// DetectHelmDeployment checks if a Helm deployment exists
+func DetectHelmDeployment(namespace string) *DeploymentStatus {
+	status := &DeploymentStatus{
+		Type:      "helm",
+		IsRunning: false,
+	}
+
+	if !commandExists("helm") {
+		return status
+	}
+
+	// List releases in namespace
+	cmd := exec.Command("helm", "list", "-n", namespace, "-q")
+	output, err := cmd.Output()
+	if err != nil || len(output) == 0 {
+		return status
+	}
+
+	releases := strings.Split(strings.TrimSpace(string(output)), "\n")
+	for _, release := range releases {
+		if release == defaultHelmRelease || strings.Contains(release, "semantic-router") {
+			status.IsRunning = true
+			status.ReleaseName = release
+			break
+		}
+	}
+
+	return status
+}
diff --git a/src/semantic-router/pkg/cli/deployment/paths_test.go b/src/semantic-router/pkg/cli/deployment/paths_test.go
new file mode 100644
index 000000000..3a7ca6bb5
--- /dev/null
+++ b/src/semantic-router/pkg/cli/deployment/paths_test.go
@@ -0,0 +1,79 @@
+package deployment
+
+import (
+	"fmt"
+	"os"
+	"path/filepath"
+	"testing"
+)
+
+// TestCrossPlatformPaths verifies cross-platform path handling using os.TempDir
+func TestCrossPlatformPaths(t *testing.T) {
+	t.Run("getPIDFilePath returns absolute path", func(t *testing.T) {
+		pidFilePath := getPIDFilePath()
+		if pidFilePath == "" {
+			t.Error("getPIDFilePath() returned empty string")
+		}
+		if !filepath.IsAbs(pidFilePath) {
+			t.Errorf("getPIDFilePath() should return absolute path, got: %s", pidFilePath)
+		}
+	})
+
+	t.Run("getLogFilePath returns absolute path", func(t *testing.T) {
+		logFilePath := getLogFilePath()
+		if logFilePath == "" {
+			t.Error("getLogFilePath() returned empty string")
+		}
+		if !filepath.IsAbs(logFilePath) {
+			t.Errorf("getLogFilePath() should return absolute path, got: %s", logFilePath)
+		}
+	})
+
+	t.Run("paths are user-specific", func(t *testing.T) {
+		pidFilePath := getPIDFilePath()
+		expectedSubstring := fmt.Sprintf("-%d.", os.Getuid())
+		// Use Go's strings package to check for substring
+		found := false
+		for i := 0; i <= len(pidFilePath)-len(expectedSubstring); i++ {
+			if i+len(expectedSubstring) <= len(pidFilePath) && pidFilePath[i:i+len(expectedSubstring)] == expectedSubstring {
+				found = true
+				break
+			}
+		}
+		if !found {
+			t.Errorf("pidFilePath should contain UID (%d), got: %s", os.Getuid(), pidFilePath)
+		}
+	})
+
+	t.Run("paths use temp directory", func(t *testing.T) {
+		pidFilePath := getPIDFilePath()
+		tempDir := os.TempDir()
+		// Check if path starts with temp directory
+		if len(pidFilePath) < len(tempDir) {
+			t.Errorf("pidFilePath too short, got: %s", pidFilePath)
+			return
+		}
+		if pidFilePath[:len(tempDir)] != tempDir {
+			t.Errorf("pidFilePath should start with temp directory (%s), got: %s", tempDir, pidFilePath)
+		}
+	})
+}
+
+// TestPathFunctions verifies path helper functions work correctly
+func TestPathFunctions(t *testing.T) {
+	t.Run("paths are consistent", func(t *testing.T) {
+		pid1 := getPIDFilePath()
+		pid2 := getPIDFilePath()
+		if pid1 != pid2 {
+			t.Errorf("getPIDFilePath() should return consistent results, got %s and %s", pid1, pid2)
+		}
+	})
+
+	t.Run("PID and log paths are different", func(t *testing.T) {
+		pidPath := getPIDFilePath()
+		logPath := getLogFilePath()
+		if pidPath == logPath {
+			t.Error("PID and log file paths should be different")
+		}
+	})
+}
diff --git a/src/semantic-router/pkg/cli/deployment/pid_management_test.go b/src/semantic-router/pkg/cli/deployment/pid_management_test.go
new file mode 100644
index 000000000..6e512ad9a
--- /dev/null
+++ b/src/semantic-router/pkg/cli/deployment/pid_management_test.go
@@ -0,0 +1,324 @@
+package deployment
+
+import (
+	"fmt"
+	"os"
+	"os/exec"
+	"testing"
+	"time"
+)
+
+// TestPIDFilePermissions verifies restrictive file permissions (0600) for security
+func TestPIDFilePermissions(t *testing.T) {
+	pidFilePath := getPIDFilePath()
+	logFilePath := getLogFilePath()
+
+	// Clean up any existing files
+	os.Remove(pidFilePath)
+	os.Remove(logFilePath)
+	defer os.Remove(pidFilePath)
+	defer os.Remove(logFilePath)
+
+	// Create log file with correct permissions (simulating DeployLocal)
+	logFile, err := os.OpenFile(logFilePath, os.O_CREATE|os.O_WRONLY|os.O_APPEND, 0o600)
+	if err != nil {
+		t.Fatalf("Failed to create log file: %v", err)
+	}
+	defer logFile.Close()
+
+	// Start a dummy process (simulating router)
+	cmd := exec.Command("sleep", "1")
+	cmd.Stdout = logFile
+	cmd.Stderr = logFile
+
+	if err := cmd.Start(); err != nil {
+		t.Fatalf("Failed to start process: %v", err)
+	}
+	defer func() {
+		if cmd.Process != nil {
+			_ = cmd.Process.Kill()
+		}
+	}()
+
+	pid := cmd.Process.Pid
+
+	// Write PID file with correct permissions
+	if err := os.WriteFile(pidFilePath, []byte(fmt.Sprintf("%d", pid)), 0o600); err != nil {
+		t.Fatalf("Failed to write PID file: %v", err)
+	}
+
+	t.Run("PID file has 0600 permissions", func(t *testing.T) {
+		info, err := os.Stat(pidFilePath)
+		if err != nil {
+			t.Fatalf("Failed to stat PID file: %v", err)
+		}
+		if info.Mode().Perm() != 0o600 {
+			t.Errorf("PID file permissions = %o, expected 0600", info.Mode().Perm())
+		}
+	})
+
+	t.Run("log file has 0600 permissions", func(t *testing.T) {
+		info, err := os.Stat(logFilePath)
+		if err != nil {
+			t.Fatalf("Failed to stat log file: %v", err)
+		}
+		if info.Mode().Perm() != 0o600 {
+			t.Errorf("Log file permissions = %o, expected 0600", info.Mode().Perm())
+		}
+	})
+
+	t.Run("PID file can be read", func(t *testing.T) {
+		pidBytes, err := os.ReadFile(pidFilePath)
+		if err != nil {
+			t.Fatalf("Failed to read PID file: %v", err)
+		}
+		expected := fmt.Sprintf("%d", pid)
+		if string(pidBytes) != expected {
+			t.Errorf("PID file content = %s, expected %s", string(pidBytes), expected)
+		}
+	})
+}
+
+// TestPIDFileRaceCondition verifies process cleanup when PID file write fails
+func TestPIDFileRaceCondition(t *testing.T) {
+	pidFilePath := getPIDFilePath()
+	logFilePath := getLogFilePath()
+
+	// Clean up
+	os.Remove(pidFilePath)
+	os.Remove(logFilePath)
+	defer os.Remove(pidFilePath)
+	defer os.Remove(logFilePath)
+
+	t.Run("process starts successfully with PID file", func(t *testing.T) {
+		logFile, err := os.OpenFile(logFilePath, os.O_CREATE|os.O_WRONLY|os.O_APPEND, 0o600)
+		if err != nil {
+			t.Fatalf("Failed to create log file: %v", err)
+		}
+		defer logFile.Close()
+
+		cmd := exec.Command("sleep", "1")
+		cmd.Stdout = logFile
+		cmd.Stderr = logFile
+
+		if err := cmd.Start(); err != nil {
+			t.Fatalf("Failed to start process: %v", err)
+		}
+		defer func() {
+			_ = cmd.Process.Kill()
+		}()
+
+		pid := cmd.Process.Pid
+
+		// Write PID file and cleanup on failure
+		if err := os.WriteFile(pidFilePath, []byte(fmt.Sprintf("%d", pid)), 0o600); err != nil {
+			// Kill the process if PID write fails to prevent orphaned processes
+			_ = cmd.Process.Kill()
+			t.Fatalf("Failed to write PID file: %v", err)
+		}
+
+		// Verify PID file exists
+		if _, err := os.Stat(pidFilePath); os.IsNotExist(err) {
+			t.Error("PID file should exist after successful write")
+		}
+	})
+
+	t.Run("simulate PID write failure scenario", func(t *testing.T) {
+		// Verify that process is killed if we cannot track it via PID file
+		// Prevents orphaned processes that cannot be managed
+
+		logFile, err := os.OpenFile(logFilePath, os.O_CREATE|os.O_WRONLY|os.O_APPEND, 0o600)
+		if err != nil {
+			t.Fatalf("Failed to create log file: %v", err)
+		}
+		defer logFile.Close()
+
+		cmd := exec.Command("sleep", "10")
+		cmd.Stdout = logFile
+		cmd.Stderr = logFile
+
+		if err := cmd.Start(); err != nil {
+			t.Fatalf("Failed to start process: %v", err)
+		}
+
+		pid := cmd.Process.Pid
+
+		// Simulate trying to write PID to invalid location
+		invalidPath := "/invalid/path/pid.file"
+		writeErr := os.WriteFile(invalidPath, []byte(fmt.Sprintf("%d", pid)), 0o600)
+
+		if writeErr != nil {
+			// Kill process if we can't track it via PID file
+			_ = cmd.Process.Kill()
+
+			// Verify process is killed
+			time.Sleep(100 * time.Millisecond)
+			if err := cmd.Process.Signal(os.Signal(nil)); err == nil {
+				t.Error("Process should be killed if PID file write fails")
+			}
+		}
+	})
+}
+
+// TestPIDFileCleanup verifies proper cleanup
+func TestPIDFileCleanup(t *testing.T) {
+	pidFilePath := getPIDFilePath()
+	logFilePath := getLogFilePath()
+
+	// Clean up
+	os.Remove(pidFilePath)
+	os.Remove(logFilePath)
+
+	t.Run("cleanup removes PID and log files", func(t *testing.T) {
+		// Create files
+		if err := os.WriteFile(pidFilePath, []byte("12345"), 0o600); err != nil {
+			t.Fatalf("Failed to create PID file: %v", err)
+		}
+		if err := os.WriteFile(logFilePath, []byte("test logs"), 0o600); err != nil {
+			t.Fatalf("Failed to create log file: %v", err)
+		}
+
+		// Verify they exist
+		if _, err := os.Stat(pidFilePath); os.IsNotExist(err) {
+			t.Error("PID file should exist before cleanup")
+		}
+		if _, err := os.Stat(logFilePath); os.IsNotExist(err) {
+			t.Error("Log file should exist before cleanup")
+		}
+
+		// Clean up
+		os.Remove(pidFilePath)
+		os.Remove(logFilePath)
+
+		// Verify they're gone
+		if _, err := os.Stat(pidFilePath); !os.IsNotExist(err) {
+			t.Error("PID file should not exist after cleanup")
+		}
+		if _, err := os.Stat(logFilePath); !os.IsNotExist(err) {
+			t.Error("Log file should not exist after cleanup")
+		}
+	})
+}
+
+// TestIsProcessRunning tests the process detection helper function
+func TestIsProcessRunning(t *testing.T) {
+	t.Run("current process is detected as running", func(t *testing.T) {
+		currentPID := os.Getpid()
+		if !isProcessRunning(currentPID) {
+			t.Error("Current process should be detected as running")
+		}
+	})
+
+	t.Run("non-existent process is not running", func(t *testing.T) {
+		// Use a very high PID that is unlikely to exist
+		fakePID := 99999999
+		if isProcessRunning(fakePID) {
+			t.Error("Non-existent process should not be detected as running")
+		}
+	})
+
+	t.Run("PID 0 is not running", func(t *testing.T) {
+		if isProcessRunning(0) {
+			t.Error("PID 0 should not be detected as running")
+		}
+	})
+
+	t.Run("negative PID is not running", func(t *testing.T) {
+		if isProcessRunning(-1) {
+			t.Error("Negative PID should not be detected as running")
+		}
+	})
+}
+
+// TestStopProcess tests the graceful process shutdown helper function
+func TestStopProcess(t *testing.T) {
+	t.Run("stops a running process gracefully", func(t *testing.T) {
+		// Start a long-running process that handles signals
+		// Using 'cat' with no input which will exit on signal
+		cmd := exec.Command("sleep", "60")
+		if err := cmd.Start(); err != nil {
+			t.Fatalf("Failed to start test process: %v", err)
+		}
+
+		pid := cmd.Process.Pid
+
+		// Verify it's running
+		if !isProcessRunning(pid) {
+			t.Fatal("Test process should be running before stop")
+		}
+
+		// Stop the process
+		err := stopProcess(pid)
+		if err != nil {
+			t.Errorf("stopProcess returned error: %v", err)
+		}
+
+		// Wait for process to fully terminate and be reaped
+		// Use cmd.Wait() to properly reap the zombie process
+		_ = cmd.Wait()
+
+		// Give the OS a moment to update process state
+		time.Sleep(500 * time.Millisecond)
+
+		// Verify it's stopped - the process should no longer exist
+		if isProcessRunning(pid) {
+			t.Error("Process should be stopped after stopProcess call")
+			// Clean up in case test fails
+			_ = cmd.Process.Kill()
+		}
+	})
+
+	t.Run("returns error for non-existent process", func(t *testing.T) {
+		fakePID := 99999999
+		err := stopProcess(fakePID)
+		// On Unix, FindProcess doesn't error but Signal does
+		// So we expect an error when trying to signal a non-existent process
+		if err == nil {
+			t.Error("stopProcess should return error for non-existent process")
+		}
+	})
+}
+
+// TestDeployLocalAlreadyRunning tests that deploy local prevents orphaned processes
+func TestDeployLocalAlreadyRunning(t *testing.T) {
+	pidFilePath := getPIDFilePath()
+	logFilePath := getLogFilePath()
+
+	// Clean up
+	os.Remove(pidFilePath)
+	os.Remove(logFilePath)
+	defer os.Remove(pidFilePath)
+	defer os.Remove(logFilePath)
+
+	t.Run("detects stale PID file with dead process", func(t *testing.T) {
+		// Create a PID file with a non-existent process
+		stalePID := 99999999
+		err := os.WriteFile(pidFilePath, []byte(fmt.Sprintf("%d", stalePID)), 0o600)
+		if err != nil {
+			t.Fatalf("Failed to create stale PID file: %v", err)
+		}
+
+		// Verify the process is not running
+		if isProcessRunning(stalePID) {
+			t.Skip("Skipping: fake PID unexpectedly exists")
+		}
+
+		// This simulates the cleanup logic in DeployLocal
+		// The stale PID file should be detected and cleaned up
+	})
+
+	t.Run("detects running process from PID file", func(t *testing.T) {
+		// Use our own PID as the "running" process
+		currentPID := os.Getpid()
+		err := os.WriteFile(pidFilePath, []byte(fmt.Sprintf("%d", currentPID)), 0o600)
+		if err != nil {
+			t.Fatalf("Failed to create PID file: %v", err)
+		}
+		defer os.Remove(pidFilePath)
+
+		// Verify the process is detected as running
+		if !isProcessRunning(currentPID) {
+			t.Error("Current process should be detected as running")
+		}
+	})
+}
diff --git a/src/semantic-router/pkg/cli/deployment/upgrade.go b/src/semantic-router/pkg/cli/deployment/upgrade.go
new file mode 100644
index 000000000..f87b66d10
--- /dev/null
+++ b/src/semantic-router/pkg/cli/deployment/upgrade.go
@@ -0,0 +1,267 @@
+package deployment
+
+import (
+	"fmt"
+	"os"
+	"os/exec"
+	"strconv"
+	"syscall"
+	"time"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+)
+
+// UpgradeLocal upgrades the local router deployment
+func UpgradeLocal(configPath string) error {
+	cli.Info("Upgrading local router...")
+
+	// Get cross-platform file path
+	pidFilePath := getPIDFilePath()
+
+	// Check if PID file exists (router is running)
+	if _, err := os.Stat(pidFilePath); os.IsNotExist(err) {
+		cli.Warning("No running local router found")
+		cli.Info("Use 'vsr deploy local' to start a new deployment")
+		return nil
+	}
+
+	// Read current PID
+	pidBytes, err := os.ReadFile(pidFilePath)
+	if err != nil {
+		return fmt.Errorf("failed to read PID file: %w", err)
+	}
+
+	pid, err := strconv.Atoi(string(pidBytes))
+	if err != nil {
+		return fmt.Errorf("invalid PID in file: %w", err)
+	}
+
+	cli.Info(fmt.Sprintf("Found running router (PID: %d)", pid))
+
+	// Rebuild the router binary
+	cli.Info("Rebuilding router binary...")
+	if buildErr := buildRouter(); buildErr != nil {
+		return fmt.Errorf("failed to rebuild router: %w", buildErr)
+	}
+	cli.Success("Binary rebuilt successfully")
+
+	// Find the process
+	process, err := os.FindProcess(pid)
+	if err != nil {
+		cli.Warning(fmt.Sprintf("Process %d not found", pid))
+		// Try to deploy fresh
+		return DeployLocal(configPath, false)
+	}
+
+	// Send SIGTERM for graceful shutdown
+	cli.Info("Stopping current router...")
+	if err := process.Signal(syscall.SIGTERM); err != nil {
+		cli.Warning(fmt.Sprintf("Failed to send SIGTERM: %v", err))
+	}
+
+	// Wait for process to stop (max 10 seconds)
+	stopped := false
+	for i := 0; i < 10; i++ {
+		time.Sleep(1 * time.Second)
+		if err := process.Signal(syscall.Signal(0)); err != nil {
+			stopped = true
+			break
+		}
+	}
+
+	// Force kill if needed
+	if !stopped {
+		cli.Warning("Forcing process termination...")
+		_ = process.Kill()
+		time.Sleep(1 * time.Second)
+	}
+
+	// Clean up old PID file
+	os.Remove(pidFilePath)
+
+	cli.Success("Old router stopped")
+
+	// Start new version
+	cli.Info("Starting upgraded router...")
+	if err := DeployLocal(configPath, false); err != nil {
+		return fmt.Errorf("failed to start upgraded router: %w", err)
+	}
+
+	cli.Success("Local router upgraded successfully")
+	return nil
+}
+
+// UpgradeDocker upgrades the Docker Compose deployment
+func UpgradeDocker(configPath string, withObservability bool) error {
+	cli.Info("Upgrading Docker deployment...")
+
+	// Check if docker-compose is running
+	if !isDockerRunning() {
+		cli.Warning("No running Docker deployment found")
+		cli.Info("Use 'vsr deploy docker' to start a new deployment")
+		return nil
+	}
+
+	composeFile := "deploy/docker-compose/docker-compose.yml"
+	if _, err := os.Stat(composeFile); os.IsNotExist(err) {
+		return fmt.Errorf("docker-compose file not found: %s", composeFile)
+	}
+
+	// Pull latest images
+	cli.Info("Pulling latest Docker images...")
+	var pullCmd *exec.Cmd
+	if commandExists("docker-compose") {
+		pullCmd = exec.Command("docker-compose", "-f", composeFile, "pull")
+	} else {
+		pullCmd = exec.Command("docker", "compose", "-f", composeFile, "pull")
+	}
+
+	pullCmd.Stdout = os.Stdout
+	pullCmd.Stderr = os.Stderr
+
+	if err := pullCmd.Run(); err != nil {
+		return fmt.Errorf("failed to pull latest images: %w", err)
+	}
+	cli.Success("Images pulled successfully")
+
+	// Recreate containers with new images
+	cli.Info("Recreating containers...")
+	var upCmd *exec.Cmd
+	if commandExists("docker-compose") {
+		upCmd = exec.Command("docker-compose", "-f", composeFile, "up", "-d", "--force-recreate", "--no-deps")
+	} else {
+		upCmd = exec.Command("docker", "compose", "-f", composeFile, "up", "-d", "--force-recreate", "--no-deps")
+	}
+
+	upCmd.Stdout = os.Stdout
+	upCmd.Stderr = os.Stderr
+
+	if err := upCmd.Run(); err != nil {
+		return fmt.Errorf("failed to recreate containers: %w", err)
+	}
+
+	// Wait for containers to be healthy
+	cli.Info("Waiting for containers to be ready...")
+	time.Sleep(5 * time.Second) // Give containers time to start
+
+	// Check container health
+	healthy := false
+	for i := 0; i < 30; i++ {
+		containers, _ := getDockerContainers("semantic-router")
+		if len(containers) > 0 {
+			// Simple health check - containers are running
+			healthy = true
+			break
+		}
+		time.Sleep(2 * time.Second)
+		if i%5 == 0 {
+			cli.Info(fmt.Sprintf("Waiting for containers... (%ds/60s)", i*2))
+		}
+	}
+
+	if !healthy {
+		cli.Warning("Could not verify container health")
+		cli.Info("Check status with: vsr status")
+		return fmt.Errorf("containers may not be healthy")
+	}
+
+	cli.Success("Docker deployment upgraded successfully")
+	cli.Info("Check status with: vsr status")
+	cli.Info("View logs with: vsr logs")
+	return nil
+}
+
+// UpgradeKubernetes upgrades the Kubernetes deployment
+func UpgradeKubernetes(configPath, namespace string, timeout int, wait bool) error {
+	cli.Info("Upgrading Kubernetes deployment...")
+
+	// Check if kubectl exists
+	if !commandExists("kubectl") {
+		return fmt.Errorf("kubectl not found. Please install kubectl")
+	}
+
+	// Check if deployment exists
+	checkCmd := exec.Command("kubectl", "get", "deployment", "semantic-router", "-n", namespace)
+	if err := checkCmd.Run(); err != nil {
+		cli.Warning("No deployment found in namespace: " + namespace)
+		cli.Info("Use 'vsr deploy kubernetes' to create a new deployment")
+		return nil
+	}
+
+	// Apply updated manifests
+	cli.Info("Applying updated manifests...")
+	manifestDir := "deploy/kubernetes"
+	if _, err := os.Stat(manifestDir); os.IsNotExist(err) {
+		return fmt.Errorf("kubernetes manifests not found: %s", manifestDir)
+	}
+
+	applyCmd := exec.Command("kubectl", "apply", "-f", manifestDir, "-n", namespace)
+	applyCmd.Stdout = os.Stdout
+	applyCmd.Stderr = os.Stderr
+
+	if err := applyCmd.Run(); err != nil {
+		return fmt.Errorf("failed to apply manifests: %w", err)
+	}
+	cli.Success("Manifests applied successfully")
+
+	// Trigger rolling restart
+	cli.Info("Triggering rolling restart...")
+	restartCmd := exec.Command("kubectl", "rollout", "restart", "deployment/semantic-router", "-n", namespace)
+	restartCmd.Stdout = os.Stdout
+	restartCmd.Stderr = os.Stderr
+
+	if err := restartCmd.Run(); err != nil {
+		return fmt.Errorf("failed to restart deployment: %w", err)
+	}
+
+	// Wait for rollout to complete if requested
+	if wait {
+		cli.Info("Waiting for rollout to complete...")
+		//nolint:gosec // G204: namespace is from internal config
+		rolloutCmd := exec.Command("kubectl", "rollout", "status", "deployment/semantic-router", "-n", namespace, fmt.Sprintf("--timeout=%ds", timeout))
+		rolloutCmd.Stdout = os.Stdout
+		rolloutCmd.Stderr = os.Stderr
+
+		if err := rolloutCmd.Run(); err != nil {
+			cli.Warning("Rollout status check failed")
+			cli.Info("Check manually: kubectl rollout status deployment/semantic-router -n " + namespace)
+			return fmt.Errorf("rollout may not have completed successfully: %w", err)
+		}
+		cli.Success("Rollout completed successfully")
+	} else {
+		cli.Info("Rollout started (not waiting for completion)")
+		cli.Info("Monitor with: kubectl rollout status deployment/semantic-router -n " + namespace)
+	}
+
+	cli.Success("Kubernetes deployment upgraded successfully")
+	cli.Info("Check status with: kubectl get pods -n " + namespace)
+	return nil
+}
+
+// UpgradeHelm upgrades the Helm deployment
+func UpgradeHelm(configPath, namespace string, timeout int) error {
+	cli.Info("Upgrading Helm deployment...")
+
+	// Check if helm exists
+	if !commandExists("helm") {
+		return fmt.Errorf("helm not found. Please install Helm: https://helm.sh/docs/intro/install/")
+	}
+
+	// Check if release exists
+	checkCmd := exec.Command("helm", "list", "-n", namespace, "-q")
+	output, err := checkCmd.Output()
+	if err != nil || len(output) == 0 {
+		cli.Warning("No Helm release found in namespace: " + namespace)
+		cli.Info("Use 'vsr deploy helm' to create a new deployment")
+		return nil
+	}
+
+	cli.Warning("Helm deployment upgrade is not fully implemented yet")
+	cli.Info("This feature will be available in a future release")
+	cli.Info("\nWorkaround:")
+	cli.Info("1. Update your values.yaml file")
+	cli.Info("2. Run: helm upgrade semantic-router ./deploy/helm/semantic-router -n " + namespace)
+	cli.Info("3. Wait: kubectl rollout status deployment/semantic-router -n " + namespace)
+
+	return fmt.Errorf("helm upgrade not yet implemented")
+}
diff --git a/src/semantic-router/pkg/cli/model/manager.go b/src/semantic-router/pkg/cli/model/manager.go
new file mode 100644
index 000000000..4ba5e4122
--- /dev/null
+++ b/src/semantic-router/pkg/cli/model/manager.go
@@ -0,0 +1,399 @@
+package model
+
+import (
+	"fmt"
+	"io"
+	"net/http"
+	"os"
+	"path/filepath"
+	"strings"
+	"time"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/classification"
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/cli"
+)
+
+// ModelInfo represents information about a model
+type ModelInfo struct {
+	ID           string
+	Name         string
+	Path         string
+	Type         string // "lora" or "legacy"
+	Architecture string // "bert", "roberta", "modernbert"
+	Downloaded   bool
+	Size         int64
+	Purpose      string // "intent", "pii", "security", "base"
+}
+
+// ModelManager handles model operations
+type ModelManager struct {
+	ModelsDir string
+}
+
+// NewModelManager creates a new model manager
+func NewModelManager(modelsDir string) *ModelManager {
+	if modelsDir == "" {
+		modelsDir = "./models"
+	}
+	return &ModelManager{
+		ModelsDir: modelsDir,
+	}
+}
+
+// ListModels lists all models (downloaded and configured)
+func (mm *ModelManager) ListModels() ([]ModelInfo, error) {
+	// Ensure models directory exists
+	if _, err := os.Stat(mm.ModelsDir); os.IsNotExist(err) {
+		cli.Info(fmt.Sprintf("Models directory does not exist: %s", mm.ModelsDir))
+		return []ModelInfo{}, nil
+	}
+
+	// Discover models using existing functionality
+	paths, err := classification.AutoDiscoverModels(mm.ModelsDir)
+	if err != nil {
+		return nil, fmt.Errorf("failed to discover models: %w", err)
+	}
+
+	var models []ModelInfo
+
+	// Add LoRA models if found
+	if paths.HasLoRAModels() {
+		models = append(models, ModelInfo{
+			ID:           "lora-intent-classifier",
+			Name:         "LoRA Intent Classifier",
+			Path:         paths.LoRAIntentClassifier,
+			Type:         "lora",
+			Architecture: paths.LoRAArchitecture,
+			Downloaded:   true,
+			Size:         mm.getDirectorySize(paths.LoRAIntentClassifier),
+			Purpose:      "intent",
+		})
+		models = append(models, ModelInfo{
+			ID:           "lora-pii-detector",
+			Name:         "LoRA PII Detector",
+			Path:         paths.LoRAPIIClassifier,
+			Type:         "lora",
+			Architecture: paths.LoRAArchitecture,
+			Downloaded:   true,
+			Size:         mm.getDirectorySize(paths.LoRAPIIClassifier),
+			Purpose:      "pii",
+		})
+		models = append(models, ModelInfo{
+			ID:           "lora-security-classifier",
+			Name:         "LoRA Security Classifier",
+			Path:         paths.LoRASecurityClassifier,
+			Type:         "lora",
+			Architecture: paths.LoRAArchitecture,
+			Downloaded:   true,
+			Size:         mm.getDirectorySize(paths.LoRASecurityClassifier),
+			Purpose:      "security",
+		})
+	}
+
+	// Add legacy models if found
+	if paths.HasLegacyModels() {
+		if paths.ModernBertBase != "" {
+			models = append(models, ModelInfo{
+				ID:           "modernbert-base",
+				Name:         "ModernBERT Base",
+				Path:         paths.ModernBertBase,
+				Type:         "legacy",
+				Architecture: "modernbert",
+				Downloaded:   true,
+				Size:         mm.getDirectorySize(paths.ModernBertBase),
+				Purpose:      "base",
+			})
+		}
+		if paths.IntentClassifier != "" {
+			models = append(models, ModelInfo{
+				ID:           "intent-classifier",
+				Name:         "Intent Classifier",
+				Path:         paths.IntentClassifier,
+				Type:         "legacy",
+				Architecture: "modernbert",
+				Downloaded:   true,
+				Size:         mm.getDirectorySize(paths.IntentClassifier),
+				Purpose:      "intent",
+			})
+		}
+		if paths.PIIClassifier != "" {
+			models = append(models, ModelInfo{
+				ID:           "pii-classifier",
+				Name:         "PII Classifier",
+				Path:         paths.PIIClassifier,
+				Type:         "legacy",
+				Architecture: "modernbert",
+				Downloaded:   true,
+				Size:         mm.getDirectorySize(paths.PIIClassifier),
+				Purpose:      "pii",
+			})
+		}
+		if paths.SecurityClassifier != "" {
+			models = append(models, ModelInfo{
+				ID:           "security-classifier",
+				Name:         "Security Classifier",
+				Path:         paths.SecurityClassifier,
+				Type:         "legacy",
+				Architecture: "modernbert",
+				Downloaded:   true,
+				Size:         mm.getDirectorySize(paths.SecurityClassifier),
+				Purpose:      "security",
+			})
+		}
+	}
+
+	return models, nil
+}
+
+// ValidateModel validates a specific model
+func (mm *ModelManager) ValidateModel(modelID string) error {
+	models, err := mm.ListModels()
+	if err != nil {
+		return err
+	}
+
+	// Find the model
+	var targetModel *ModelInfo
+	for i := range models {
+		if models[i].ID == modelID {
+			targetModel = &models[i]
+			break
+		}
+	}
+
+	if targetModel == nil {
+		return fmt.Errorf("model not found: %s", modelID)
+	}
+
+	// Check if directory exists
+	if _, err := os.Stat(targetModel.Path); os.IsNotExist(err) {
+		return fmt.Errorf("model directory does not exist: %s", targetModel.Path)
+	}
+
+	// Check for essential model files
+	essentialFiles := []string{"config.json"}
+	modelFiles := []string{"pytorch_model.bin", "model.safetensors"}
+
+	// Check essential files
+	for _, file := range essentialFiles {
+		filePath := filepath.Join(targetModel.Path, file)
+		if _, err := os.Stat(filePath); os.IsNotExist(err) {
+			return fmt.Errorf("missing essential file: %s", file)
+		}
+	}
+
+	// Check at least one model file exists
+	hasModelFile := false
+	for _, file := range modelFiles {
+		filePath := filepath.Join(targetModel.Path, file)
+		if _, err := os.Stat(filePath); err == nil {
+			hasModelFile = true
+			break
+		}
+	}
+
+	if !hasModelFile {
+		return fmt.Errorf("no model weights found (expected pytorch_model.bin or model.safetensors)")
+	}
+
+	return nil
+}
+
+// ValidateAllModels validates all downloaded models
+func (mm *ModelManager) ValidateAllModels() (map[string]error, error) {
+	models, err := mm.ListModels()
+	if err != nil {
+		return nil, err
+	}
+
+	results := make(map[string]error)
+	for _, model := range models {
+		if model.Downloaded {
+			results[model.ID] = mm.ValidateModel(model.ID)
+		}
+	}
+
+	return results, nil
+}
+
+// GetModelInfo returns detailed information about a model
+func (mm *ModelManager) GetModelInfo(modelID string) (*ModelInfo, error) {
+	models, err := mm.ListModels()
+	if err != nil {
+		return nil, err
+	}
+
+	for i := range models {
+		if models[i].ID == modelID {
+			return &models[i], nil
+		}
+	}
+
+	return nil, fmt.Errorf("model not found: %s", modelID)
+}
+
+// RemoveModel removes a model from disk
+func (mm *ModelManager) RemoveModel(modelID string) error {
+	model, err := mm.GetModelInfo(modelID)
+	if err != nil {
+		return err
+	}
+
+	if !model.Downloaded {
+		return fmt.Errorf("model is not downloaded: %s", modelID)
+	}
+
+	// Remove the model directory
+	if err := os.RemoveAll(model.Path); err != nil {
+		return fmt.Errorf("failed to remove model directory: %w", err)
+	}
+
+	return nil
+}
+
+// DownloadModel downloads a model from HuggingFace
+func (mm *ModelManager) DownloadModel(modelID string, progressCallback func(downloaded, total int64)) error {
+	// For now, this is a placeholder that calls the existing make command
+	// In the future, this could be implemented with direct HuggingFace API calls
+	cli.Warning("Model download currently uses the Makefile 'download-models' command")
+	cli.Info("Downloading all configured models...")
+
+	return fmt.Errorf("direct model download not yet implemented - use 'make download-models'")
+}
+
+// getDirectorySize calculates the total size of a directory
+func (mm *ModelManager) getDirectorySize(path string) int64 {
+	var size int64
+	_ = filepath.Walk(path, func(_ string, info os.FileInfo, err error) error {
+		if err != nil {
+			return err
+		}
+		if !info.IsDir() {
+			size += info.Size()
+		}
+		return nil
+	})
+	return size
+}
+
+// FormatSize formats a byte size in human-readable format
+func FormatSize(bytes int64) string {
+	const unit = 1024
+	if bytes < unit {
+		return fmt.Sprintf("%d B", bytes)
+	}
+	div, exp := int64(unit), 0
+	for n := bytes / unit; n >= unit; n /= unit {
+		div *= unit
+		exp++
+	}
+	return fmt.Sprintf("%.1f %ciB", float64(bytes)/float64(div), "KMGTPE"[exp])
+}
+
+// DownloadFile downloads a file from a URL with progress tracking
+func DownloadFile(filepath string, url string, progressCallback func(downloaded, total int64)) error {
+	// Create the file
+	out, err := os.Create(filepath)
+	if err != nil {
+		return err
+	}
+	defer out.Close()
+
+	// Get the data
+	//nolint:gosec // G107: URL is constructed internally and validated
+	resp, err := http.Get(url)
+	if err != nil {
+		return err
+	}
+	defer resp.Body.Close()
+
+	// Check server response
+	if resp.StatusCode != http.StatusOK {
+		return fmt.Errorf("bad status: %s", resp.Status)
+	}
+
+	// Create a progress reader if callback provided
+	var reader io.Reader = resp.Body
+	if progressCallback != nil {
+		reader = &progressReader{
+			reader:   resp.Body,
+			total:    resp.ContentLength,
+			callback: progressCallback,
+		}
+	}
+
+	// Write the body to file
+	_, err = io.Copy(out, reader)
+	if err != nil {
+		return err
+	}
+
+	return nil
+}
+
+// progressReader wraps an io.Reader to track progress
+type progressReader struct {
+	reader     io.Reader
+	total      int64
+	downloaded int64
+	callback   func(downloaded, total int64)
+	lastUpdate time.Time
+}
+
+func (pr *progressReader) Read(p []byte) (int, error) {
+	n, err := pr.reader.Read(p)
+	pr.downloaded += int64(n)
+
+	// Call callback every 100ms to avoid too frequent updates
+	if pr.callback != nil && time.Since(pr.lastUpdate) > 100*time.Millisecond {
+		pr.callback(pr.downloaded, pr.total)
+		pr.lastUpdate = time.Now()
+	}
+
+	return n, err
+}
+
+// GetModelStatus returns the overall status of models
+func (mm *ModelManager) GetModelStatus() map[string]interface{} {
+	// Use existing functionality
+	return classification.GetModelDiscoveryInfo(mm.ModelsDir)
+}
+
+// ensureModelsDirectory creates the models directory if it doesn't exist
+func (mm *ModelManager) ensureModelsDirectory() error {
+	if _, err := os.Stat(mm.ModelsDir); os.IsNotExist(err) {
+		if err := os.MkdirAll(mm.ModelsDir, 0o755); err != nil {
+			return fmt.Errorf("failed to create models directory: %w", err)
+		}
+	}
+	return nil
+}
+
+// GetConfiguredModels returns models configured in config file
+// This would need to be implemented with config file parsing
+func (mm *ModelManager) GetConfiguredModels() ([]string, error) {
+	// Placeholder - would parse config file to get configured models
+	return []string{}, nil
+}
+
+// isModelDownloaded checks if a model is downloaded
+func (mm *ModelManager) isModelDownloaded(modelID string) bool {
+	models, err := mm.ListModels()
+	if err != nil {
+		return false
+	}
+
+	for _, model := range models {
+		if model.ID == modelID && model.Downloaded {
+			return true
+		}
+	}
+	return false
+}
+
+// getModelPath returns the expected path for a model
+func (mm *ModelManager) getModelPath(modelID string) string {
+	// Convert model ID to directory name
+	dirName := strings.ReplaceAll(modelID, "-", "_")
+	return filepath.Join(mm.ModelsDir, dirName)
+}
diff --git a/src/semantic-router/pkg/cli/model/manager_test.go b/src/semantic-router/pkg/cli/model/manager_test.go
new file mode 100644
index 000000000..ef875e892
--- /dev/null
+++ b/src/semantic-router/pkg/cli/model/manager_test.go
@@ -0,0 +1,276 @@
+package model
+
+import (
+	"os"
+	"path/filepath"
+	"testing"
+)
+
+func TestNewModelManager(t *testing.T) {
+	t.Run("with models dir", func(t *testing.T) {
+		mgr := NewModelManager("/tmp/models")
+		if mgr.ModelsDir != "/tmp/models" {
+			t.Errorf("Expected ModelsDir to be /tmp/models, got %s", mgr.ModelsDir)
+		}
+	})
+
+	t.Run("with empty dir defaults to ./models", func(t *testing.T) {
+		mgr := NewModelManager("")
+		if mgr.ModelsDir != "./models" {
+			t.Errorf("Expected ModelsDir to be ./models, got %s", mgr.ModelsDir)
+		}
+	})
+}
+
+func TestFormatSize(t *testing.T) {
+	tests := []struct {
+		name     string
+		bytes    int64
+		expected string
+	}{
+		{
+			name:     "bytes",
+			bytes:    512,
+			expected: "512 B",
+		},
+		{
+			name:     "kilobytes",
+			bytes:    1024,
+			expected: "1.0 KiB",
+		},
+		{
+			name:     "megabytes",
+			bytes:    1024 * 1024,
+			expected: "1.0 MiB",
+		},
+		{
+			name:     "gigabytes",
+			bytes:    1024 * 1024 * 1024,
+			expected: "1.0 GiB",
+		},
+		{
+			name:     "terabytes",
+			bytes:    1024 * 1024 * 1024 * 1024,
+			expected: "1.0 TiB",
+		},
+		{
+			name:     "mixed size",
+			bytes:    1536 * 1024 * 1024, // 1.5 GB
+			expected: "1.5 GiB",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			result := FormatSize(tt.bytes)
+			if result != tt.expected {
+				t.Errorf("FormatSize(%d) = %s, expected %s", tt.bytes, result, tt.expected)
+			}
+		})
+	}
+}
+
+func TestListModels(t *testing.T) {
+	t.Run("nonexistent directory", func(t *testing.T) {
+		mgr := NewModelManager("/tmp/nonexistent-models-dir-12345")
+		models, err := mgr.ListModels()
+		// Should not error, just return empty list
+		if err != nil {
+			t.Errorf("Expected no error, got: %v", err)
+		}
+
+		if len(models) != 0 {
+			t.Errorf("Expected empty models list, got %d models", len(models))
+		}
+	})
+
+	t.Run("empty directory", func(t *testing.T) {
+		// Create a temporary empty directory
+		tmpDir := filepath.Join(os.TempDir(), "vsr-test-models-empty")
+		_ = os.MkdirAll(tmpDir, 0o755)
+		defer os.RemoveAll(tmpDir)
+
+		mgr := NewModelManager(tmpDir)
+		models, err := mgr.ListModels()
+		if err != nil {
+			t.Errorf("Expected no error, got: %v", err)
+		}
+
+		// Empty directory should return empty list (no models discovered)
+		if len(models) != 0 {
+			t.Errorf("Expected empty models list, got %d models", len(models))
+		}
+	})
+}
+
+func TestGetModelInfo(t *testing.T) {
+	t.Run("model not found", func(t *testing.T) {
+		mgr := NewModelManager("/tmp/nonexistent-models-dir-12345")
+		_, err := mgr.GetModelInfo("nonexistent-model")
+
+		if err == nil {
+			t.Error("Expected error for nonexistent model, got nil")
+		}
+	})
+}
+
+func TestValidateModel(t *testing.T) {
+	t.Run("nonexistent model", func(t *testing.T) {
+		mgr := NewModelManager("/tmp/nonexistent-models-dir-12345")
+		err := mgr.ValidateModel("nonexistent-model")
+
+		if err == nil {
+			t.Error("Expected error for nonexistent model, got nil")
+		}
+	})
+}
+
+func TestRemoveModel(t *testing.T) {
+	t.Run("nonexistent model", func(t *testing.T) {
+		mgr := NewModelManager("/tmp/nonexistent-models-dir-12345")
+		err := mgr.RemoveModel("nonexistent-model")
+
+		if err == nil {
+			t.Error("Expected error for nonexistent model, got nil")
+		}
+	})
+}
+
+func TestGetDirectorySize(t *testing.T) {
+	// Create a temporary directory with some files
+	tmpDir := filepath.Join(os.TempDir(), "vsr-test-size")
+	_ = os.MkdirAll(tmpDir, 0o755)
+	defer os.RemoveAll(tmpDir)
+
+	// Create test files
+	testFile1 := filepath.Join(tmpDir, "file1.txt")
+	testFile2 := filepath.Join(tmpDir, "file2.txt")
+
+	_ = os.WriteFile(testFile1, []byte("hello"), 0o644)  // 5 bytes
+	_ = os.WriteFile(testFile2, []byte("world!"), 0o644) // 6 bytes
+
+	mgr := NewModelManager(tmpDir)
+	size := mgr.getDirectorySize(tmpDir)
+
+	// Should be 11 bytes total
+	if size != 11 {
+		t.Errorf("Expected size to be 11 bytes, got %d", size)
+	}
+}
+
+func TestGetModelPath(t *testing.T) {
+	mgr := NewModelManager("/tmp/models")
+
+	tests := []struct {
+		name     string
+		modelID  string
+		expected string
+	}{
+		{
+			name:     "simple model id",
+			modelID:  "test-model",
+			expected: "/tmp/models/test_model",
+		},
+		{
+			name:     "complex model id",
+			modelID:  "lora-intent-classifier",
+			expected: "/tmp/models/lora_intent_classifier",
+		},
+		{
+			name:     "no dashes",
+			modelID:  "simplemodel",
+			expected: "/tmp/models/simplemodel",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			result := mgr.getModelPath(tt.modelID)
+			if result != tt.expected {
+				t.Errorf("getModelPath(%s) = %s, expected %s", tt.modelID, result, tt.expected)
+			}
+		})
+	}
+}
+
+func TestIsModelDownloaded(t *testing.T) {
+	mgr := NewModelManager("/tmp/nonexistent-models-dir-12345")
+
+	// Should return false for nonexistent model
+	if mgr.isModelDownloaded("test-model") {
+		t.Error("Expected false for nonexistent model, got true")
+	}
+}
+
+func TestValidateAllModels(t *testing.T) {
+	t.Run("empty models dir", func(t *testing.T) {
+		tmpDir := filepath.Join(os.TempDir(), "vsr-test-validate-all")
+		_ = os.MkdirAll(tmpDir, 0o755)
+		defer os.RemoveAll(tmpDir)
+
+		mgr := NewModelManager(tmpDir)
+		results, err := mgr.ValidateAllModels()
+		if err != nil {
+			t.Errorf("Expected no error, got: %v", err)
+		}
+
+		// No models to validate
+		if len(results) != 0 {
+			t.Errorf("Expected 0 results, got %d", len(results))
+		}
+	})
+}
+
+func TestGetModelStatus(t *testing.T) {
+	t.Run("returns status map", func(t *testing.T) {
+		mgr := NewModelManager("/tmp/nonexistent-models-dir-12345")
+		status := mgr.GetModelStatus()
+
+		// Should return a map with at least some keys
+		if status == nil {
+			t.Error("Expected non-nil status map")
+		}
+
+		// Check for expected keys
+		if _, hasDir := status["models_directory"]; !hasDir {
+			t.Error("Expected 'models_directory' key in status")
+		}
+
+		if _, hasStatus := status["discovery_status"]; !hasStatus {
+			t.Error("Expected 'discovery_status' key in status")
+		}
+	})
+}
+
+func TestEnsureModelsDirectory(t *testing.T) {
+	t.Run("creates directory if not exists", func(t *testing.T) {
+		tmpDir := filepath.Join(os.TempDir(), "vsr-test-ensure-models")
+		defer os.RemoveAll(tmpDir)
+
+		// Ensure it doesn't exist
+		os.RemoveAll(tmpDir)
+
+		mgr := NewModelManager(tmpDir)
+		err := mgr.ensureModelsDirectory()
+		if err != nil {
+			t.Errorf("Expected no error, got: %v", err)
+		}
+
+		// Check directory was created
+		if _, err := os.Stat(tmpDir); os.IsNotExist(err) {
+			t.Error("Directory was not created")
+		}
+	})
+
+	t.Run("no error if directory exists", func(t *testing.T) {
+		tmpDir := filepath.Join(os.TempDir(), "vsr-test-ensure-models-2")
+		_ = os.MkdirAll(tmpDir, 0o755)
+		defer os.RemoveAll(tmpDir)
+
+		mgr := NewModelManager(tmpDir)
+		err := mgr.ensureModelsDirectory()
+		if err != nil {
+			t.Errorf("Expected no error for existing directory, got: %v", err)
+		}
+	})
+}
diff --git a/src/semantic-router/pkg/cli/output.go b/src/semantic-router/pkg/cli/output.go
new file mode 100644
index 000000000..3eac00b3d
--- /dev/null
+++ b/src/semantic-router/pkg/cli/output.go
@@ -0,0 +1,76 @@
+package cli
+
+import (
+	"encoding/json"
+	"os"
+
+	"github.com/fatih/color"
+	"github.com/olekukonko/tablewriter"
+	"gopkg.in/yaml.v3"
+)
+
+// Color functions for terminal output
+var (
+	successColor = color.New(color.FgGreen, color.Bold)
+	errorColor   = color.New(color.FgRed, color.Bold)
+	warningColor = color.New(color.FgYellow, color.Bold)
+	infoColor    = color.New(color.FgCyan)
+)
+
+// Success prints a success message in green
+func Success(msg string) {
+	successColor.Println(msg)
+}
+
+// Error prints an error message in red
+func Error(msg string) {
+	errorColor.Println(msg)
+}
+
+// Warning prints a warning message in yellow
+func Warning(msg string) {
+	warningColor.Println(msg)
+}
+
+// Info prints an info message in cyan
+func Info(msg string) {
+	infoColor.Println(msg)
+}
+
+// PrintTable prints data in table format
+func PrintTable(headers []string, rows [][]string) {
+	table := tablewriter.NewWriter(os.Stdout)
+	table.SetHeader(headers)
+	table.SetAutoWrapText(false)
+	table.SetAutoFormatHeaders(true)
+	table.SetHeaderAlignment(tablewriter.ALIGN_LEFT)
+	table.SetAlignment(tablewriter.ALIGN_LEFT)
+	table.SetCenterSeparator("")
+	table.SetColumnSeparator("")
+	table.SetRowSeparator("")
+	table.SetHeaderLine(false)
+	table.SetBorder(false)
+	table.SetTablePadding("\t")
+	table.SetNoWhiteSpace(true)
+
+	for _, row := range rows {
+		table.Append(row)
+	}
+
+	table.Render()
+}
+
+// PrintJSON prints data in JSON format
+func PrintJSON(v interface{}) error {
+	encoder := json.NewEncoder(os.Stdout)
+	encoder.SetIndent("", "  ")
+	return encoder.Encode(v)
+}
+
+// PrintYAML prints data in YAML format
+func PrintYAML(v interface{}) error {
+	encoder := yaml.NewEncoder(os.Stdout)
+	encoder.SetIndent(2)
+	defer encoder.Close()
+	return encoder.Encode(v)
+}
diff --git a/src/semantic-router/pkg/cli/validator.go b/src/semantic-router/pkg/cli/validator.go
new file mode 100644
index 000000000..c12462e74
--- /dev/null
+++ b/src/semantic-router/pkg/cli/validator.go
@@ -0,0 +1,187 @@
+package cli
+
+import (
+	"errors"
+	"fmt"
+	"net/http"
+	"os"
+	"time"
+
+	"github.com/vllm-project/semantic-router/src/semantic-router/pkg/config"
+)
+
+// ValidationError represents a configuration validation error
+type ValidationError struct {
+	Field   string
+	Message string
+}
+
+func (e ValidationError) Error() string {
+	return fmt.Sprintf("%s: %s", e.Field, e.Message)
+}
+
+// ValidateConfig performs semantic validation on the configuration
+func ValidateConfig(cfg *config.RouterConfig) error {
+	var validationErrors []ValidationError
+
+	// Validate model consistency
+	if err := validateModelConsistency(cfg); err != nil {
+		var target ValidationError
+		if errors.As(err, &target) {
+			validationErrors = append(validationErrors, target)
+		}
+	}
+
+	// Validate categories
+	if err := validateCategories(cfg); err != nil {
+		var target ValidationError
+		if errors.As(err, &target) {
+			validationErrors = append(validationErrors, target)
+		}
+	}
+
+	// Validate category mapping path
+	if err := validateCategoryMappingPath(cfg); err != nil {
+		var target ValidationError
+		if errors.As(err, &target) {
+			validationErrors = append(validationErrors, target)
+		}
+	}
+
+	// Validate jailbreak
+	if err := validateJailbreak(cfg); err != nil {
+		var target ValidationError
+		if errors.As(err, &target) {
+			validationErrors = append(validationErrors, target)
+		}
+	}
+
+	// Validate PII
+	if err := validatePII(cfg); err != nil {
+		var target ValidationError
+		if errors.As(err, &target) {
+			validationErrors = append(validationErrors, target)
+		}
+	}
+
+	if len(validationErrors) > 0 {
+		return validationErrors[0] // Return first error
+	}
+
+	return nil
+}
+
+func validateModelConsistency(cfg *config.RouterConfig) error {
+	// Check that all models referenced in decisions exist in model_config
+	for _, decision := range cfg.Decisions {
+		for _, modelRef := range decision.ModelRefs {
+			if _, exists := cfg.ModelConfig[modelRef.Model]; !exists {
+				return ValidationError{
+					Field:   fmt.Sprintf("decisions.%s.modelRefs", decision.Name),
+					Message: fmt.Sprintf("model '%s' not found in model_config", modelRef.Model),
+				}
+			}
+		}
+	}
+
+	// Check that default_model exists
+	if cfg.DefaultModel != "" {
+		if _, exists := cfg.ModelConfig[cfg.DefaultModel]; !exists {
+			return ValidationError{
+				Field:   "default_model",
+				Message: fmt.Sprintf("default model '%s' not found in model_config", cfg.DefaultModel),
+			}
+		}
+	}
+
+	return nil
+}
+
+func validateCategories(cfg *config.RouterConfig) error {
+	if len(cfg.Categories) == 0 {
+		return ValidationError{
+			Field:   "categories",
+			Message: "at least one category must be defined",
+		}
+	}
+
+	for _, category := range cfg.Categories {
+		if len(category.ModelScores) == 0 {
+			return ValidationError{
+				Field:   fmt.Sprintf("categories.%s", category.Name),
+				Message: "model_scores must be defined for each category",
+			}
+		}
+	}
+
+	return nil
+}
+
+func validateCategoryMappingPath(cfg *config.RouterConfig) error {
+	if cfg.CategoryMappingPath == "" {
+		return ValidationError{
+			Field:   "category_mapping_path",
+			Message: "category_mapping_path must be defined",
+		}
+	}
+	if _, err := os.Stat(cfg.CategoryMappingPath); os.IsNotExist(err) {
+		return ValidationError{
+			Field:   "category_mapping_path",
+			Message: fmt.Sprintf("category_mapping.json file not found at %s", cfg.CategoryMappingPath),
+		}
+	}
+	return nil
+}
+
+func validateJailbreak(cfg *config.RouterConfig) error {
+	if cfg.PromptGuard.Enabled {
+		if cfg.PromptGuard.JailbreakMappingPath == "" {
+			return ValidationError{
+				Field:   "prompt_guard.jailbreak_mapping_path",
+				Message: "jailbreak_mapping_path must be defined when prompt_guard is enabled",
+			}
+		}
+		if _, err := os.Stat(cfg.PromptGuard.JailbreakMappingPath); os.IsNotExist(err) {
+			return ValidationError{
+				Field:   "prompt_guard.jailbreak_mapping_path",
+				Message: fmt.Sprintf("jailbreak_type_mapping.json file not found at %s", cfg.PromptGuard.JailbreakMappingPath),
+			}
+		}
+	}
+
+	return nil
+}
+
+func validatePII(cfg *config.RouterConfig) error {
+	if cfg.PromptGuard.Enabled {
+		if cfg.PIIMappingPath == "" {
+			return ValidationError{
+				Field:   "pii_mapping_path",
+				Message: "pii_mapping_path must be defined when prompt_guard is enabled",
+			}
+		}
+		if _, err := os.Stat(cfg.PIIMappingPath); os.IsNotExist(err) {
+			return ValidationError{
+				Field:   "pii_mapping_path",
+				Message: fmt.Sprintf("pii_type_mapping.json file not found at %s", cfg.PIIMappingPath),
+			}
+		}
+	}
+
+	return nil
+}
+
+// ValidateEndpointReachability checks if endpoints are reachable
+func ValidateEndpointReachability(endpoint string) error {
+	client := &http.Client{
+		Timeout: 5 * time.Second,
+	}
+
+	resp, err := client.Get(endpoint)
+	if err != nil {
+		return fmt.Errorf("endpoint not reachable: %w", err)
+	}
+	defer resp.Body.Close()
+
+	return nil
+}
diff --git a/src/semantic-router/pkg/cli/validator_test.go b/src/semantic-router/pkg/cli/validator_test.go
new file mode 100644
index 000000000..2ac65f35a
--- /dev/null
+++ b/src/semantic-router/pkg/cli/validator_test.go
@@ -0,0 +1,111 @@
+package cli
+
+import (
+	"testing"
+)
+
+func TestValidationError(t *testing.T) {
+	tests := []struct {
+		name     string
+		err      ValidationError
+		expected string
+	}{
+		{
+			name: "simple error",
+			err: ValidationError{
+				Field:   "test_field",
+				Message: "test message",
+			},
+			expected: "test_field: test message",
+		},
+		{
+			name: "nested field error",
+			err: ValidationError{
+				Field:   "decisions.test.modelRefs",
+				Message: "model not found",
+			},
+			expected: "decisions.test.modelRefs: model not found",
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			result := tt.err.Error()
+			if result != tt.expected {
+				t.Errorf("Error() = %q, expected %q", result, tt.expected)
+			}
+		})
+	}
+}
+
+func TestValidateModelConsistency(t *testing.T) {
+	// Skip testing internal config validation - this would require
+	// complex config setup. Instead, we test the higher-level ValidateConfig function.
+	t.Skip("Skipping validateModelConsistency unit tests - covered by integration tests")
+}
+
+func TestValidateCategories(t *testing.T) {
+	// Skip testing internal config validation - requires complex config setup
+	t.Skip("Skipping validateCategories unit tests - covered by integration tests")
+}
+
+func TestValidateCategoryMappingPath(t *testing.T) {
+	// Skip - requires complex config struct
+	t.Skip("Skipping - covered by integration tests")
+}
+
+func TestValidateJailbreak(t *testing.T) {
+	// Skip - requires complex config struct
+	t.Skip("Skipping - covered by integration tests")
+}
+
+func TestValidatePII(t *testing.T) {
+	// Skip - requires complex config struct
+	t.Skip("Skipping - covered by integration tests")
+}
+
+func TestValidateConfig(t *testing.T) {
+	// Skip complex config validation tests - requires full config structure
+	// These are better tested through end-to-end tests with actual config files
+	t.Skip("Skipping ValidateConfig unit tests - covered by integration tests")
+}
+
+func TestValidateEndpointReachability(t *testing.T) {
+	tests := []struct {
+		name        string
+		endpoint    string
+		expectError bool
+	}{
+		{
+			name:        "invalid endpoint",
+			endpoint:    "http://invalid-endpoint-that-does-not-exist-12345:9999",
+			expectError: true,
+		},
+		{
+			name:        "malformed URL",
+			endpoint:    "not-a-url",
+			expectError: true,
+		},
+		{
+			name:        "empty endpoint",
+			endpoint:    "",
+			expectError: true,
+		},
+	}
+
+	for _, tt := range tests {
+		t.Run(tt.name, func(t *testing.T) {
+			err := ValidateEndpointReachability(tt.endpoint)
+
+			if tt.expectError {
+				if err == nil {
+					t.Error("Expected error but got none")
+				}
+			} else {
+				if err != nil {
+					t.Errorf("Unexpected error: %v", err)
+				}
+			}
+		})
+	}
+}
diff --git a/src/semantic-router/pkg/config/config.go b/src/semantic-router/pkg/config/config.go
index fcde8c42a..cb4fb4fb1 100644
--- a/src/semantic-router/pkg/config/config.go
+++ b/src/semantic-router/pkg/config/config.go
@@ -672,6 +672,15 @@ const (
 type Category struct {
 	// Metadata
 	CategoryMetadata `yaml:",inline"`
+	// ModelScores for the category
+	ModelScores []ModelScore `yaml:"model_scores,omitempty"`
+}
+
+// ModelScore represents a model's score for a category
+type ModelScore struct {
+	Model        string  `yaml:"model"`
+	Score        float64 `yaml:"score"`
+	UseReasoning *bool   `yaml:"use_reasoning"`
 }
 
 // Decision represents a routing decision that combines multiple rules with AND/OR logic
diff --git a/src/semantic-router/pkg/extproc/req_filter_cache.go b/src/semantic-router/pkg/extproc/req_filter_cache.go
index 76329d661..e4bc2b579 100644
--- a/src/semantic-router/pkg/extproc/req_filter_cache.go
+++ b/src/semantic-router/pkg/extproc/req_filter_cache.go
@@ -68,6 +68,13 @@ func (r *OpenAIRouter) handleCaching(ctx *RequestContext, categoryName string) (
 		} else if found {
 			// Mark this request as a cache hit
 			ctx.VSRCacheHit = true
+
+			// Set VSR decision context even for cache hits so headers are populated
+			// The categoryName passed here is the decision name from classification
+			if categoryName != "" {
+				ctx.VSRSelectedDecisionName = categoryName
+			}
+
 			// Log cache hit
 			logging.LogEvent("cache_hit", map[string]interface{}{
 				"request_id": ctx.RequestID,
@@ -77,7 +84,7 @@ func (r *OpenAIRouter) handleCaching(ctx *RequestContext, categoryName string) (
 				"threshold":  threshold,
 			})
 			// Return immediate response from cache
-			response := http.CreateCacheHitResponse(cachedResponse, ctx.ExpectStreamingResponse)
+			response := http.CreateCacheHitResponse(cachedResponse, ctx.ExpectStreamingResponse, categoryName)
 			ctx.TraceContext = spanCtx
 			return response, true
 		}
diff --git a/src/semantic-router/pkg/k8s/reconciler.go b/src/semantic-router/pkg/k8s/reconciler.go
index ce2fc9fb3..b56fb2b2e 100644
--- a/src/semantic-router/pkg/k8s/reconciler.go
+++ b/src/semantic-router/pkg/k8s/reconciler.go
@@ -259,7 +259,15 @@ func (r *Reconciler) validateAndUpdate(ctx context.Context, pool *v1alpha1.Intel
 	// Create new config by merging with static config
 	newConfig := *r.staticConfig
 	newConfig.BackendModels = *backendModels
-	newConfig.IntelligentRouting = *intelligentRouting
+
+	// Copy IntelligentRouting fields explicitly (since it's embedded with ,inline in YAML)
+	// Assigning the whole struct doesn't work correctly with embedded structs
+	newConfig.KeywordRules = intelligentRouting.KeywordRules
+	newConfig.EmbeddingRules = intelligentRouting.EmbeddingRules
+	newConfig.Categories = intelligentRouting.Categories
+	newConfig.Decisions = intelligentRouting.Decisions
+	newConfig.Strategy = intelligentRouting.Strategy
+	newConfig.ReasoningConfig = intelligentRouting.ReasoningConfig
 
 	// Call update callback
 	if r.onConfigUpdate != nil {
diff --git a/src/semantic-router/pkg/utils/http/response.go b/src/semantic-router/pkg/utils/http/response.go
index 879bff3f7..617cbd9da 100644
--- a/src/semantic-router/pkg/utils/http/response.go
+++ b/src/semantic-router/pkg/utils/http/response.go
@@ -249,7 +249,7 @@ func CreateJailbreakViolationResponse(jailbreakType string, confidence float32,
 }
 
 // CreateCacheHitResponse creates an immediate response from cache
-func CreateCacheHitResponse(cachedResponse []byte, isStreaming bool) *ext_proc.ProcessingResponse {
+func CreateCacheHitResponse(cachedResponse []byte, isStreaming bool, vsrDecisionName string) *ext_proc.ProcessingResponse {
 	var responseBody []byte
 	var contentType string
 
@@ -299,25 +299,38 @@ func CreateCacheHitResponse(cachedResponse []byte, isStreaming bool) *ext_proc.P
 		responseBody = cachedResponse
 	}
 
+	// Build headers including VSR decision headers for cache hits
+	setHeaders := []*core.HeaderValueOption{
+		{
+			Header: &core.HeaderValue{
+				Key:      "content-type",
+				RawValue: []byte(contentType),
+			},
+		},
+		{
+			Header: &core.HeaderValue{
+				Key:      headers.VSRCacheHit,
+				RawValue: []byte("true"),
+			},
+		},
+	}
+
+	// Add VSR decision header if provided
+	if vsrDecisionName != "" {
+		setHeaders = append(setHeaders, &core.HeaderValueOption{
+			Header: &core.HeaderValue{
+				Key:      headers.VSRSelectedDecision,
+				RawValue: []byte(vsrDecisionName),
+			},
+		})
+	}
+
 	immediateResponse := &ext_proc.ImmediateResponse{
 		Status: &typev3.HttpStatus{
 			Code: typev3.StatusCode_OK,
 		},
 		Headers: &ext_proc.HeaderMutation{
-			SetHeaders: []*core.HeaderValueOption{
-				{
-					Header: &core.HeaderValue{
-						Key:      "content-type",
-						RawValue: []byte(contentType),
-					},
-				},
-				{
-					Header: &core.HeaderValue{
-						Key:      headers.VSRCacheHit,
-						RawValue: []byte("true"),
-					},
-				},
-			},
+			SetHeaders: setHeaders,
 		},
 		Body: responseBody,
 	}
diff --git a/src/semantic-router/pkg/utils/http/response_test.go b/src/semantic-router/pkg/utils/http/response_test.go
index b53539fc9..22f3ce410 100644
--- a/src/semantic-router/pkg/utils/http/response_test.go
+++ b/src/semantic-router/pkg/utils/http/response_test.go
@@ -38,7 +38,7 @@ func TestCreateCacheHitResponse_NonStreaming(t *testing.T) {
 	}
 
 	// Test non-streaming response
-	response := CreateCacheHitResponse(cachedResponse, false)
+	response := CreateCacheHitResponse(cachedResponse, false, "test_decision")
 
 	// Verify response structure
 	if response == nil {
@@ -121,7 +121,7 @@ func TestCreateCacheHitResponse_Streaming(t *testing.T) {
 	}
 
 	// Test streaming response
-	response := CreateCacheHitResponse(cachedResponse, true)
+	response := CreateCacheHitResponse(cachedResponse, true, "test_decision")
 
 	// Verify response structure
 	if response == nil {
@@ -226,7 +226,7 @@ func TestCreateCacheHitResponse_StreamingWithInvalidJSON(t *testing.T) {
 	// Test with invalid JSON
 	invalidJSON := []byte("invalid json")
 
-	response := CreateCacheHitResponse(invalidJSON, true)
+	response := CreateCacheHitResponse(invalidJSON, true, "")
 
 	// Verify response structure
 	if response == nil {
diff --git a/tools/make/build-run-test.mk b/tools/make/build-run-test.mk
index f1fe5f619..a29050051 100644
--- a/tools/make/build-run-test.mk
+++ b/tools/make/build-run-test.mk
@@ -15,6 +15,29 @@ build-router: $(if $(CI),rust-ci,rust)
 	@mkdir -p bin
 	@cd src/semantic-router && go build --tags=milvus -o ../../bin/router cmd/main.go
 
+# Build vsr CLI
+build-cli: ## Build the vsr CLI tool
+	@$(LOG_TARGET)
+	@mkdir -p bin
+	@cd src/semantic-router && go build -ldflags="-r $(PWD)/candle-binding/target/release" -o ../../bin/vsr cmd/vsr/main.go
+	@echo "vsr CLI built successfully: bin/vsr"
+
+# Build all (router + CLI)
+build-all: ## Build both router and CLI
+build-all: build-router build-cli
+
+# Install vsr CLI to system
+install-cli: ## Install vsr CLI to /usr/local/bin
+install-cli: build-cli
+	@cp bin/vsr /usr/local/bin/vsr
+	@chmod +x /usr/local/bin/vsr
+	@echo "vsr installed to /usr/local/bin/vsr"
+
+# Test CLI
+test-cli: ## Run CLI unit tests  
+	@$(LOG_TARGET)
+	@cd src/semantic-router && go test -v ./cmd/vsr/commands/...
+
 # Run the router
 run-router: ## Run the router with the specified config
 run-router: build-router download-models
@@ -141,7 +164,7 @@ start-llm-katan:
 test-e2e-vllm: ## Run e2e tests with LLM Katan servers (make sure servers are running)
 test-e2e-vllm:
 	@echo "Running e2e tests with LLM Katan servers..."
-	@echo "⚠️  Note: Make sure LLM Katan servers are running with 'make start-llm-katan'"
+	@echo "Note: Make sure LLM Katan servers are running with 'make start-llm-katan'"
 	@python3 e2e-tests/run_all_tests.py
 
 # Run hallucination detection benchmark
diff --git a/website/docs/cli/commands-reference.md b/website/docs/cli/commands-reference.md
new file mode 100644
index 000000000..2cd13beda
--- /dev/null
+++ b/website/docs/cli/commands-reference.md
@@ -0,0 +1,123 @@
+# VSR CLI Command Reference
+
+## Global Flags
+
+- `--config, -c`: Path to the configuration file (default: `config/config.yaml`)
+- `--verbose, -v`: Enable verbose output for debugging
+- `--output, -o`: Output format (table, json, yaml) (default: `table`)
+
+## Commands
+
+### `vsr init`
+
+Initialize a new configuration file.
+
+**Usage:**
+
+```bash
+vsr init [flags]
+```
+
+**Flags:**
+
+- `--output`: Output path for the configuration file (default: `config/config.yaml`)
+- `--template`: Template to use: `default`, `minimal`, `full` (default: `default`)
+
+### `vsr config`
+
+Manage router configuration.
+
+**Subcommands:**
+
+- `view`: Display the current configuration.
+- `edit`: Open configuration in your default editor (uses `$EDITOR`).
+- `validate`: Validate configuration file syntax and semantics.
+- `set <key> <value>`: Set a specific configuration value using dot notation.
+- `get <key>`: Retrieve a specific configuration value.
+
+**Examples:**
+
+```bash
+vsr config set bert_model.threshold 0.7
+vsr config get default_model
+```
+
+### `vsr deploy`
+
+Deploy the router to a target environment.
+
+**Usage:**
+
+```bash
+vsr deploy [local|docker|kubernetes] [flags]
+```
+
+**Subcommands:**
+
+- `local`: Run the router as a local process.
+- `docker`: Deploy using Docker Compose.
+- `kubernetes`: Deploy to a Kubernetes cluster.
+
+**Flags:**
+
+- `--observability`: Enable observability stack (Prometheus, Grafana, Jaeger).
+- `--namespace` (Kubernetes only): Target namespace (default: `default`).
+
+### `vsr undeploy`
+
+Remove a deployment.
+
+**Usage:**
+
+```bash
+vsr undeploy [local|docker|kubernetes]
+```
+
+### `vsr status`
+
+Check the status of the router and its components.
+
+**Usage:**
+
+```bash
+vsr status
+```
+
+### `vsr logs`
+
+Fetch or stream logs from the router.
+
+**Usage:**
+
+```bash
+vsr logs [flags]
+```
+
+**Flags:**
+
+- `--follow, -f`: Follow log output.
+- `--tail, -n`: Number of lines to show from the end (default: 100).
+
+### `vsr get`
+
+Retrieve information about configured resources.
+
+**Usage:**
+
+```bash
+vsr get [models|categories|decisions|endpoints]
+```
+
+### `vsr test-prompt`
+
+Send a test prompt to the router to verify classification.
+
+**Usage:**
+
+```bash
+vsr test-prompt <text> [flags]
+```
+
+**Flags:**
+
+- `--endpoint`: Router API endpoint (default: `http://localhost:8080/v1/classify`).
diff --git a/website/docs/cli/overview.md b/website/docs/cli/overview.md
new file mode 100644
index 000000000..901421cd8
--- /dev/null
+++ b/website/docs/cli/overview.md
@@ -0,0 +1,76 @@
+# VSR CLI Overview
+
+The `vsr` (vLLM Semantic Router) CLI is a unified command-line tool designed to simplify the installation, configuration, deployment, and management of the Semantic Router.
+
+## Key Features
+
+- **Easy Installation**: Guided installation and setup process.
+- **Configuration Management**: View, edit, validate, and modify configuration files with ease.
+- **Deployment**: Deploy the router locally, via Docker Compose, or to Kubernetes with a single command.
+- **Status & Monitoring**: Check service health and view logs.
+- **Testing**: interactive prompt testing to verify routing logic.
+
+## Installation
+
+### From Binary
+
+Download the latest release for your platform and add it to your PATH.
+
+### From Source
+
+```bash
+
+make install-cli
+
+```
+
+## Quick Start
+
+1. **Initialize a new configuration:**
+   
+   ```bash
+   vsr init
+   ```
+
+   This creates a `config/config.yaml` file with default settings.
+
+2. **Edit the configuration:**
+   
+   ```bash
+   vsr config edit
+   ```
+
+   Opens the configuration file in your default editor.
+
+3. **Validate the configuration:**
+   
+   ```bash
+   vsr config validate
+   ```
+
+   Ensures your configuration is syntactically and semantically correct.
+
+4. **Deploy the router:**
+   
+   ```bash
+   vsr deploy docker
+   ```
+
+   Starts the router using Docker Compose.
+
+5. **Check status:**
+   
+   ```bash
+   vsr status
+   ```
+
+6. **Test a prompt:**
+   
+   ```bash
+   vsr test-prompt "What is the derivative of x^2?"
+   ```
+   
+## Next Steps
+
+- [Command Reference](commands-reference.md)
+- [Troubleshooting](troubleshooting.md)
diff --git a/website/docs/cli/troubleshooting.md b/website/docs/cli/troubleshooting.md
new file mode 100644
index 000000000..ec671ef84
--- /dev/null
+++ b/website/docs/cli/troubleshooting.md
@@ -0,0 +1,48 @@
+# VSR CLI Troubleshooting
+
+## Common Issues
+
+### "Config file not found"
+
+**Error:** `failed to read config: open config/config.yaml: no such file or directory`
+
+**Solution:**
+Run `vsr init` to generate a configuration file, or specify the correct path using the `--config` flag.
+
+### "Validation failed"
+
+**Error:** `❌ Semantic validation failed: ...`
+
+**Solution:**
+
+- Check that all models referenced in `decisions` are defined in `model_config`.
+- Ensure at least one category is defined.
+- Verify YAML syntax indentation.
+
+### "Docker command not found"
+
+**Error:** `docker-compose not found`
+
+**Solution:**
+Ensure Docker and Docker Compose are installed and available in your system PATH.
+
+### "Endpoint not reachable" during deployment
+
+**Solution:**
+
+- Check if the router process is running (`vsr status`).
+- Verify that the port (default 8080) is not blocked by a firewall.
+- If running in Docker, ensure ports are correctly mapped.
+
+### "Unknown resource" in `vsr get`
+
+**Solution:**
+Valid resources are `models`, `categories`, `decisions`, and `endpoints`. Check your spelling.
+
+## Debugging
+
+Use the `--verbose` flag to see detailed logs and error traces:
+
+```bash
+vsr deploy docker --verbose
+```