Developer Guide

Guide for developers contributing to Workload-Variant-Autoscaler.

Development Environment Setup

Prerequisites

Go 1.23.0+
Docker 17.03+
kubectl 1.32.0+
Kind (for local testing)
Make

Initial Setup

Clone the repository:

git clone https://github.com/llm-d-incubation/workload-variant-autoscaler.git
cd workload-variant-autoscaler

Install dependencies:
```
go mod download
```

Install development tools:

make setup-envtest
make controller-gen
make kustomize

Project Structure

workload-variant-autoscaler/
├── api/v1alpha1/          # CRD definitions
├── cmd/                   # Main application entry points
├── config/                # Kubernetes manifests
│   ├── crd/              # CRD manifests
│   ├── rbac/             # RBAC configurations
│   ├── manager/          # Controller deployment
│   └── samples/          # Example resources
├── deploy/                # Deployment scripts
│   ├── kubernetes/       # K8s deployment
│   ├── openshift/        # OpenShift deployment
│   └── kind/             # Local development
├── docs/                  # Documentation
├── internal/              # Private application code
│   ├── controller/       # Controller implementation
│   ├── collector/        # Metrics collection
│   ├── optimizer/        # Optimization logic
│   ├── actuator/         # Metric emission & scaling
│   └── modelanalyzer/    # Model analysis
├── pkg/                   # Public libraries
│   ├── analyzer/         # Queue theory models
│   ├── solver/           # Optimization algorithms
│   ├── core/             # Core domain models
│   └── config/           # Configuration structures
├── test/                  # Tests
│   ├── e2e/              # End-to-end tests
│   └── utils/            # Test utilities
└── tools/                 # Development tools
    └── vllm-emulator/    # Testing emulator

Development Workflow

Running Locally

Option 1: Outside the cluster

# Run the controller on your machine (connects to configured cluster)
make run

Option 2: In a Kind cluster

# Create a Kind cluster with emulated GPUs
make create-kind-cluster

# Deploy the controller
make deploy IMG=<your-image>

# Or deploy with llm-d infrastructure
make deploy-llm-d-wva-emulated-on-kind

Making Changes

Create a feature branch:
```
git checkout -b feature/my-feature
```
Make your changes

Generate code if needed:

# After modifying CRDs
make manifests generate

Run tests:
```
make test
```
Run linter:
```
make lint
```

Building and Testing

Build the Binary

make build

The binary will be in bin/manager.

Build Docker Image

make docker-build IMG=<your-registry>/wva-controller:tag

Push Docker Image

make docker-push IMG=<your-registry>/wva-controller:tag

Multi-architecture Build

PLATFORMS=linux/arm64,linux/amd64 make docker-buildx IMG=<your-registry>/wva-controller:tag

Testing

Unit Tests

# Run all unit tests
make test

# Run specific package tests
go test ./internal/optimizer/...

# With coverage
go test -cover ./...

E2E Tests

Kind E2E Tests

# Run all E2E tests
make test-e2e

# Run specific tests
make test-e2e FOCUS="single VA"

# Skip specific tests
make test-e2e SKIP="multiple VA"

OpenShift E2E Tests

# Run E2E tests on OpenShift cluster
make test-e2e-openshift

# With custom image
make test-e2e-openshift IMG=<your-registry>/wva-controller:tag

# Run specific OpenShift tests
make test-e2e-openshift FOCUS="HPA integration"

Prerequisites for OpenShift E2E:

Access to an OpenShift cluster (OCP 4.12+)
oc CLI tool configured and authenticated
Cluster admin permissions
Prometheus operator installed

See Testing Guide for more details.

Manual Testing

Deploy to Kind cluster:

make deploy-llm-d-wva-emulated-on-kind IMG=<your-image>

Create test resources:
```
kubectl apply -f config/samples/
```

Monitor controller logs:

kubectl logs -n workload-variant-autoscaler-system \
  deployment/workload-variant-autoscaler-controller-manager -f

Code Generation

After Modifying CRDs

# Generate deepcopy, CRD manifests, and RBAC
make manifests generate

Generate CRD Documentation

make crd-docs

Output will be in docs/user-guide/crd-reference.md.

Debugging

VSCode Launch Configuration

Create .vscode/launch.json:

{
  "version": "0.2.0",
  "configurations": [
    {
      "name": "Debug Controller",
      "type": "go",
      "request": "launch",
      "mode": "auto",
      "program": "${workspaceFolder}/cmd/main.go",
      "env": {
        "KUBECONFIG": "${env:HOME}/.kube/config"
      },
      "args": []
    }
  ]
}

Debugging in Cluster

# Build debug image
go build -gcflags="all=-N -l" -o bin/manager cmd/main.go

# Deploy and attach debugger (e.g., Delve)

Viewing Controller Logs

kubectl logs -n workload-variant-autoscaler-system \
  -l control-plane=controller-manager --tail=100 -f

Common Development Tasks

Adding a New Field to CRD

Modify api/v1alpha1/variantautoscaling_types.go
Run make manifests generate
Update tests
Run make crd-docs
Update user documentation

Adding a New Metric

Define metric in internal/metrics/metrics.go
Emit metric from appropriate controller location
Update Prometheus integration docs
Add to Grafana dashboards (if applicable)

Modifying Optimization Logic

Update code in pkg/solver/ or pkg/analyzer/
Add/update unit tests
Run make test
Update design documentation if algorithm changes

Documentation

Updating Documentation

After code changes, update relevant docs in:

docs/user-guide/ - User-facing changes
docs/design/ - Architecture/design changes
docs/integrations/ - Integration guide updates

Testing Documentation

Verify all commands and examples in documentation work:

# Test installation steps
# Test configuration examples
# Test all code snippets

Release Process

See Releasing Guide (coming soon) for the release process.

Getting Help

Check CONTRIBUTING.md
Review existing code and tests
Ask in GitHub Discussions
Attend community meetings

Useful Commands

# Format code
make fmt

# Vet code
make vet

# Run linter
make lint

# Fix linting issues
make lint-fix

# Clean build artifacts
rm -rf bin/ dist/

# Reset Kind cluster
make destroy-kind-cluster
make create-kind-cluster

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Developer Guide

Development Environment Setup

Prerequisites

Initial Setup

Project Structure

Development Workflow

Running Locally

Making Changes

Building and Testing

Build the Binary

Build Docker Image

Push Docker Image

Multi-architecture Build

Testing

Unit Tests

E2E Tests

Kind E2E Tests

OpenShift E2E Tests

Manual Testing

Code Generation

After Modifying CRDs

Generate CRD Documentation

Debugging

VSCode Launch Configuration

Debugging in Cluster

Viewing Controller Logs

Common Development Tasks

Adding a New Field to CRD

Adding a New Metric

Modifying Optimization Logic

Documentation

Updating Documentation

Testing Documentation

Release Process

Getting Help

Useful Commands

Next Steps

FilesExpand file tree

development.md

Latest commit

History

development.md

File metadata and controls

Developer Guide

Development Environment Setup

Prerequisites

Initial Setup

Project Structure

Development Workflow

Running Locally

Making Changes

Building and Testing

Build the Binary

Build Docker Image

Push Docker Image

Multi-architecture Build

Testing

Unit Tests

E2E Tests

Kind E2E Tests

OpenShift E2E Tests

Manual Testing

Code Generation

After Modifying CRDs

Generate CRD Documentation

Debugging

VSCode Launch Configuration

Debugging in Cluster

Viewing Controller Logs

Common Development Tasks

Adding a New Field to CRD

Adding a New Metric

Modifying Optimization Logic

Documentation

Updating Documentation

Testing Documentation

Release Process

Getting Help

Useful Commands

Next Steps