CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Overview

This is a Call Center Voice Agent Accelerator built with Azure Voice Live API and Azure Communication Services (ACS). It provides real-time speech-to-speech voice agents for call center scenarios with two client modes: web browser (for testing) and ACS phone calls (for production).

Technology Stack

Backend: Python 3.9+ with Quart (async Flask-like framework)
Package Manager: UV (fast Python dependency management via pyproject.toml)
Infrastructure: Azure Bicep templates for IaC
Deployment: Azure Developer CLI (azd)
Key Azure Services:
- Azure Voice Live API (Speech-to-speech with integrated ASR, LLM, TTS)
- Azure Communication Services (telephony/call automation)
- Azure Container Apps (hosting)
- Azure Container Registry
- Azure Key Vault (stores ACS connection string)

Development Commands

Local Development (server/ directory)

# Run the server locally
uv run server.py

# Access web client at http://127.0.0.1:8000

Docker Development

# Build image
docker build -t voiceagent .

# Run with environment variables
docker run --env-file .env -p 8000:8000 -it voiceagent

Deployment

# Login to Azure
azd auth login

# Deploy all resources (initial + updates)
azd up

# Deploy code changes only
azd deploy

# Clean up all resources
azd down

Testing with ACS Phone Client (Local)

Use Azure DevTunnels to expose local server for webhook testing:

devtunnel login
devtunnel create --allow-anonymous
devtunnel port create -p 8000
devtunnel host

Architecture

Core Application Structure

server/
├── server.py                    # Main Quart application with routes
├── app/
│   └── handler/
│       ├── acs_event_handler.py # Processes ACS incoming calls and callbacks
│       └── acs_media_handler.py # Manages audio streaming to Voice Live API
└── static/                      # Web client HTML/JS

Request Flow

Web Client Mode: Browser → /web/ws WebSocket → ACSMediaHandler → Voice Live API
ACS Phone Mode: Phone Call → ACS IncomingCall event → /acs/incomingcall → Answer call with media streaming → /acs/ws WebSocket → ACSMediaHandler → Voice Live API

Key Handlers

AcsEventHandler (acs_event_handler.py): Handles EventGrid subscription validation and incoming call events. Answers calls with MediaStreamingOptions configured for bidirectional audio.
ACSMediaHandler (acs_media_handler.py): Establishes WebSocket connection to Voice Live API, manages audio queues, and handles bidirectional audio streaming. Uses managed identity or API key authentication.

Infrastructure (infra/)

Bicep modules provision:

User-assigned managed identity (for Key Vault and AI services access)
AI Services (Voice Live API endpoint)
Communication Services (telephony)
Container Apps + Container Registry
Key Vault (stores ACS connection string as secret)
Monitoring (Log Analytics, Application Insights)

The main deployment is subscription-scoped (infra/main.bicep). Note: Limited to eastus2 and swedencentral regions due to Voice Live API availability.

Environment Configuration

Create .env file in server/ directory based on .env-sample.txt:

AZURE_VOICE_LIVE_API_KEY=<AI Foundry resource key>
AZURE_VOICE_LIVE_ENDPOINT=<AI Foundry resource endpoint>
VOICE_LIVE_MODEL=gpt-4o-mini
ACS_CONNECTION_STRING=<Communication Services connection string>
ACS_DEV_TUNNEL=<Optional: DevTunnel URL for local ACS testing>

When deployed to Azure, the container app uses:

Managed Identity for Voice Live API authentication
Key Vault secret reference for ACS connection string

Voice Live API Configuration

Session configuration is defined in acs_media_handler.py:session_config():

Turn Detection: Azure Semantic VAD with end-of-utterance detection
Audio Processing: Deep noise suppression and server echo cancellation
Voice: Configurable Azure Neural TTS voice (default: en-US-Aria)
Instructions: Customizable system prompt for LLM behavior

Post-Deployment Setup

After azd up:

Navigate to the Container App URL to test the web client
For phone testing:
- Create Event Grid subscription for IncomingCall events pointing to https://<container-app-url>/acs/incomingcall
- Provision a phone number for the ACS resource
- Call the number to test the voice agent

Important Notes

Security: ACS connection string is stored in Key Vault. Container app retrieves it via secret reference.
Authentication: Production deployments use managed identity for Voice Live API. Local development uses API key.
Region Constraints: Voice Live API is only available in specific regions (swedencentral strongly recommended).
WebSocket Endpoints: /web/ws for browser clients (raw audio), /acs/ws for ACS calls (PCM 24kHz mono).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md

Overview

Technology Stack

Development Commands

Local Development (server/ directory)

Docker Development

Deployment

Testing with ACS Phone Client (Local)

Architecture

Core Application Structure

Request Flow

Key Handlers

Infrastructure (infra/)

Environment Configuration

Voice Live API Configuration

Post-Deployment Setup

Important Notes

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md

Overview

Technology Stack

Development Commands

Local Development (server/ directory)

Docker Development

Deployment

Testing with ACS Phone Client (Local)

Architecture

Core Application Structure

Request Flow

Key Handlers

Infrastructure (infra/)

Environment Configuration

Voice Live API Configuration

Post-Deployment Setup

Important Notes