This document describes the high-level architecture of the Orpheus wildlife monitoring and cross-species communication platform.
Orpheus is a Python monorepo designed for real-time wildlife monitoring on edge computing hardware (NVIDIA Jetson Orin NX). The platform follows a modular, service-oriented architecture that enables:
- Real-time audio/video processing with low-latency ML inference
- Distributed agent communication via MQTT message broker
- Edge deployment on resource-constrained hardware
- Cross-platform development (macOS for development, ARM Linux for production)

```mermaid
graph TB
subgraph Hardware["🔌 Hardware Layer"]
MIC[🎤 USB Microphone<br/>Behringer UMC404HD]
CAM[📷 Camera<br/>Amcrest/CSI]
end
subgraph Agents["🤖 Detection Agents"]
AUDIO[Audio Motion Agent<br/>orpheus-agent-audio-motion]
VIDEO[Video Motion Agent<br/>orpheus-agent-video-motion]
BIRD[Bird Detection Agent<br/>orpheus-agent-bird-detection]
CROW[Crow Detection Agent<br/>orpheus-agent-crow-detection]
PLAYBACK[Audio Playback Agent<br/>orpheus-agent-audio-playback]
end
subgraph VideoCapture["📹 Video Capture"]
SNAP[Video Snapshotter<br/>orpheus-agent-video-snapshotter]
TIMELAPSE[Video Timelapser<br/>orpheus-agent-video-timelapser]
end
subgraph Services["⚙️ Core Services"]
MQTT[MQTT Broker<br/>Mosquitto :1883]
DASH[Dashboard<br/>FastAPI :8080]
end
subgraph Platform["📦 Platform Library"]
COMMON[orpheus-common<br/>Config • MQTT • Storage • Logging]
end
subgraph Storage["💾 Storage"]
DATA[/data/orpheus/<br/>Audio clips, Logs, Events/]
end
MIC --> AUDIO
CAM --> VIDEO
CAM --> SNAP
SNAP --> TIMELAPSE
AUDIO --> MQTT
AUDIO --> BIRD
AUDIO --> CROW
VIDEO --> MQTT
BIRD --> MQTT
CROW --> MQTT
MQTT --> DASH
MQTT --> PLAYBACK
AUDIO --> DATA
VIDEO --> DATA
BIRD --> DATA
CROW --> DATA
SNAP --> DATA
TIMELAPSE --> DATA
COMMON --> AUDIO
COMMON --> VIDEO
COMMON --> BIRD
COMMON --> CROW
COMMON --> PLAYBACK
COMMON --> DASH
style Hardware fill:#e1f5fe
style Agents fill:#fff3e0
style Services fill:#e8f5e9
style Platform fill:#f3e5f5
style Storage fill:#fce4ec
```
```mermaid
sequenceDiagram
participant M as 🎤 Microphone
participant A as Audio Agent
participant Q as MQTT Broker
participant D as Dashboard
participant S as Storage
M->>A: Audio Stream (48kHz)
loop Every 100ms frame
A->>A: Analyze audio level
alt Motion Detected
A->>S: Save audio clip (.flac)
A->>Q: Publish detection event
Q->>D: Forward event
D->>D: Update UI
end
end
A->>Q: Publish health status (every 5s)
Q->>D: Forward status
```
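Below is a minimal sketch of the publish side of this flow: after a detection the agent publishes an event, and a heartbeat goes out every 5 seconds. The topic names follow the diagram; the payload fields, the exact health sub-topic, and the 1.x-style paho-mqtt client are assumptions — the real agents publish through the orpheus-common MQTT wrapper.

```python
# Sketch only: 1.x-style paho client constructor; payload fields and the
# health sub-topic below are illustrative, not the real agent schema.
import json
import time
from datetime import datetime, timezone

import paho.mqtt.client as mqtt

client = mqtt.Client()
client.connect("localhost", 1883)            # Mosquitto broker from the Core Services layer
client.loop_start()                          # network loop in a background thread


def publish_motion_event(channel_id: str, clip_path: str) -> None:
    event = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "channel_id": channel_id,
        "audio_clip_path": clip_path,
    }
    client.publish("orpheus/audio/motion/events", json.dumps(event))


while True:
    # Heartbeat every 5 s, matching the sequence diagram above.
    client.publish("orpheus/system/audio-motion/health", json.dumps({"status": "ok"}))
    time.sleep(5)
```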
```mermaid
graph LR
subgraph Root["📁 orpheus/"]
subgraph P["platform/"]
COMMON["orpheus-common<br/>━━━━━━━━━━━━<br/>config.py<br/>mqtt.py<br/>logging.py<br/>storage/"]
end
subgraph S["services/"]
MQTT["orpheus-mqtt<br/>━━━━━━━━━━━<br/>Mosquitto broker"]
DASH["orpheus-dashboard<br/>━━━━━━━━━━━━━<br/>FastAPI + JS"]
end
subgraph A["agents/"]
AUDIO["orpheus-agent-<br/>audio-motion<br/>━━━━━━━━━━━<br/>Detection agent"]
end
end
COMMON --> DASH
COMMON --> AUDIO
AUDIO --> MQTT
DASH --> MQTT
style P fill:#f3e5f5
style S fill:#e8f5e9
style A fill:#fff3e0
```

| Constraint | Requirement |
|---|---|
| Target Hardware | NVIDIA Jetson Orin NX (ARM) |
| Python Version | 3.9.5 (locked for Jetson compatibility) |
| Development Platforms | macOS (Apple Silicon), Ubuntu |
| ML Framework | PyTorch with CUDA/TensorRT |

```mermaid
graph LR
subgraph Input["🎤 Audio Input"]
ALSA[ALSA Source<br/>USB Audio]
DEFAULT[Default Input<br/>System Mic]
SYNTH[Synthetic<br/>Test Signal]
end
subgraph Processing["⚡ Processing"]
PROC[Channel Processor]
DET[Detector Algorithm<br/>• Fixed Threshold<br/>• Adaptive Threshold]
BUF[Pre-buffer<br/>Ring Buffer]
end
subgraph Output["📤 Output"]
CLIP[Clip Saver<br/>.flac files]
PUB[MQTT Publisher<br/>Events & Status]
end
ALSA --> PROC
DEFAULT --> PROC
SYNTH --> PROC
PROC --> DET
DET --> BUF
BUF --> CLIP
DET --> PUB
style Input fill:#e3f2fd
style Processing fill:#fff8e1
style Output fill:#e8f5e9
```
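The pre-buffer shown above is essentially a fixed-duration ring buffer of recent audio frames, so a saved clip can include sound from just before the trigger. A minimal sketch of that idea follows; the class and parameter names are illustrative, not the actual orpheus-agent-audio-motion API.

```python
# Illustrative sketch, not the real pre-buffer implementation.
from collections import deque

import numpy as np


class PreBuffer:
    """Fixed-duration ring buffer of audio frames for one channel."""

    def __init__(self, sample_rate: int = 48_000, frame_ms: int = 100, seconds: float = 2.0):
        self.frame_samples = sample_rate * frame_ms // 1000
        max_frames = int(seconds * 1000 / frame_ms)
        self._frames: deque[np.ndarray] = deque(maxlen=max_frames)

    def push(self, frame: np.ndarray) -> None:
        """Append one frame; the oldest frame falls off automatically."""
        self._frames.append(frame)

    def snapshot(self) -> np.ndarray:
        """Return the buffered audio as one contiguous array (oldest first)."""
        if not self._frames:
            return np.zeros(0, dtype=np.float32)
        return np.concatenate(list(self._frames))


if __name__ == "__main__":
    buf = PreBuffer(seconds=1.0)
    for _ in range(20):                      # feed 2 s of frames, keep only the last 1 s
        buf.push(np.zeros(4800, dtype=np.float32))
    print(buf.snapshot().shape)              # (48000,)
```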
```mermaid
graph TB
subgraph Browser["🌐 Browser"]
JS[JavaScript UI]
WS[WebSocket Client]
end
subgraph Backend["🖥️ FastAPI Backend"]
API[REST API<br/>/api/v1/*]
WSS[WebSocket Server<br/>/ws]
STATIC[Static Files<br/>/static]
end
subgraph Data["📊 Data Sources"]
CONF[OrpheusConfig]
STORE[Storage<br/>Audio files]
HW[Hardware Status<br/>Cameras, Audio]
end
JS <--> API
WS <--> WSS
API --> CONF
API --> STORE
API --> HW
WSS --> CONF
style Browser fill:#e3f2fd
style Backend fill:#e8f5e9
style Data fill:#fff3e0
```
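A minimal sketch of the backend shape in this diagram — one REST route under `/api/v1`, a WebSocket endpoint at `/ws`, and mounted static files. The route payloads and the static directory path are placeholders, not the real dashboard code.

```python
# Sketch only: endpoints and the ./static path are assumptions for illustration.
from fastapi import FastAPI, WebSocket
from fastapi.staticfiles import StaticFiles

app = FastAPI(title="orpheus-dashboard (sketch)")

# Frontend assets served from ./static (path assumed for this sketch)
app.mount("/static", StaticFiles(directory="static"), name="static")


@app.get("/api/v1/health")
async def health() -> dict:
    """Simple REST endpoint a browser poll or a test can hit."""
    return {"status": "ok"}


@app.websocket("/ws")
async def ws_events(ws: WebSocket) -> None:
    """Echo loop standing in for the real event stream pushed to the UI."""
    await ws.accept()
    while True:
        msg = await ws.receive_text()
        await ws.send_text(msg)
```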
```mermaid
graph TD
ROOT[orpheus/]
ROOT --> AUDIO[audio/]
ROOT --> DETECTION[detection/]
ROOT --> SYSTEM[system/]
ROOT --> VIDEO[video/]
AUDIO --> A_EVENTS[motion/events<br/>Audio motion events]
AUDIO --> A_STATUS[motion/status<br/>Channel status]
AUDIO --> A_PLAYBACK[playback/request<br/>Playback requests]
DETECTION --> D_BIRD[bird/events<br/>BirdNET detections]
DETECTION --> D_CROW[crow/events<br/>Crow detections]
SYSTEM --> S_HEALTH[*/health<br/>Agent health]
SYSTEM --> S_CONFIG[config<br/>Config changes]
VIDEO --> V_EVENTS[motion/events<br/>Motion events]
VIDEO --> V_STATUS[motion/status<br/>Camera status]
style ROOT fill:#1565c0,color:#fff
style AUDIO fill:#2196f3,color:#fff
style DETECTION fill:#9c27b0,color:#fff
style SYSTEM fill:#4caf50,color:#fff
style VIDEO fill:#ff9800,color:#fff
```
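For a feel of how a consumer sits on this topic tree, here is a minimal subscriber sketch using paho-mqtt (1.x-style client API assumed). The dispatch logic is illustrative; the agents themselves use the orpheus-common MQTT wrapper rather than raw paho calls.

```python
# Sketch only: 1.x-style paho callbacks; dispatch below is illustrative.
import json

import paho.mqtt.client as mqtt


def on_connect(client, userdata, flags, rc):
    # One wildcard subscription covers audio, video, detection and system topics.
    client.subscribe("orpheus/#")


def on_message(client, userdata, msg):
    payload = json.loads(msg.payload.decode("utf-8"))
    if msg.topic == "orpheus/detection/bird/events":
        print("bird detection:", payload.get("detections"))
    elif msg.topic.startswith("orpheus/system/"):
        print("health/status:", msg.topic, payload)


client = mqtt.Client()
client.on_connect = on_connect
client.on_message = on_message
client.connect("localhost", 1883)   # Mosquitto broker from the Core Services layer
client.loop_forever()
```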
```mermaid
flowchart LR
subgraph Sources["Configuration Sources"]
YAML["config/orpheus.yaml<br/>(defaults)"]
ENV["Environment Variables<br/>ORPHEUS_*"]
DOTENV[".env file<br/>(development)"]
PROD["/opt/orpheus/config/<br/>(production)"]
end
subgraph Singleton["OrpheusConfig"]
LOAD[Load & Merge]
VALIDATE[Pydantic Validation]
CACHE[Singleton Instance]
end
subgraph Consumers["Config Consumers"]
AGENT[Audio Agent]
DASHBOARD[Dashboard]
STORAGE[Storage Manager]
end
YAML --> LOAD
ENV --> LOAD
DOTENV --> LOAD
PROD --> LOAD
LOAD --> VALIDATE
VALIDATE --> CACHE
CACHE --> AGENT
CACHE --> DASHBOARD
CACHE --> STORAGE
style Sources fill:#fff3e0
style Singleton fill:#e8f5e9
style Consumers fill:#e3f2fd
```
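A minimal sketch of the load → merge → validate → cache flow above. The field names, the env-var mapping, and the `lru_cache` singleton are illustrative; the real OrpheusConfig in orpheus-common has a richer schema.

```python
# Sketch only: field names and the ORPHEUS_* mapping are assumptions.
import os
from functools import lru_cache
from pathlib import Path

import yaml
from pydantic import BaseModel


class MqttConfig(BaseModel):
    broker_host: str = "localhost"
    broker_port: int = 1883


class Config(BaseModel):
    mqtt: MqttConfig = MqttConfig()
    storage_base_path: str = "/data/orpheus"


@lru_cache(maxsize=1)                      # singleton: later callers reuse the instance
def load_config(path: str = "config/orpheus.yaml") -> Config:
    data = {}
    if Path(path).exists():                # 1. YAML defaults
        data = yaml.safe_load(Path(path).read_text()) or {}
    host = os.environ.get("ORPHEUS_MQTT_BROKER_HOST")
    if host:                               # 2. ORPHEUS_* environment override
        data.setdefault("mqtt", {})["broker_host"] = host
    return Config(**data)                  # 3. Pydantic validation


if __name__ == "__main__":
    cfg = load_config()
    print(cfg.mqtt.broker_host, cfg.mqtt.broker_port)
```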
```mermaid
graph TB
subgraph Jetson["🖥️ Jetson Orin NX"]
subgraph Systemd["systemd Services"]
S1[orpheus-mqtt.service]
S2[orpheus-dashboard.service]
S3[orpheus-agent-audio-motion.service]
end
subgraph Hardware["Hardware"]
USB[USB Audio Interface]
NET[Network Interface]
end
end
subgraph Network["📡 Network"]
LAN[Local Network]
BROWSER[Web Browser<br/>Dashboard UI]
end
USB --> S3
S3 --> S1
S1 --> S2
S2 --> NET
NET --> LAN
LAN --> BROWSER
style Jetson fill:#e8f5e9
style Network fill:#e3f2fd
```

```
orpheus/
├── platform/
│   └── orpheus-common/                      # Shared platform library
│       ├── src/orpheus_common/
│       │   ├── config.py                    # Configuration management
│       │   ├── mqtt.py                      # MQTT client wrapper
│       │   ├── logging.py                   # Structured logging
│       │   ├── storage/                     # File storage utilities
│       │   ├── hardware/                    # Hardware abstraction
│       │   └── diagnostics/                 # Health monitoring
│       └── tests/
│
├── services/
│   ├── orpheus-mqtt/                        # MQTT message broker (Mosquitto)
│   │   ├── config/
│   │   ├── scripts/
│   │   └── systemd/
│   │
│   └── orpheus-dashboard/                   # Web-based diagnostic UI
│       ├── src/                             # FastAPI backend
│       ├── static/                          # Frontend assets
│       └── systemd/
│
├── agents/
│   ├── orpheus-agent-audio-motion/          # Audio motion detection agent
│   │   ├── src/orpheus_agent_audio_motion/
│   │   │   ├── main.py                      # Agent entrypoint
│   │   │   ├── audio_source.py              # Audio capture backends
│   │   │   ├── detector_algorithm.py        # Detection algorithms
│   │   │   └── channel_processor.py         # Per-channel processing
│   │   └── tests/
│   │
│   ├── orpheus-agent-video-motion/          # Video motion detection agent
│   │   ├── src/orpheus_agent_video_motion/
│   │   │   ├── main.py                      # Agent entrypoint
│   │   │   ├── video_source.py              # RTSP stream capture
│   │   │   └── motion_detector.py           # Motion detection
│   │   └── tests/
│   │
│   ├── orpheus-agent-bird-detection/        # BirdNET species identification
│   │   ├── src/orpheus_agent_bird_detection/
│   │   │   ├── main.py                      # Agent entrypoint
│   │   │   └── birdnet_model.py             # BirdNET ONNX inference
│   │   └── tests/
│   │
│   ├── orpheus-agent-crow-detection/        # Crow vocalization analysis
│   │   ├── src/orpheus_agent_crow_detection/
│   │   │   ├── main.py                      # Agent entrypoint
│   │   │   ├── embedder.py                  # AVES embedder (16kHz)
│   │   │   └── classifier.py                # Multi-task classifier
│   │   └── tests/
│   │
│   ├── orpheus-agent-audio-playback/        # Audio output agent
│   │   ├── src/orpheus_agent_audio_playback/
│   │   │   ├── main.py                      # Agent entrypoint
│   │   │   └── playback.py                  # Audio playback manager
│   │   └── tests/
│   │
│   ├── orpheus-agent-video-snapshotter/     # Periodic camera snapshots
│   │   ├── src/orpheus_agent_video_snapshotter/
│   │   │   ├── main.py                      # Agent entrypoint
│   │   │   └── config.py                    # Configuration loading
│   │   └── tests/
│   │
│   └── orpheus-agent-video-timelapser/      # Timelapse generation
│       ├── src/orpheus_agent_video_timelapser/
│       │   ├── main.py                      # Agent entrypoint
│       │   └── config.py                    # Configuration loading
│       └── tests/
│
├── hardware/                                # Hardware-specific configurations
├── artifacts/                               # ML models, recordings (Git LFS)
├── tools/                                   # Development utilities
└── docs/                                    # Documentation
```

Configuration is managed through a layered system:
- Base Config: `config/orpheus.yaml` (defaults)
- Environment Override: `ORPHEUS_*` environment variables
- Local Override: `.env` files (development)
- Instance Config: `/opt/orpheus/config/orpheus.yaml` (production)
```yaml
# Example orpheus.yaml structure
audio:
  channels:
    - id: mic_1
      device: alsa://orpheus_umc?channel=1
      enabled: true
detection:
  algorithm: adaptive_threshold
  threshold_db: -40.0
storage:
  base_path: /data/orpheus
  retain_days: 30
mqtt:
  broker_host: localhost
  broker_port: 1883
```

The Orpheus platform uses a layered agent architecture where Layer 1 agents detect motion/activity and Layer 2 agents perform specialized analysis.
**Audio Motion Agent** (`orpheus-agent-audio-motion`)

- Purpose: Detect audio activity above threshold across 4 microphone channels
- Input: Raw audio from Behringer UMC404HD (48kHz, 4 channels)
- Output: Audio clips + motion events via `orpheus/audio/motion/events`
- Algorithm: Fixed or adaptive threshold detection with pre-buffering (see the sketch below)
- Storage: FLAC audio clips in `/data/orpheus/audio/audio_motion/{channel}/`
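A rough sketch of the adaptive-threshold idea referenced above: track a slow-moving noise-floor estimate in dB and flag frames that rise well above it. The class name, margin, and smoothing constants are illustrative, not the real detector_algorithm.py.

```python
# Illustrative sketch; constants and names are not the real detector.
import numpy as np


class AdaptiveThresholdDetector:
    def __init__(self, margin_db: float = 12.0, alpha: float = 0.05):
        self.margin_db = margin_db        # how far above the floor counts as activity
        self.alpha = alpha                # smoothing factor for the noise-floor EMA
        self.noise_floor_db = -60.0

    @staticmethod
    def _level_db(frame: np.ndarray) -> float:
        rms = float(np.sqrt(np.mean(np.square(frame)))) + 1e-10
        return float(20.0 * np.log10(rms))

    def process(self, frame: np.ndarray) -> bool:
        level = self._level_db(frame)
        active = level > self.noise_floor_db + self.margin_db
        if not active:
            # Only quiet frames update the floor, so loud bursts don't drag it upward.
            self.noise_floor_db += self.alpha * (level - self.noise_floor_db)
        return active


if __name__ == "__main__":
    det = AdaptiveThresholdDetector()
    quiet = np.random.randn(4800).astype(np.float32) * 0.001
    loud = np.random.randn(4800).astype(np.float32) * 0.2
    print([det.process(quiet) for _ in range(10)], det.process(loud))
```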
**Video Motion Agent** (`orpheus-agent-video-motion`)

- Purpose: Detect visual motion in camera feeds
- Input: RTSP streams from 4 Amcrest IP cameras
- Output: Motion events + video clips via `orpheus/video/motion/events`
- Algorithm: Frame differencing with configurable sensitivity (see the sketch below)
- Storage: MP4 video clips in `/data/orpheus/video/video_motion/{camera}/`
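A bare-bones sketch of frame differencing with OpenCV, as referenced in the Algorithm bullet above. The RTSP URL, blur kernel, and sensitivity threshold are placeholders; the real agent splits this logic across video_source.py and motion_detector.py.

```python
# Sketch only: URL and thresholds are placeholders.
import cv2


def to_gray(frame):
    """Grayscale + blur so small sensor noise doesn't register as motion."""
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    return cv2.GaussianBlur(gray, (21, 21), 0)


cap = cv2.VideoCapture("rtsp://camera-host/stream")   # placeholder RTSP URL
ok, frame = cap.read()
prev = to_gray(frame) if ok else None

while ok:
    ok, frame = cap.read()
    if not ok:
        break
    cur = to_gray(frame)
    diff = cv2.absdiff(prev, cur)                      # per-pixel change vs. previous frame
    _, mask = cv2.threshold(diff, 25, 255, cv2.THRESH_BINARY)
    if cv2.countNonZero(mask) > 0.01 * mask.size:      # sensitivity knob: 1% of pixels changed
        print("motion detected")
    prev = cur

cap.release()
```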
**Bird Detection Agent** (`orpheus-agent-bird-detection`)

- Purpose: Identify bird species from audio clips
- Model: BirdNET ONNX (species classification); a rough inference sketch follows the data format below
- Input: Audio motion events from `orpheus/audio/motion/events`
- Output: Species detections via `orpheus/detection/bird/events`
- Data Format:

  ```json
  {
    "event_id": "bird_det_...",
    "timestamp": "2025-12-05T22:35:58Z",
    "channel_id": "1",
    "detections": [
      {
        "species_code": "amecro",
        "species_common": "American Crow",
        "confidence": 0.92,
        "start_time": 1.2,
        "end_time": 3.5
      }
    ],
    "audio_clip_path": "/data/orpheus/audio/..."
  }
  ```
**Crow Detection Agent** (`orpheus-agent-crow-detection`)

- Purpose: Detailed crow vocalization analysis (species, call type, quality)
- Models:
  - AVES embedder (aves-base-bio.pt): 16kHz audio → 768-dim embeddings
  - Multi-task classifier (mt_70.pt): species, call type, quality prediction
- Input: Audio motion events from `orpheus/audio/motion/events`
- Processing: Resample 48kHz → 16kHz, extract embeddings, classify (see the sketch below)
- Output: Crow detections via `orpheus/detection/crow/events`
- Data Format:

  ```json
  {
    "event_id": "crow_det_...",
    "timestamp": "2025-12-05T22:35:58Z",
    "channel_id": "1",
    "detection": {
      "species": "american_crow",
      "call_type": "caw",
      "quality_score": 0.87
    },
    "audio_clip_path": "/data/orpheus/audio/...",
    "inference_time_ms": 145
  }
  ```

- Storage: Detections stored in DetectionDB (SQLite) at `/data/orpheus/detections/`
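The Processing bullet above describes a three-stage chain. The sketch below shows its shape with trivial stand-ins for the embedder and classifier so it runs without the model weights; the real agent loads aves-base-bio.pt and mt_70.pt instead.

```python
# Sketch only: embed() and classify() are placeholders, not the real models.
import numpy as np
from scipy.signal import resample_poly


def embed(audio_16k: np.ndarray) -> np.ndarray:
    """Placeholder for the AVES embedder: the real output is a 768-dim vector."""
    return np.full(768, audio_16k.std(), dtype=np.float32)


def classify(embedding: np.ndarray) -> dict:
    """Placeholder for the multi-task classifier (species / call type / quality)."""
    return {"species": "american_crow", "call_type": "caw", "quality_score": 0.5}


clip_48k = np.random.randn(5 * 48_000).astype(np.float32)   # 5 s stand-in clip
clip_16k = resample_poly(clip_48k, up=1, down=3)             # 48 kHz → 16 kHz
detection = classify(embed(clip_16k))
print(detection)
```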
**Audio Playback Agent** (`orpheus-agent-audio-playback`)

- Purpose: Play audio responses via system speakers
- Input: Playback requests from `orpheus/audio/playback/request`
- Output: Audio via default ALSA output device
- Features: Sound registry, repeat counts, pause between repeats (see the sketch below)
- Use Cases: Wildlife callbacks, alert sounds, test signals
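A minimal sketch of how a playback request might be handled: look the sound up in a registry and play it N times with a pause between repeats. The request payload shape and registry entries are assumptions, and playback here shells out to the ALSA `aplay` utility rather than using the agent's playback manager.

```python
# Sketch only: payload fields, registry contents, and aplay usage are assumptions.
import subprocess
import time

SOUND_REGISTRY = {                      # assumed registry: name -> file on disk
    "crow_caw": "/data/orpheus/sounds/crow_caw.wav",
    "test_tone": "/data/orpheus/sounds/test_tone.wav",
}


def handle_request(request: dict) -> None:
    path = SOUND_REGISTRY[request["sound"]]
    repeats = int(request.get("repeat_count", 1))
    pause_s = float(request.get("pause_seconds", 0.5))
    for i in range(repeats):
        subprocess.run(["aplay", path], check=True)   # default ALSA output device
        if i < repeats - 1:
            time.sleep(pause_s)


handle_request({"sound": "test_tone", "repeat_count": 2, "pause_seconds": 1.0})
```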
These agents handle video data capture and processing, independent of the motion detection layer.
**Video Snapshotter Agent** (`orpheus-agent-video-snapshotter`)

- Purpose: Capture periodic still images from IP cameras
- Input: RTSP streams from configured cameras
- Output: JPEG files in `/data/orpheus/video/snapshots/{YYYY.MM.DD}/`
- Configuration: Per-camera intervals (e.g., `5m`, `10m`)
- Design: On-demand RTSP connections minimize resource usage (see the sketch below)
- See: ADR 0002: Video Snapshot Architecture
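A minimal sketch of the on-demand pattern in the Design bullet: connect to the camera only when a snapshot is due, grab one frame, write a JPEG into a dated folder, and disconnect. The URL, interval, and filename layout are placeholders standing in for the per-camera configuration.

```python
# Sketch only: URL, interval, and filename layout are placeholders.
import time
from datetime import datetime
from pathlib import Path

import cv2

CAMERA_URL = "rtsp://camera-host/stream"     # placeholder
INTERVAL_S = 5 * 60                          # e.g. a "5m" per-camera interval

while True:
    cap = cv2.VideoCapture(CAMERA_URL)       # connect on demand...
    ok, frame = cap.read()
    cap.release()                            # ...and release straight away
    if ok:
        day_dir = Path("/data/orpheus/video/snapshots") / datetime.now().strftime("%Y.%m.%d")
        day_dir.mkdir(parents=True, exist_ok=True)
        out = day_dir / f"camera1.{datetime.now().strftime('%H%M%S')}.jpg"
        cv2.imwrite(str(out), frame)
    time.sleep(INTERVAL_S)
```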
**Video Timelapser Agent** (`orpheus-agent-video-timelapser`)

- Purpose: Generate timelapse videos from snapshots
- Input: JPEG snapshots from the snapshotter
- Output: H.264 MP4 videos in `/data/orpheus/video/timelapses/{YYYY.MM.DD}/`
- Tiers: Multiple lookback windows (24h, 12h, 6h, 1h, 30m, 10m)
- Encoding: mp4v + ffmpeg transcode for Jetson compatibility (see the sketch below)
- Filename Format: `{camera}.{label}.{tier}.{lookback}.{timestamp}.mp4`
- See: ADR 0003: Timelapse Generation Architecture
- See: ADR 0004: Jetson Video Codec Strategy
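A simplified sketch of the two-step encoding noted above: OpenCV writes an mp4v file from the snapshots, then ffmpeg transcodes it. The paths and frame rate are placeholders, and the plain libx264 transcode stands in for whatever encoder ADR 0004 actually selects on the Jetson.

```python
# Sketch only: assumes snapshots exist; paths, fps, and encoder are placeholders.
import subprocess
from pathlib import Path

import cv2

snapshots = sorted(Path("/data/orpheus/video/snapshots/2025.12.05").glob("*.jpg"))
first = cv2.imread(str(snapshots[0]))
height, width = first.shape[:2]

raw_path = "/tmp/timelapse_mp4v.mp4"
writer = cv2.VideoWriter(raw_path, cv2.VideoWriter_fourcc(*"mp4v"), 24, (width, height))
for path in snapshots:
    frame = cv2.imread(str(path))
    if frame is not None and frame.shape[:2] == (height, width):
        writer.write(frame)
writer.release()

# Generic software H.264 transcode; the Jetson build may pick a different encoder.
subprocess.run(
    ["ffmpeg", "-y", "-i", raw_path, "-c:v", "libx264", "/tmp/timelapse.mp4"],
    check=True,
)
```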

```mermaid
graph LR
subgraph Dev["💻 Development (Mac)"]
CODE[Write Code]
TEST[make test]
LINT[make lint]
FMT[make format]
end
subgraph CI["🔄 CI/CD"]
PR[Pull Request]
GHA[GitHub Actions]
COV[Coverage Check]
end
subgraph Prod["🚀 Production (Jetson)"]
DEPLOY[make services-install]
START[make services-start]
LOGS[make service-logs]
end
CODE --> TEST
TEST --> LINT
LINT --> FMT
FMT --> PR
PR --> GHA
GHA --> COV
COV --> DEPLOY
DEPLOY --> START
START --> LOGS
style Dev fill:#e3f2fd
style CI fill:#fff3e0
style Prod fill:#e8f5e9
```
- Unit Tests: Per-component with pytest
- Coverage Target: 70% minimum (most components); orpheus-common 78%, dashboard 80%, audio-motion 72%
- CI/CD: GitHub Actions on push/PR
- Platform Tests: Separate workflows for ARM validation

```mermaid
graph TB
subgraph Current["✅ Current"]
AUDIO_NOW[Audio Detection]
BIRDNET[BirdNET Integration<br/>Species ID]
DASH_NOW[Dashboard]
MQTT_NOW[MQTT Broker]
end
subgraph Planned["🔮 Planned"]
YOLO[YOLOv8 Video<br/>Object Detection]
ACTIVE[Active Inference<br/>Playback Response]
MULTI[Multi-Station<br/>Distributed Sensors]
SPATIAL[Spatial Web<br/>GIS Integration]
end
AUDIO_NOW --> BIRDNET
DASH_NOW --> SPATIAL
MQTT_NOW --> MULTI
BIRDNET --> ACTIVE
YOLO --> ACTIVE
style Current fill:#e8f5e9
style Planned fill:#fff3e0
```
When contributing, ensure:
- Changes work on Python 3.9.5 (Jetson constraint)
- No dependencies incompatible with ARM architecture
- Tests pass with `make test`
- Code is linted with `ruff`
- Documentation is updated
See CONTRIBUTING.md for detailed guidelines.