Agentic API is a sophisticated, asynchronous AI agent system built with FastAPI that intelligently routes and processes tasks between specialized agents. The system automatically determines whether a task requires code generation or content research, then orchestrates the execution through a robust queue-based architecture with comprehensive logging and monitoring.
- 🤖 Intelligent Agent Routing: PeerAgent automatically determines the best agent (Code vs Content) for each task
- 🔄 Asynchronous Processing: Celery workers with RabbitMQ for scalable, non-blocking task execution
- 📝 Code Generation: Structured code output with language detection and explanations
- 🔍 Content Research: Web-based content aggregation with whitelist validation and source normalization
- 🔐 Enterprise Security: JWT authentication, rate limiting, and idempotency controls
- 📊 Comprehensive Logging: End-to-end request tracking with MongoDB-based log events
- 🚀 Scalable Architecture: Horizontally scalable API and worker nodes
- 🔄 Idempotency: Prevents duplicate task execution with hash-based detection (see the sketch after this list)
- 🌐 RESTful API: Clean, documented endpoints with proper HTTP status codes
- 📈 Real-time Progress: Job status tracking with progress indicators and detailed error reporting
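For the hash-based idempotency mentioned above, here is a minimal sketch of what duplicate detection can look like, assuming `pymongo` and hypothetical collection and field names (`jobs`, `idempotency_key`); the actual implementation may differ:

```python
import hashlib
import json

from pymongo import MongoClient

# Hypothetical connection and collection names; the real service wires these via settings.
jobs = MongoClient("mongodb://localhost:27017")["agentic"]["jobs"]

def idempotency_key(payload: dict) -> str:
    """Derive a deterministic hash from the canonicalized task payload."""
    canonical = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

def find_or_create_job(payload: dict) -> dict:
    """Reuse an existing job for an identical payload instead of executing it twice."""
    key = idempotency_key(payload)
    existing = jobs.find_one({"idempotency_key": key})
    if existing:
        return existing  # duplicate request: return the original job
    job = {"idempotency_key": key, "status": "queued", "payload": payload}
    job["_id"] = jobs.insert_one(job).inserted_id
    return job
```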
- Backend Framework: FastAPI (Python 3.10+)
- Task Queue: Celery with RabbitMQ
- Primary Database: MongoDB (jobs, log_events)
- Secondary Database: PostgreSQL (optional, Celery results)
- Message Broker: RabbitMQ
- Caching & Rate Limiting: Redis
- Authentication: JWT
- AI/LLM: OpenAI API
- Containerization: Docker & Docker Compose
- Task Orchestration: Custom JobsOrchestrator
- Agent System: PeerAgent, CodeAgent, ContentAgent
The system uses different LLM models for each agent type, optimized for their specific roles. Model names are configured via environment variables:
- `LLM_MODEL_ROUTER`: used for PeerAgent routing decisions
- `LLM_MODEL_CONTENT`: used for ContentAgent research and content generation
- `LLM_MODEL_CODE`: used for CodeAgent code generation
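A minimal sketch of reading these variables at startup (the default model names are illustrative assumptions, not the project's actual defaults):

```python
import os
from dataclasses import dataclass, field

@dataclass(frozen=True)
class LLMSettings:
    """Per-agent model names resolved from environment variables."""
    router_model: str = field(default_factory=lambda: os.getenv("LLM_MODEL_ROUTER", "gpt-4o-mini"))
    content_model: str = field(default_factory=lambda: os.getenv("LLM_MODEL_CONTENT", "gpt-4o"))
    code_model: str = field(default_factory=lambda: os.getenv("LLM_MODEL_CODE", "gpt-4o"))

settings = LLMSettings()
```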
Router (PeerAgent): Fast & Cost-Effective
- Model: "Mini" class chat model (e.g., a `gpt-4o-mini` equivalent)
- Why: Only handles routing decisions with short prompts
- Parameters: `temperature` ≈ 0.0–0.2, `max_tokens` ≈ 64
- Goal: Low latency and minimal cost for simple classification tasks
ContentAgent: Quality & Accuracy
- Model: "Mid-to-high" quality general model (e.g., a `gpt-4o` equivalent)
- Why: Fluent responses with source integration and factual accuracy
- Parameters: `temperature` ≈ 0.3–0.5, `max_tokens` sized to the task
- Goal: High-quality content with proper source attribution
CodeAgent: Code-Centric Capability
- Model: Code-focused powerful model (e.g., code-specialized variants)
- Why: Correct syntax, longer context handling, code generation expertise
- Parameters: `temperature` ≈ 0.2–0.3, `presence_penalty` = 0, `top_p` ≈ 0.9
- Goal: Accurate, well-structured code output with explanations
- Cost Efficiency: Using expensive models for simple routing tasks is wasteful
- Specialized Performance: Content and code generation require different strengths; independent model selection improves quality
- Flexibility: All models are configurable via environment variables, which supports cloud, hybrid, or on-premise model deployments
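A minimal sketch of how these per-agent profiles can map onto OpenAI chat-completion calls, reusing the `LLMSettings` sketch above (the exact prompts, parameter values, and helper name are illustrative assumptions, not the project's actual call sites):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Illustrative per-agent parameter profiles mirroring the guidelines above.
AGENT_PARAMS = {
    "router":  {"model": settings.router_model,  "temperature": 0.1, "max_tokens": 64},
    "content": {"model": settings.content_model, "temperature": 0.4, "max_tokens": 1024},
    "code":    {"model": settings.code_model,    "temperature": 0.2, "top_p": 0.9},
}

def complete(agent: str, system_prompt: str, user_prompt: str) -> str:
    """Run one chat completion using the given agent's parameter profile."""
    response = client.chat.completions.create(
        messages=[
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_prompt},
        ],
        **AGENT_PARAMS[agent],
    )
    return response.choices[0].message.content
```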
For visual representations of the system architecture and sequence flows, please refer to the following diagrams:
Based on the system architecture image, here's the Mermaid representation showing the complete flow from client request to result delivery. Why Mermaid? We chose Mermaid for its text-based format, making it easy to update, version control, and maintain. Changes can be made directly in the code, ensuring the documentation stays synchronized with the codebase.
```mermaid
graph TB
%% Client Layer
Client[Client<br/>JWT + Idempotency] --> FastAPI
%% API & Orchestration Layer
subgraph "API & Orchestration"
Redis[Redis<br/>rate limit]
FastAPI[FastAPI<br/>auth, execute, jobs]
Orchestrator[JobsOrchestrator<br/>create job + log]
end
%% Queue & Worker Layer
subgraph "Queue & Worker"
RabbitMQ[RabbitMQ<br/>queue]
Worker[Celery Worker<br/>run_agent_task]
end
%% Routing & Agents Layer
subgraph "Routing & Agents"
PeerAgent[PeerAgent<br/>route decision]
CodeAgent[CodeAgent]
ContentAgent[ContentAgent]
end
%% Data & Logging Layer
subgraph "Data & Logging"
MongoDB[(MongoDB<br/>jobs, logs)]
LoggingPipeline[Logging<br/>Pipeline]
end
%% Delivery Layer
subgraph "Delivery"
Polling[Polling<br/>GET /jobs/id]
Webhook[Webhook<br/>POST result]
end
%% Main Flow - Solid Lines
Redis -->|rate limit| FastAPI
FastAPI --> Orchestrator
Orchestrator --> MongoDB
Orchestrator --> RabbitMQ
RabbitMQ --> Worker
Worker --> PeerAgent
PeerAgent --> CodeAgent
PeerAgent --> ContentAgent
CodeAgent --> MongoDB
ContentAgent --> MongoDB
MongoDB --> Polling
%% Immediate Response Flow
FastAPI -->|immediate| ImmediateResponse[202 Accepted<br/>+ Location]
%% Logging Flow - Dashed Lines
Orchestrator -.->|first log| LoggingPipeline
Worker -.->|logs| LoggingPipeline
CodeAgent -.->|logs| LoggingPipeline
ContentAgent -.->|logs| LoggingPipeline
LoggingPipeline -.-> MongoDB
%% Optional Webhook Flow
MongoDB -.->|result| Webhook
%% Styling
classDef clientLayer fill:#ffffff,stroke:#000000,stroke-width:2px,color:#000000
classDef apiLayer fill:#e6f3ff,stroke:#0066cc,stroke-width:2px,color:#000000
classDef queueLayer fill:#f0e6ff,stroke:#6600cc,stroke-width:2px,color:#000000
classDef agentLayer fill:#e6ffe6,stroke:#006600,stroke-width:2px,color:#000000
classDef dataLayer fill:#ffffe6,stroke:#cccc00,stroke-width:2px,color:#000000
classDef deliveryLayer fill:#ffe6e6,stroke:#cc0000,stroke-width:2px,color:#000000
classDef responseLayer fill:#f0f0f0,stroke:#666666,stroke-width:2px,color:#333333
class Client clientLayer
class Redis,FastAPI,Orchestrator apiLayer
class RabbitMQ,Worker queueLayer
class PeerAgent,CodeAgent,ContentAgent agentLayer
class MongoDB,LoggingPipeline dataLayer
class Polling,Webhook deliveryLayer
class ImmediateResponse responseLayer
```
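A minimal sketch of the enqueue-and-respond step shown in the diagram: the endpoint path, task name (`run_agent_task`), and `Location` header follow the diagram's labels, while the request model, broker URL, and job fields are illustrative assumptions. It reuses `find_or_create_job` from the idempotency sketch above.

```python
from celery import Celery
from fastapi import FastAPI, Response, status
from pydantic import BaseModel

app = FastAPI()
celery_app = Celery("agentic", broker="amqp://guest:guest@localhost:5672//")

class ExecuteRequest(BaseModel):
    task: str

@app.post("/execute", status_code=status.HTTP_202_ACCEPTED)
def execute(request: ExecuteRequest, response: Response) -> dict:
    """Create (or reuse) a job, enqueue it for the worker, and answer immediately."""
    job = find_or_create_job({"task": request.task})
    celery_app.send_task("run_agent_task", args=[str(job["_id"])])
    response.headers["Location"] = f"/jobs/{job['_id']}"
    return {"job_id": str(job["_id"]), "status": job["status"]}
```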
MongoDB-based Logging: We chose MongoDB for logging due to its flexible schema, excellent write performance, and natural fit for event-driven architectures. The `log_events` collection captures every critical step of the task lifecycle, enabling comprehensive debugging and monitoring.
Event-Driven Logging: Each significant operation (request received, agent decision, execution start/finish, progress updates) generates structured log events. This approach provides:
- Complete request traceability
- Performance bottleneck identification
- Error root cause analysis
- Operational insights and metrics
Decoupled Logging: Logs are written asynchronously without blocking the main execution flow, ensuring high performance while maintaining comprehensive observability.
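A minimal sketch of one such event write, shown synchronously for brevity while the real pipeline writes off the hot path; the collection name follows the text above, the field names and helper are assumptions:

```python
from datetime import datetime, timezone

from pymongo import MongoClient

# Hypothetical connection; in the service this would come from shared configuration.
log_events = MongoClient("mongodb://localhost:27017")["agentic"]["log_events"]

def log_event(job_id: str, event: str, **details) -> None:
    """Append one structured event to a job's trace without affecting the main flow."""
    doc = {
        "job_id": job_id,
        "event": event,  # e.g. "request_received", "agent_decision", "execution_finished"
        "details": details,
        "ts": datetime.now(timezone.utc),
    }
    try:
        log_events.insert_one(doc)
    except Exception:
        # A failed log write must never break task execution.
        pass
```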
- Python 3.10+
- Docker & Docker Compose
- MongoDB instance
- RabbitMQ instance
- Redis instance
- OpenAI API key
- SerpAPI API key
For detailed Docker setup instructions, please refer to our Docker Setup Guide.
- Horizontal scaling for both API and worker nodes
- MongoDB replica set for high availability
- RabbitMQ clustering for message broker redundancy
- Redis cluster for distributed rate limiting
- Proper monitoring and alerting setup
For comprehensive API documentation and usage examples, please refer to our API Usage Guide.
- Implement comprehensive test coverage
- Add more specialized agents (Data Analysis, Translation, etc.)
- Create CI/CD pipelines with automated testing
- Add performance benchmarks and load testing
- Add advanced monitoring with Prometheus & Grafana
- Implement webhook system for external integrations
- Add agent performance analytics and optimization
- Implement production deployment guide
- Add query parameters support for jobs listing endpoint (status, agent, limit, offset filtering)
We welcome contributions! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.
This project is licensed under the MIT License - see the LICENSE file for details.