Agent Instructions

Before running the project, you need to set the required environment variables. These are defined in common/global_config.py.

Create a .env file in the root of the project and add the environment variables defined in common/global_config.py. You can find the required keys as fields in the Config class (any field with type str that looks like an API key).

Before submitting any PR, it should always run make ci first so that it can see the CI outputs and fix any issues that come up

Agent Instructions

This document provides instructions for you, the AI agent, on how to work with this codebase. Please follow these guidelines carefully.

Coding Style

Variable Naming: Use snake_case for all function, file, and directory names. Use CamelCase for class names. Use lowercase for variable names and ALL_CAPS for constants.
Indentation: Use 4 spaces for indentation.
Strings: Use double quotes for strings.

Global Configuration

This project uses a centralized system for managing global configuration, including hyperparameters and secrets. The configuration is powered by pydantic-settings, which provides automatic validation and type checking.

Configuration Files:

common/global_config.yaml - Base configuration values
common/config_models.py - Pydantic models defining the structure and validation
common/global_config.py - Main Config class using BaseSettings
.env - Environment variables and secrets (git-ignored)

Dependency Management

Never use uv pip. Instead, run uv --help to see the available commands for dependency management.

Hyperparameters: Add any hyperparameters that apply across the entire codebase to common/global_config.yaml. Do not define them as constants in the code. Examples include MAX_RETRIES and MODEL_NAME. If you need to add a new hyperparameter with a nested structure, define the corresponding Pydantic model in common/config_models.py first.
Secrets: Store private keys and other secrets in a .env file in the root of the project. These will be loaded automatically. Examples include OPENAI_API_KEY and GITHUB_PERSONAL_ACCESS_TOKEN. These are defined as required fields in the Config class in common/global_config.py.

You can access configuration values in your Python code like this:

from common import global_config

# Access non-secret values
print(global_config.example_parent.example_child)

# Access secret values
print(global_config.OPENAI_API_KEY)

Logging

This project uses a centralized logging configuration with loguru.

Setup: Always import and call the setup function from src/utils/logging_config.py at the beginning of your file.
Usage: Use the imported log object to log messages.

from loguru import logger as log
from src/utils/logging_config import setup_logging

# Set up logging at the start of your file
setup_logging()

# Use the logger as needed
log.info("This is an info message.")
log.error("This is an error message.")
log.debug("This is a debug message.")

Configuration: Never configure logging directly in your files. The log levels are controlled by common/global_config.yaml.

LLM Inference with DSPY

For all LLM inference tasks, you must use the DSPYInference module. This module handles both standard inference and tool-use and is integrated with our observability tools.

from utils.llm.dspy_inference import DSPYInference
import dspy
import asyncio

class ExtractInfo(dspy.Signature):
    """Extract structured information from text."""
    text: str = dspy.InputField()
    title: str = dspy.OutputField()
    headings: list[str] = dspy.OutputField()
    entities: list[dict[str, str]] = dspy.OutputField(desc="a list of entities and their metadata")

def web_search_tool(query: str) -> str:
    """Search the web for information."""
    return "example search term"

# Inference without tool-use
inf_module = DSPYInference(pred_signature=ExtractInfo)

# Inference with tool-use
inf_module_with_tool_use = DSPYInference(
    pred_signature=ExtractInfo,
    tools=[web_search_tool],
)

result = asyncio.run(inf_module.run(
    text="Apple Inc. announced its latest iPhone 14 today. The CEO, Tim Cook, highlighted its new features in a press release."
))

print(result.title)
print(result.headings)
print(result.entities)

LLM Observability with LangFuse

To ensure we can monitor the behavior of our LLMs, you must use LangFuse for observability.

Usage: Use the @observe decorator for functions that contain LLM calls. If you need a more descriptive name for the observation span, use langfuse_context.update_current_observation.

from langfuse.decorators import observe, langfuse_context

@observe
def function_name(...):
    # To give the span a more descriptive name, update the observation
    langfuse_context.update_current_observation(name=f"some-descriptive-name")

Long-Running Code

For any code that is expected to run for a long time, you must follow this pattern to ensure it is resumable, reproducible, and parallelizable.

Structure: Break down long-running processes into init(), continue(id), and cleanup(id) functions.
State: Always checkpoint the state and resume using an id. Do not pass any other parameters. This forces the state to be serializable. Use descriptive names for the id, like runId or taskId.
System Boundaries: When calling external services (like microservices or LLM APIs), you must implement rate limiting, timeouts, retries, and log tracing.
Output Formatting: Keep data in a structured format until the very end of the process. Do not format output (e.g., with f-strings) until it is ready to be presented to the user.

Testing

You are required to write tests for new features.

Framework: Use pytest for all tests.
Location: Add new tests to the tests/ directory. If you create a new subdirectory, make sure to add a __init__.py file to it.
Structure: Inherit from TestTemplate for all test classes. Use self.config for test-specific configuration.

import pytest
from tests.test_template import TestTemplate, slow_test, nondeterministic_test

class TestMyFeature(TestTemplate):
    @pytest.fixture(autouse=True)
    def setup_shared_variables(self, setup):
        # Initialize any shared attributes here
        pass

    # Use decorators for slow or nondeterministic tests
    @slow_test
    def test_my_function(self):
        # Your test code here
        assert True

No unittest: Do not use the unittest framework.

Type Hinting

Use Built-ins: For type hinting, use the built-in collection types (e.g., list, tuple, dict) directly instead of importing List, Tuple, and Dict from the typing module. This is standard for Python 3.9 and later.

GitHub Actions

Authentication: When writing GitHub Actions workflows, use the built-in secrets.GITHUB_TOKEN for authentication whenever possible. This token is automatically generated for each workflow run and has its permissions scoped to the repository. Only use a personal access token (PAT) if you require special privileges that the default token does not provide.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Agent Instructions

Agent Instructions

Coding Style

Global Configuration

Dependency Management

Logging

LLM Inference with DSPY

LLM Observability with LangFuse

Long-Running Code

Testing

Type Hinting

GitHub Actions

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

Agent Instructions

Agent Instructions

Coding Style

Global Configuration

Dependency Management

Logging

LLM Inference with DSPY

LLM Observability with LangFuse

Long-Running Code

Testing

Type Hinting

GitHub Actions