sygaldry

Developer Tools for Applied AI Engineering

sygaldry provides essential tooling for AI engineers and researchers building production systems. Our open-source projects focus on practical solutions to real challenges faced when integrating AI into applications and workflows.

Mission

We build tools that bridge the gap between AI research and production deployment. Every project emerges from hands-on experience building AI systems at scale, addressing pain points we've encountered firsthand.

Philosophy

Good tools should be:

Practical - Solve real problems developers face daily
Composable - Work seamlessly with existing toolchains
Performant - Optimized for production workloads
Transparent - Clear internals, no black boxes

Projects

Our repositories contain tools and libraries focused on:

Efficient model integration patterns
Production-ready inference pipelines
Workflow orchestration for AI systems
Performance optimization utilities
Development workflow enhancements

Each project includes comprehensive documentation, examples, and benchmarks to help you evaluate and integrate our tools into your stack.

Why sygaldry

Building AI applications with Mirascope often involves writing the same patterns repeatedly - search tools, web scrapers, document parsers, and agent architectures. sygaldry provides:

Production-Ready Mirascope Components - Battle-tested agents, tools, and models built with Mirascope best practices
Copy & Customize - Not a dependency, but code you own and can modify
Mirascope Native - Built specifically for Mirascope with proper decorators, response models, and async patterns
Smart CLI - Intelligently places components based on your project configuration
Provider Agnostic - Works with OpenAI, Anthropic, Google, Mistral, and any Mirascope-supported provider
Built-in Observability - Optional Lilypad integration for tracing and monitoring

How sygaldry Works

sygaldry uses a three-tier configuration system:

sygaldry.json - Your project configuration that tells the CLI where to place components
component.json - Each component's metadata, dependencies, and structure
sygaldry.md - Component documentation that becomes part of your codebase

When you run sygaldry add <component>, the CLI:

Reads your sygaldry.json to understand your project structure
Fetches the component's component.json to know what files to copy
Places files in the correct directories based on component type
Installs any required dependencies
Applies any customizations (provider, model, Lilypad integration)

📖 See the complete configuration flow example to understand how all pieces work together.

Quick Start

Installation

# Install the CLI
pip install sygaldry-cli

# Or with uv (recommended)
uv pip install sygaldry-cli

Initialize Your Project

# Create a sygaldry configuration file
sygaldry init

# This creates a sygaldry.json with your project structure

Your sygaldry.json might look like:

{
  "$schema": "./sygaldry.schema.json",
  "agentDirectory": "src/agents",
  "toolDirectory": "src/tools",
  "promptTemplateDirectory": "src/prompts",
  "responseModelDirectory": "src/models",
  "evalDirectory": "src/evals",
  "aliases": {
    "agents": "@/agents",
    "tools": "@/tools",
    "prompts": "@/prompts"
  },
  "defaultProvider": "openai",
  "defaultModel": "gpt-4o-mini",
  "stream": false
}

Add Components

# Add a PDF search tool to src/tools/pdf_search/
sygaldry add pdf_search_tool

# Add a web search agent with custom provider to src/agents/web_search/
sygaldry add web_search_agent --provider anthropic --model claude-3-opus

# Add with observability to src/agents/research_assistant/
sygaldry add research_assistant_agent --with-lilypad

# Add from a URL
sygaldry add https://your-registry.com/components/custom_agent.json

Component Architecture

Directory Structure

Components are organized by type, with each component in its own directory:

your_project/
├── sygaldry.json                 # Your project configuration
├── src/
│   ├── agents/               # Agent components (from agentDirectory)
│   │   ├── research_assistant/
│   │   │   ├── __init__.py
│   │   │   ├── agent.py     # Main agent implementation
│   │   │   └── sygaldry.md     # Component documentation
│   │   └── web_search/
│   │       ├── __init__.py
│   │       ├── agent.py
│   │       └── sygaldry.md
│   ├── tools/                # Tool components (from toolDirectory)
│   │   ├── pdf_search/
│   │   │   ├── __init__.py
│   │   │   ├── tool.py      # Main tool implementation
│   │   │   └── sygaldry.md
│   │   └── csv_search/
│   │       ├── __init__.py
│   │       ├── tool.py
│   │       └── sygaldry.md
│   ├── prompts/              # Prompt templates (from promptTemplateDirectory)
│   │   └── summarization/
│   │       ├── __init__.py
│   │       ├── prompt.py
│   │       └── sygaldry.md
│   └── models/               # Response models (from responseModelDirectory)
│       └── research_output/
│           ├── __init__.py
│           ├── model.py
│           └── sygaldry.md

Component Configuration

Each component includes a component.json that defines:

{
  "$schema": "https://sygaldry.ai/schemas/component.json",
  "name": "pdf_search",
  "version": "0.1.0",
  "type": "tool",  // Determines which directory to use
  "description": "Search within PDF documents using fuzzy matching",
  "files_to_copy": [
    {
      "source": "tool.py",
      "destination": "tool.py",
      "type": "module"
    },
    {
      "source": "__init__.py",
      "destination": "__init__.py",
      "type": "init_file"
    }
  ],
  "target_directory_key": "toolDirectory",  // Which sygaldry.json key to use
  "python_dependencies": [
    {"name": "PyPDF2", "version": ">=3.0.0"},
    {"name": "fuzzywuzzy", "version": ">=0.18.0"}
  ],
  "supports_lilypad": true,
  "template_variables": {
    "provider": "{{provider}}",
    "model": "{{model}}"
  }
}

Mirascope Integration

All components follow Mirascope best practices:

# Example tool component (functional pattern)
def search_pdf_content(file_path: str, query: str) -> str:
    """Search within a PDF document for the given query."""
    # Tool implementation
    with open(file_path, 'rb') as f:
        # PDF processing logic
        return f"Found {query} in {file_path}"

# Example agent using the tool
@llm.call(provider="{{provider}}", model="{{model}}", tools=[search_pdf_content])
@prompt_template("""
Analyze the following research topic: {topic}
Use the search tool to find relevant information in the provided PDFs.
""")
async def research_topic(topic: str, pdf_files: list[str]) -> str:
    """Research a topic using PDF search tools."""
    ...

# The agent can now call the tool when needed
response = await research_topic("machine learning", ["paper1.pdf", "paper2.pdf"])
if tool := response.tool:
    result = tool.call()  # Executes search_pdf_content with LLM-provided args

Available Components

Components are named with type suffixes in the registry to prevent conflicts, but installed in clean directories:

Agents

Registry name → Installation directory

academic_research_agent → agents/academic_research/ - Academic paper research and analysis
code_generation_execution_agent → agents/code_generation_execution/ - Code generation with execution capabilities
dataset_builder_agent → agents/dataset_builder/ - Automated dataset creation and curation
document_segmentation_agent → agents/document_segmentation/ - Intelligent document chunking
hallucination_detector_agent → agents/hallucination_detector/ - Detect and prevent LLM hallucinations
knowledge_graph_agent → agents/knowledge_graph/ - Build and query knowledge graphs
market_intelligence_agent → agents/market_intelligence/ - Market research and competitive analysis
pii_scrubbing_agent → agents/pii_scrubbing/ - Remove personally identifiable information
recruiting_assistant_agent → agents/recruiting_assistant/ - Resume screening and candidate matching
research_assistant_agent → agents/research_assistant/ - General research and information gathering
sales_intelligence_agent → agents/sales_intelligence/ - Lead qualification and sales insights
text_summarization_agent → agents/text_summarization/ - Multi-format text summarization
web_search_agent → agents/web_search/ - Advanced web search with multiple providers

Tools

Registry name → Installation directory

pdf_search_tool → tools/pdf_search/ - Search within PDF documents with fuzzy matching
csv_search_tool → tools/csv_search/ - Query and analyze CSV files
code_docs_search_tool → tools/code_docs_search/ - Search code documentation
code_interpreter_tool → tools/code_interpreter/ - Execute Python code safely
directory_search_tool → tools/directory_search/ - File system navigation and search
duckduckgo_search_tool → tools/duckduckgo_search/ - Web search via DuckDuckGo
exa_search_tool → tools/exa_search/ - Neural search with Exa API
firecrawl_scrape_tool → tools/firecrawl_scrape/ - Advanced web scraping
git_repo_search_tool → tools/git_repo_search/ - Search within Git repositories
json_search_tool → tools/json_search/ - Query JSON documents
qwant_search_tool → tools/qwant_search/ - Privacy-focused web search
url_content_parser_tool → tools/url_content_parser/ - Extract and parse web content
youtube_video_search_tool → tools/youtube_video_search/ - Search and analyze YouTube videos

Prompt Templates (`prompts`)

Mirascope prompt templates:

Advanced Techniques - Chain of thought, few-shot learning
Common Patterns - Reusable prompt patterns
Text Processing - Summarization, extraction, transformation

Response Models (`models`)

Pydantic models for Mirascope structured outputs

Evaluations (`evals`)

Mirascope evaluation frameworks for testing LLM applications

Customization During Installation

The CLI supports customization flags that modify components during installation:

# These flags customize the component's code
sygaldry add research_assistant \
  --provider anthropic \          # Sets @llm.call(provider="anthropic")
  --model claude-3-opus \         # Sets @llm.call(model="claude-3-opus")
  --with-lilypad \               # Adds @lilypad.trace() decorators
  --stream                       # Enables streaming responses

The CLI uses template variables in the component files to apply these customizations.

Development

Prerequisites

Python 3.12+
devbox (recommended)
task (recommended)

Setup Development Environment

# Clone the repository
git clone https://github.com/sygaldry-ai/sygaldry
cd sygaldry

# Install dependencies
task install

# Enter development shell (with devbox)
devbox shell

Running Tests

# Run all tests
task test

# Run with coverage
task test:coverage

Creating Custom Components

1. Choose Component Type and Create Directory

# For a tool component
mkdir -p packages/sygaldry_registry/components/tools/my_tool

# For an agent component
mkdir -p packages/sygaldry_registry/components/agents/my_agent

2. Create component.json

Important: Component names must include a type suffix to avoid registry conflicts:

Agents: _agent (e.g., research_agent)
Tools: _tool (e.g., pdf_search_tool)
Prompts: _prompt (e.g., summarization_prompt)
Models: _model (e.g., research_output_model)
Evals: _eval (e.g., accuracy_eval)

{
  "$schema": "https://sygaldry.ai/schemas/component.json",
  "name": "my_search_tool",  // Note the _tool suffix
  "version": "0.1.0",
  "type": "tool",
  "description": "My custom Mirascope tool",
  "files_to_copy": [
    {"source": "tool.py", "destination": "tool.py", "type": "module"},
    {"source": "__init__.py", "destination": "__init__.py", "type": "init_file"}
  ],
  "target_directory_key": "toolDirectory",
  "python_dependencies": [
    {"name": "requests", "version": ">=2.28.0"}
  ],
  "supports_lilypad": true,
  "template_variables": {
    "provider": "{{provider}}",
    "model": "{{model}}"
  }
}

3. Implement the Component

# tool.py - Mirascope functional tool pattern
def my_search_function(query: str, limit: int = 10) -> list[str]:
    """
    Search for items matching the query.
    
    Args:
        query: The search query string
        limit: Maximum number of results to return
        
    Returns:
        List of matching results
    """
    # Your actual implementation here
    results = []  # Search logic
    return results[:limit]

# Export the tool function
__all__ = ["my_search_function"]

For agents that use tools:

# agent.py
from mirascope import llm, prompt_template
from tools.my_search import my_search_function

@llm.call(
    provider="{{provider}}", 
    model="{{model}}", 
    tools=[my_search_function]  # Tools are passed here
)
@prompt_template("Help me find information about: {topic}")
async def research_agent(topic: str) -> str:
    """Agent that can search for information."""
    ...

4. Create sygaldry.md Documentation

The sygaldry.md file is a crucial part of each component that serves multiple purposes:

Documentation: Comprehensive guide for using the component
Context: Provides context to LLMs when using the component
Examples: Shows real-world usage patterns
Configuration: Documents available options and customizations

Example sygaldry.md structure:

# Component Name

Brief description of what this component does.

## Overview
Detailed explanation of the component's purpose and capabilities.

## Configuration
- `provider`: LLM provider (default: from sygaldry.json)
- `model`: Model to use (default: from sygaldry.json)
- Template variables and their effects

## Usage Examples
\```python
# Basic usage
from tools.my_tool import my_tool_function
result = await my_tool_function(...)
\```

## API Reference
Detailed documentation of functions, classes, and parameters.

## Best Practices
Tips for optimal usage with Mirascope.

Interactive Testing & Examples

Want to test drive the agents before integrating them? We've got you covered!

The examples/ directory contains interactive testing tools:

Quick Exploration:

🚀 simple_agent_demo.py - Quick demos of 4 core agents (great for beginners)
🎯 interactive_agent_tester.py - Full interactive tester for all 30+ agents
📓 agent_testing_notebook.ipynb - Jupyter notebook for visual testing

Production Testing with Audit Logging:

📝 agent_runner_with_logging.py - Comprehensive audit logging
🧪 run_scenarios.py - Scenario-based testing with full logs
🔍 view_logs.py - Log viewer and analyzer
📋 test_scenarios.json - Realistic test scenarios

Quick Start

Option 1: Use the menu script (easiest)

# Install dependencies
pip install -r examples/requirements-interactive.txt

# Set your API key
export OPENAI_API_KEY="your-key"

# Run the menu
./test_agents.sh

Option 2: Run specific tests

# Quick demo
python3 examples/simple_agent_demo.py

# Production testing with logging
python3 examples/run_scenarios.py

# View audit logs
python3 examples/view_logs.py

📖 See examples/INTERACTIVE_TESTING.md for the complete testing guide. 📝 See examples/AGENT_TESTING_WITH_LOGGING.md for logging and audit documentation.

For Engineers, By Engineers

These tools emerge from our work at Grey Haven AI, where we build production AI systems across industries. We open source the patterns and utilities that prove their value in real deployments.

Contributing

We welcome contributions from the community. Whether you're fixing bugs, improving documentation, or proposing new features, your input helps make these tools better for everyone.

Check individual project repositories for contribution guidelines and development setup instructions.

Adding Components to the Registry

Follow the component structure guidelines
Use Mirascope decorators and patterns
Include comprehensive tests
Document in sygaldry.md
Ensure all dependencies are specified
Test with multiple LLM providers

Documentation

Main README: This file - overview and quick start
Development Guide: CLAUDE.md - comprehensive development context
Contributing: CONTRIBUTING.md - how to contribute
Interactive Testing: examples/ - test agents interactively
Additional Docs: docs/ - dependency info, PyPI README, test reports
Component Docs: Each component has its own README in packages/sygaldry_registry/components/

Support

Issues: File bugs and feature requests on GitHub Issues
Discussions: Join conversations on GitHub Discussions
Examples: See examples/ directory for hands-on testing

License

MIT License - see LICENSE for details.

Acknowledgments

Inspired by shadcn/ui's approach to component libraries
Built specifically for the Mirascope ecosystem
Special thanks to all contributors

Built with engineering rigor at Grey Haven AI

Name		Name	Last commit message	Last commit date
Latest commit History 148 Commits
.claude		.claude
.cursor/rules		.cursor/rules
.devcontainer		.devcontainer
.github		.github
.sourcegraph		.sourcegraph
.vscode		.vscode
docs		docs
examples		examples
hooks		hooks
packages/sygaldry_registry		packages/sygaldry_registry
scripts		scripts
src/sygaldry_cli		src/sygaldry_cli
sygaldry		sygaldry
taskfiles		taskfiles
tests		tests
.clinerules.md		.clinerules.md
.dive-ci		.dive-ci
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.gitignore		.gitignore
.hadolint.yaml		.hadolint.yaml
.markdownlint.jsonc		.markdownlint.jsonc
.markdownlintignore		.markdownlintignore
.pc_env		.pc_env
.pre-commit-config.yaml		.pre-commit-config.yaml
.release-please-manifest.json		.release-please-manifest.json
.shellcheckrc		.shellcheckrc
.tool-versions		.tool-versions
.windsurfrules.md		.windsurfrules.md
AGENT.md		AGENT.md
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
Dockerfile.dev		Dockerfile.dev
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
devbox.json		devbox.json
devbox.lock		devbox.lock
docker-bake.hcl		docker-bake.hcl
docker-compose.yml		docker-compose.yml
doppler.yaml		doppler.yaml
package.json		package.json
process-compose.yml		process-compose.yml
pyproject.toml		pyproject.toml
release-please-config.json		release-please-config.json
sygaldry.json		sygaldry.json
sygaldry.schema.json		sygaldry.schema.json
taskfile.yml		taskfile.yml
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

sygaldry

Mission

Philosophy

Projects

Why sygaldry

How sygaldry Works

Quick Start

Installation

Initialize Your Project

Add Components

Component Architecture

Directory Structure

Component Configuration

Mirascope Integration

Available Components

Agents

Tools

Prompt Templates (prompts)

Response Models (models)

Evaluations (evals)

Customization During Installation

Development

Prerequisites

Setup Development Environment

Running Tests

Creating Custom Components

1. Choose Component Type and Create Directory

2. Create component.json

3. Implement the Component

4. Create sygaldry.md Documentation

Interactive Testing & Examples

Quick Start

For Engineers, By Engineers

Contributing

Adding Components to the Registry

Documentation

Support

License

Acknowledgments

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Prompt Templates (`prompts`)

Response Models (`models`)

Evaluations (`evals`)

Packages