Your AI Teammate That Fixes Your AI 24/7
Handit catches failures, writes fixes, tests them, and ships PRs, automatically. Like having a dedicated teammate for your AI, except it works 24/7.
Quick Start • Core Features • Docs • Schedule a Call
handit.ai solves AI reliability.
Modern AI applications are fragile: they hallucinate, break schemas, leak PII, and fail silently. When your AI fails at 2am, customers complain, and you're debugging blind. Did the model change? Is a tool broken? Is there a logic error? Without visibility, you're playing whack-a-mole with quality issues.
handit.ai is your AI teammate that monitors your AI 24/7, detects issues, generates fixes, tests them against real data, and ships them as pull requests, all automatically.
Write JavaScript, TypeScript, Python, and more. What used to take manual debugging and firefighting now happens automatically with handit.ai.
Get your AI teammate up and running in under 5 minutes:
Navigate to your AI project directory and run:
npx @handit/cli setup
The CLI will guide you through connecting your AI teammate:
- Connect your handit.ai account
- Install the handit SDK in your project
- Configure your API key for monitoring
- Connect evaluation models (OpenAI, Together AI, etc.)
- Connect your GitHub repository for automated PRs
Check your dashboard: Go to dashboard.handit.ai. You should see:
- Tracing data flowing in real-time
- Quality scores for evaluated interactions
- Agent Performance showing baseline metrics
Confirm GitHub integration: Check your repository. You should see:
- handit app installed in repository settings
- Ready for PRs - your AI teammate can now create pull requests
That's it! Your AI teammate is now monitoring your AI, evaluating quality, and ready to create pull requests with fixes whenever issues are detected.
Need custom control? Add monitoring decorators manually to your agent functions:
# Python
pip install handit-ai
# JavaScript/TypeScript
npm install @handit.ai/handit-ai
Python:
# Auto-generated by handit-cli setup
import os

from handit_ai import tracing, configure

configure(HANDIT_API_KEY=os.getenv("HANDIT_API_KEY"))

# Tracing added to your main agent function (entry point)
@tracing(agent="customer-service-agent")
async def process_customer_request(user_message: str):
    # Your existing agent logic (unchanged)
    intent = await classify_intent(user_message)   # Not traced individually
    context = await search_knowledge(intent)       # Not traced individually
    response = await generate_response(context)    # Not traced individually
    return response
JavaScript:
// Auto-generated by handit-cli setup
import { configure, startTracing, endTracing } from '@handit.ai/handit-ai';

configure({
  HANDIT_API_KEY: process.env.HANDIT_API_KEY
});

// Tracing added to your main agent function (entry point)
export const processCustomerRequest = async (userMessage) => {
  startTracing({ agent: "customer-service-agent" });
  try {
    // Your existing agent logic (unchanged)
    const intent = await classifyIntent(userMessage);   // Not traced individually
    const context = await searchKnowledge(intent);      // Not traced individually
    const response = await generateResponse(context);   // Not traced individually
    return response;
  } finally {
    // Always close the trace, even if the agent logic throws
    endTracing();
  }
};
That's it! Check dashboard.handit.ai to see your traces.
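To make the entry-point idea concrete, here is a minimal sketch of what a tracing decorator of this kind does conceptually. It is not Handit's implementation and does not use the handit_ai package: toy_tracing, TRACES, and the stubbed agent steps are hypothetical names for illustration only. It records one trace per call to the decorated entry point and nothing for the inner steps, mirroring the "Not traced individually" comments in the examples above.

```python
import asyncio
import functools
import time

# Hypothetical in-memory sink; a real SDK would send traces to a backend.
TRACES = []

def toy_tracing(agent):
    """Stand-in decorator (NOT the handit_ai API): one trace per entry-point call."""
    def decorator(func):
        @functools.wraps(func)
        async def wrapper(*args, **kwargs):
            record = {"agent": agent, "input": args, "error": None}
            start = time.perf_counter()
            try:
                record["output"] = await func(*args, **kwargs)
                return record["output"]
            except Exception as exc:
                # Keep the failure on the trace, then let it propagate.
                record["error"] = repr(exc)
                raise
            finally:
                record["seconds"] = time.perf_counter() - start
                TRACES.append(record)
        return wrapper
    return decorator

# Stubbed agent steps, standing in for real model and tool calls.
async def classify_intent(message):
    return "refund" if "refund" in message.lower() else "other"

async def search_knowledge(intent):
    return f"policy for {intent}"

async def generate_response(context):
    return f"Answer based on {context}"

@toy_tracing(agent="customer-service-agent")
async def process_customer_request(user_message):
    intent = await classify_intent(user_message)
    context = await search_knowledge(intent)
    return await generate_response(context)

if __name__ == "__main__":
    print(asyncio.run(process_customer_request("I want a refund")))
    print(len(TRACES), TRACES[0]["agent"])
```

Wrapping only the entry point keeps the existing agent logic untouched, at the cost of not seeing which inner step was slow or wrong.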
On-Call 24/7: Monitors every request, catches failures in real-time before customers complain.
- Hallucinations and incorrect responses
- Schema breaks and validation errors
- PII leaks and security issues
- Performance degradation and timeouts
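As a rough illustration of three of the failure types listed above (schema breaks, PII leaks, timeouts), the sketch below checks a single model response. It makes no claim about how Handit detects these issues: check_response, the regex patterns, and the 5-second budget are assumptions chosen for the example, and hallucination detection is out of scope because it generally needs an evaluator model.

```python
import json
import re

# Illustrative patterns only; real PII detection needs far broader coverage.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
SSN_RE = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")

def check_response(raw, required_fields, seconds, max_seconds=5.0):
    """Return a list of issue labels for one model response (hypothetical helper)."""
    try:
        payload = json.loads(raw)
    except json.JSONDecodeError:
        return ["schema: not valid JSON"]
    if not isinstance(payload, dict):
        return ["schema: expected a JSON object"]

    issues = []
    # Schema check: every required field must exist with the expected type.
    for field, expected_type in required_fields.items():
        if field not in payload:
            issues.append(f"schema: missing field '{field}'")
        elif not isinstance(payload[field], expected_type):
            issues.append(f"schema: wrong type for '{field}'")
    # PII check: scan the raw text so nested values are covered too.
    if EMAIL_RE.search(raw) or SSN_RE.search(raw):
        issues.append("pii: possible personal data in output")
    # Performance check: compare latency against a fixed budget.
    if seconds > max_seconds:
        issues.append("performance: response exceeded time budget")
    return issues

if __name__ == "__main__":
    schema = {"answer": str, "confidence": float}
    print(check_response('{"answer": "Email jane@example.com"}', schema, seconds=1.2))
```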
Insights: Analyzes root causes, generates fixes and tests solutions on actual failure cases in production.
- Prompt improvements and optimizations
- Configuration changes and guardrails
- Code fixes for logic errors
- Model parameter adjustments
Opens PRs with proven fixes: You review and merge, or auto-deploy with guardrails.
- Tested fixes with real performance data
- Detailed explanations of changes
- A/B testing results and metrics
- Rollback capabilities
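The idea of testing a fix against recorded failure cases before opening a PR can be sketched as a simple comparison of success rates. This is an assumption-laden illustration, not Handit's evaluation pipeline: success_rate, compare_variants, and the min_gain threshold are hypothetical, and the two "variants" are plain functions standing in for an old and a new prompt.

```python
def success_rate(run, cases, is_success):
    """Fraction of cases for which run(case) is judged successful."""
    if not cases:
        raise ValueError("need at least one case")
    passed = sum(1 for case in cases if is_success(case, run(case)))
    return passed / len(cases)

def compare_variants(baseline, candidate, cases, is_success, min_gain=0.05):
    """Score both variants on the same cases; propose the candidate only if it gains enough."""
    base = success_rate(baseline, cases, is_success)
    cand = success_rate(candidate, cases, is_success)
    return {"baseline": base, "candidate": cand, "ship": cand - base >= min_gain}

# Toy stand-ins: the "fix" strips whitespace that the baseline leaves in place.
def baseline_variant(case):
    return case["input"].upper()

def candidate_variant(case):
    return case["input"].strip().upper()

def matches_expected(case, output):
    return output == case["expected"]

if __name__ == "__main__":
    recorded_cases = [
        {"input": " hi", "expected": "HI"},
        {"input": "ok", "expected": "OK"},
        {"input": "yes ", "expected": "YES"},
        {"input": "no", "expected": "NO"},
    ]
    print(compare_variants(baseline_variant, candidate_variant,
                           recorded_cases, matches_expected))
```

Scoring both variants on the same recorded cases keeps the comparison fair; a production setup would also want enough cases for the difference to be meaningful.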
A self-improving AI agent that automatically converts messy, unstructured documents into clean, structured data and CSV tables. Perfect for processing invoices, purchase orders, contracts, medical reports, and any other document type. But here's the kicker: it actually gets better at its job over time.
Key Features:
- Schema Inference: AI analyzes documents and creates optimal JSON structure
- Data Extraction: Maps document fields to schema with confidence scoring
- CSV Generation: Automatically creates organized tables for data visualization
- Multimodal Support: Handles images, PDFs, and text files
- Session Management: Isolated processing for different document batches
- Self-improvement: Handit observes every agent interaction, and if a failure is detected, it automatically fixes it
Technologies: Python, LangGraph, LangChain, OpenAI, FastAPI, Pandas, Handit.ai
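The extraction-with-confidence and CSV generation steps can be pictured with a small standard-library sketch. It is not the agent's actual code, which the list above says uses Pandas and LangGraph: records_to_csv, the (value, confidence) tuple layout, and the 0.8 threshold are assumptions made for illustration.

```python
import csv
import io

def records_to_csv(records, min_confidence=0.8):
    """Write extracted fields to CSV text, blanking values below the confidence threshold.

    Each record maps a field name to a (value, confidence) tuple.
    """
    # Collect columns in first-seen order so documents with different fields line up.
    columns = []
    for record in records:
        for name in record:
            if name not in columns:
                columns.append(name)

    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, lineterminator="\n")
    writer.writeheader()
    for record in records:
        row = {}
        for name in columns:
            # Missing fields count as zero confidence and are left blank.
            value, confidence = record.get(name, ("", 0.0))
            row[name] = value if confidence >= min_confidence else ""
        writer.writerow(row)
    return buffer.getvalue()

if __name__ == "__main__":
    invoices = [
        {"invoice_id": ("INV-001", 0.98), "total": ("120.50", 0.91)},
        {"invoice_id": ("INV-002", 0.95), "total": ("99.00", 0.42), "vendor": ("Acme", 0.88)},
    ]
    print(records_to_csv(invoices))
```

Blanking low-confidence values rather than dropping the row keeps the table aligned and makes the uncertain cells easy to spot for review.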
Write your AI agents in your preferred language:
| Language | Status | SDK Package |
|---|---|---|
| Python | Stable | handit-ai>=0.0.62 |
| JavaScript | Stable | @handit.ai/handit-ai |
| TypeScript | Stable | @handit.ai/handit-ai |
| Go | Available | HTTP API integration |
| Any Stack/Framework | Available | HTTP API integration (n8n, Zapier, etc.) |
| Java, C#, Ruby, PHP | Available | REST API integration |
| LangChain & LangGraph | Available | Python/JS SDK |
| LlamaIndex, AutoGen | Available | Python/JS SDK + HTTP API |
| CrewAI, Swarm | Available | Python SDK + HTTP API |
See how teams eliminated their AI firefighting with handit.ai:
ASPE.ai was running a high-stakes agent that was silently failing every time. Within 48 hours of connecting handit, the system identified the issue, tested fixes, and deployed the new prompts.
- +62.3% Accuracy improvement
- +36% Response relevance
- +97.8% Success rate
XBuild's AI was suffering from prompt drift that tanked performance across key models. handit stepped in, ran automatic A/B tests, and deployed the top-performing versions.
- +34.6% Accuracy improvement
- +19.1% Success rate
- +6600 Automatic evaluations
Advanced users only. If you need custom control over your AI teammate setup, you can manually add monitoring code instead of using the CLI.
When to use manual setup:
- Custom deployment environments
- Complex agent architectures
- Need granular control over monitoring
Quick manual setup:
- Manual Setup Guide - Add decorators yourself
- Advanced Setup - Node-by-node monitoring
CLI command not found?
- Solution: Install Node.js first: node --version (should show v16+)
- If still failing: Try npx @handit/cli setup directly
"Authentication failed" during setup?
- Solution: Check your Handit.ai account credentials at dashboard.handit.ai
- If still failing: Try logging out and back in to your Handit account
No traces appearing in dashboard?
- Solution: Run npx @handit/cli setup again to regenerate configuration
- Check: Your generated code is actually being executed (not just imported)
- Verify: API key was set correctly: echo $HANDIT_API_KEY
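The shell check only shows what your current terminal sees; the process that runs your agent may have a different environment (for example under a service manager or in a container). A small sketch of the same check from inside Python follows. describe_api_key is a hypothetical helper, not part of the Handit SDK, and it deliberately never prints the key itself.

```python
import os

def describe_api_key(env=None, name="HANDIT_API_KEY"):
    """Report whether the key is visible to this process, without revealing it."""
    env = os.environ if env is None else env
    value = env.get(name)
    if value is None:
        return f"{name} is not set in this process"
    if not value.strip():
        return f"{name} is set but empty"
    if value != value.strip():
        # Stray whitespace often comes from copy-pasting into .env files.
        return f"{name} has leading or trailing whitespace"
    return f"{name} is set ({len(value)} characters)"

if __name__ == "__main__":
    print(describe_api_key())
```

Running this at the top of the same entry point that starts your agent tells you what that process actually sees.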
Evaluations not running?
- Solution: Re-run npx @handit/cli setup to verify model connections
- Check: Model tokens have sufficient credits in your provider dashboard
- Verify: Your AI is receiving traffic (evaluations only run on active agents)
GitHub app installation failed?
- Solution: Ensure you have admin access to the repository
- Try: npx @handit/cli setup again to reinstall the app
- Check: Repository permissions in GitHub Settings → Applications
Open source because you need to trust what pushes to prod.
- Questions: Use our Discord community
- Bug Reports: GitHub Issues
- Documentation: Official Docs
- Schedule a Call: Book a Demo
We have a public roadmap for handit.ai. You can view it here.
Feel free to add comments to the issues, or create a new issue if you have a feature request.
| Feature | Status | Link | Description |
|---|---|---|---|
| Advanced Prompt Optimization | Planned | #485 | Multi-model prompt optimization |
| Custom Evaluation Metrics | Planned | #495 | User-defined evaluation criteria |
| Real-time Dashboard | Planned | #497 | Live monitoring interface |
| Auto-deployment | Planned | #476 | Automated deployment with guardrails |
| Multi-agent Support | Planned | #477 | Complex agent orchestration |
| Custom Integrations | Planned | #480 | Third-party tool integrations |
- Documentation - Complete guides and API reference
- Discord - Community support and discussions
- GitHub Issues - Bug reports and feature requests
- Demo - See handit in action
We welcome contributions! Whether it's:
- Bug fixes and improvements
- New features
- Documentation and examples
- Language support additions
- Dashboard UI enhancements
Check our Contributing Guide to get started.
# Clone the repository
git clone https://github.com/handit-ai/handit.ai.git
cd handit.ai
# Install dependencies
npm install
# Start development environment
npm run dev
Thanks to everyone helping bring Handit to life:
Want to appear here? Star the repo, follow along, and make your first PR.
This project is licensed under the MIT License - see the LICENSE file for details.
- Community: Discord for real-time help
- Support: Contact Us for technical issues
- Documentation: docs.handit.ai for comprehensive guides
- Advanced: Manual Setup for custom configurations
Stop Being Your AI's On-Call Engineer
Let handit.ai handle the 2am failures while you focus on building features.
Get Started Free • View on GitHub • Join Discord
Open source. GitHub-native. Starts working in minutes.



