All notable changes to ACE Framework will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
0.7.0 - 2025-12-04
- Complete terminology rename - Playbook → Skillbook, Bullet → Skill
Playbook→SkillbookBullet→SkillGenerator→AgentCurator→SkillManagerOfflineAdapter→OfflineACEOnlineAdapter→OnlineACEDeltaOperation→UpdateOperationDeltaBatch→UpdateBatch- Migration: Update imports and method calls to use new names
- JSON files: Change
"bullets"key to"skills"in saved skillbooks
- Deduplication consolidation_operations field - SkillManagerOutput now properly captures consolidation operations from LLM responses
- Deduplication not working - Added
consolidation_operationsfield to SkillManagerOutput Pydantic model. Previously, Instructor was silently dropping these operations.
0.5.0 - 2025-11-20
- Playbook format changed to TOON (Token-Oriented Object Notation)
Playbook.as_prompt()now returns TOON format instead of markdown- Reason: 16-62% token savings for improved scalability and reduced inference costs
- Migration: No action needed if using playbook with Generator/Curator/Reflector
- Debugging: Use
playbook._as_markdown_debug()orstr(playbook)for human-readable output - Details: Uses tab delimiters and excludes internal metadata (created_at, updated_at)
- ACELiteLLM integration - Simple conversational agent with automatic learning
- ACELangChain integration - Wrap LangChain Runnables with ACE learning
- Custom integration pattern - Wrap ANY agentic system with ACE learning
- Base utilities in
ace/integrations/base.pywithwrap_playbook_context()helper - Complete working example in
examples/custom_integration_example.py - Integration Pattern: Inject playbook → Execute agent → Learn from results
- Base utilities in
- Integration exports - Import ACEAgent, ACELiteLLM, ACELangChain from
acepackage root - TOON compression for playbooks - 16-62% token reduction vs markdown
- Citation-based tracking - Strategies cited inline as
[section-00001], auto-extracted from reasoning - Enhanced browser traces - Full execution logs (2200+ chars) passed to Reflector
- Test coverage - Improved from 28% to 70% (241 tests total)
- Renamed SimpleAgent → ACELiteLLM - Clearer naming for conversational agent integration
Playbook.__str__()returns markdown (TOON reserved for LLM consumption viaas_prompt())
- Browser-use trace integration - Reflector now receives complete execution traces
- Fixed initial query duplication (task appeared in both question and reasoning)
- Fixed missing trace data (reasoning field now contains 2200+ chars vs 154 chars)
- Fixed screenshot attribute bug causing AttributeError on step.state.screenshot
- Fixed invalid bullet ID filtering - hallucinated/malformed citations now filtered out
- Added comprehensive regression tests to catch these issues
- Impact: Reflector can now properly analyze browser agent's thought process
- Test coverage improved: 69% → 79% for browser_use.py
- Prompt v2.1 test assertions updated to match current format
- All 206 tests now pass (was 189)
0.4.0 - 2025-10-26
- Production Observability with Opik integration
- Enterprise-grade monitoring and tracing
- Automatic token usage and cost tracking for all LLM calls
- Real-time cost monitoring via Opik dashboard
- Graceful degradation when Opik is not installed
- Browser Automation Demos showing ACE vs baseline performance
- Domain checker demo with learning capabilities
- Form filler demo with adaptive strategies
- Side-by-side comparison of baseline vs ACE-enhanced automation
- Support for UV package manager (10-100x faster than pip)
- Added uv.lock for reproducible builds
- UV-specific installation and development instructions
- Improved documentation structure with multiple guides
- QUICK_START.md for 5-minute quickstart
- API_REFERENCE.md for complete API documentation
- PROMPT_ENGINEERING.md for advanced techniques
- SETUP_GUIDE.md for development setup
- TESTING_GUIDE.md for testing procedures
- Optional dependency groups for modular installation
observabilityfor Opik integrationdemosfor browser automation exampleslangchainfor LangChain supporttransformersfor local model supportdevfor development toolsallfor all features combined
- Replaced explainability module with observability
- Removed empty ace/explainability directory
- Migrated to production-grade Opik monitoring
- Updated all documentation to reflect this change
- Improved Python version requirements consistency (3.11+ everywhere)
- Enhanced README with clearer examples and installation options
- Reorganized examples directory for better discoverability
- Updated CLAUDE.md with comprehensive codebase guidance
- Package configuration in pyproject.toml
- Documentation references to non-existent explainability module
- Python version inconsistencies across documentation files
- Empty ace/explainability module (replaced by observability)
- Outdated references to explainability features in documentation
0.3.0 - 2025-10-16
- Experimental v2 Prompts with state-of-the-art prompt engineering
- Confidence scoring at bullet and answer levels
- Domain-specific variants for math and code generation
- Hierarchical structure with identity headers and metadata
- Concrete examples and anti-patterns for better guidance
- PromptManager for version control and A/B testing
- Comprehensive prompt engineering documentation (
docs/PROMPT_ENGINEERING.md) - Advanced examples demonstrating v2 prompts (
examples/advanced_prompts_v2.py) - Comparison script for v1 vs v2 prompts (
examples/compare_v1_v2_prompts.py) - Playbook persistence with
save_to_file()andload_from_file()methods - Example demonstrating playbook save/load functionality (
examples/playbook_persistence.py) - py.typed file for PEP 561 type hint support
- Mermaid flowchart visualization in README showing ACE learning loop
- Enhanced docstrings with comprehensive examples throughout codebase
- Improved README with v2 prompts section and visual diagrams
- Updated formatting to comply with Black code style
- README incorrectly referenced non-existent docs/ directory
- Test badge URL in README (test.yml → tests.yml)
- Code formatting issues detected by GitHub Actions
0.2.0 - 2025-10-15
- LangChain integration via
LangChainLiteLLMClientfor advanced workflows - Router support for load balancing across multiple model deployments
- Comprehensive example for LangChain usage (
examples/langchain_example.py) - Optional installation group:
pip install ace-framework[langchain] - PyPI badges and Quick Links section in README
- CHANGELOG.md for version tracking
- Parameter filtering in LiteLLM and LangChain clients (refinement_round, max_refinement_rounds)
- GitHub Actions workflow using deprecated artifact actions v3 → v4
- Improved README with better structure and badges
- Updated .gitignore to exclude build artifacts and development files
- Unnecessary development files from repository
0.1.1 - 2025-10-15
- GitHub Actions workflow for PyPI publishing
- Updated artifact upload/download actions from v3 to v4
0.1.0 - 2025-10-15
- Initial release of ACE Framework
- Core ACE implementation based on paper (arXiv:2510.04618)
- Three-role architecture: Generator, Reflector, and Curator
- Playbook system for storing and evolving strategies
- LiteLLM integration supporting 100+ LLM providers
- Offline and Online adaptation modes
- Async and streaming support
- Example scripts for quick start
- Comprehensive test suite
- PyPI packaging and GitHub Actions CI/CD
- Self-improving agents that learn from experience
- Delta operations for incremental playbook updates
- Support for OpenAI, Anthropic, Google, and more via LiteLLM
- Type hints and modern Python practices
- MIT licensed for open source use