Skip to content

v0.1.33

Choose a tag to compare

@Henry-811 Henry-811 released this 02 Jan 18:46
· 805 commits to main since this release
a6e0775

πŸš€ Release Highlights β€” v0.1.33 (2026-01-02)

πŸ”„ Reactive Context Compression

  • Automatic Recovery: Conversation automatically compressed when context length errors occur
  • Seamless Continuation: Agents resume work after compression without losing progress
  • Configurable Ratio: Set compression_target_ratio (0-1) to control how much context to preserve

πŸ“¦ Streaming Buffer System

  • Response Tracking: Tracks partial agent responses during streaming for compression recovery
  • Backend Integration: Works across all supported backends (OpenAI, Claude, Gemini, Grok)

πŸ›‘οΈ MCP Tool Protections

πŸ› Bug Fixes

  • Grok MCP Tools: Fixed MCP tool visibility by adjusting tool handling in chat completions
  • Gemini Vote-Only Mode: Fixed vote_only parameter handling in Gemini backend streaming
  • GPT-5 Model Behavior: System prompt adjustments and default reasoning set for newer models
  • Circuit Breaker: Improved debugging output with shorter ultimate timeout for faster failure detection

πŸ“– Getting Started

What's Changed

Full Changelog: v0.1.32...v0.1.33