Skip to content

v0.1.28

Choose a tag to compare

@Henry-811 Henry-811 released this 22 Dec 18:21
· 959 commits to main since this release
95552dc

πŸš€ Release Highlights β€” v0.1.28 (2025-12-22)

πŸ–ΌοΈ Unified Multimodal Tools

  • Consolidated Tools: read_media for understanding and generate_media for generation (images, audio, video)
  • Backend Selector: Automatically routes to appropriate provider (Gemini, OpenAI, OpenRouter) based on availability
  • Multi-Provider Support: OpenAI (gpt-image-1, Sora, TTS), Google (Imagen, Veo), and OpenRouter

πŸ“„ Web UI Artifact Previewer

  • Document Preview: Preview PDFs, DOCX, PPTX, XLSX directly in the web interface
  • Media Preview: View images, HTML, SVG, and Mermaid diagrams inline
  • Code Preview: Sandpack-powered code preview with syntax highlighting
  • New Components: ArtifactPreviewModal and InlineArtifactPreview for flexible rendering

πŸ” OpenRouter Model Filtering

  • Tool-Capable Models Only: Model list now filters to only show models supporting tool calling
  • Capability Detection: Checks supported_parameters for "tools" capability before including models

πŸ“– Getting Started

  • Quick Start Guide: Try the new features today
  • Try These Examples:
    • unified_multimodal.yaml - Try the unified multimodal tools for understanding and generation
    • massgen --web - Try the Web UI with artifact previewer

What's Changed

  • feat: Restricting openrouter models with no tool call support (MAS 165) by @shubham2345 in #680
  • fix: Fix Azure OpenAI tool calls and workflow integration by @AbhimanyuAryan in #675
  • feat: Standardize multimedia understanding and generation, preview of media in webui, background shell docker fix by @ncrispino in #678
  • docs: docs for v0.1.28 by @Henry-811 in #682
  • feat: v0.1.28 by @Henry-811 in #681

New Contributors

Full Changelog: v0.1.27...v0.1.28