Current Version: v0.1.62
Release Schedule: Mondays, Wednesdays, Fridays @ 9am PT
Last Updated: March 11, 2026
This roadmap outlines MassGen's development priorities for upcoming releases. Each release focuses on specific capabilities with real-world use cases.
Want to contribute or collaborate on a specific track? Reach out to the track owners below:
| Track | GitHub | Discord |
|---|---|---|
| Tool System Refactoring | @qidanrui | danrui2020 |
| Multimodal Support | @qidanrui | danrui2020 |
| General Interoperability | @qidanrui | danrui2020 |
| RL Integration | @qidanrui @praneeth999 | danrui2020, ram2561 |
| Agent Adapter System | @Eric-Shang | ericshang. |
| Framework Streaming | @Eric-Shang | ericshang. |
| Irreversible Actions Safety | @franklinnwren | zhichengren |
| Computer Use | @franklinnwren | zhichengren |
| Memory Module | @qidanrui @ncrispino | danrui2020, nickcrispino |
| Rate Limiting System | @AbhimanyuAryan | abhimanyuaryan |
| DSPy Integration | @praneeth999 | ram2561 |
| MassGen Handbook | @a5507203 @Henry-811 | crinvo, henry_weiqi |
| Session Management | @ncrispino | nickcrispino |
| Automatic MCP Tool Selection | @ncrispino | nickcrispino |
| Parallel File Operations | @ncrispino | nickcrispino |
| MassGen Terminal Evaluation | @ncrispino | nickcrispino |
| Textual Terminal Display | @praneeth999 | ram2561 |
| Web UI | @voidcenter | justin_zhang |
For general questions, join the #massgen channel on Discord
| Release | Target | Feature | Owner | Use Case |
|---|---|---|---|---|
| v0.1.63 | 03/13/26 | Gemini CLI Backend | @ncrispino | Gemini CLI backend support (#952) |
| v0.1.63 | 03/13/26 | Image/Video Edit Capabilities | @ncrispino | Check and support img/video editing capabilities (#959) |
| v0.1.64 | 03/16/26 | OpenAI Audio API | @ncrispino | Support OpenAI audio API for audio understanding (#960) |
All releases ship on MWF @ 9am PT when ready
Released: March 11, 2026 | PR: #992
- MassGen Skill: New general-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents
- Session Viewer: New
massgen viewercommand for real-time observation of automation sessions with interactive picker and web mode - Backend Improvements: Claude Code background task execution, Codex native filesystem and MCP support, Copilot runtime model discovery
- Quickstart Enhancements: Headless quickstart (
--quickstart --headless) for CI/CD, web quickstart (--web-quickstart) for browser-based setup - Evaluation & Planning: Better planning prompts with thoroughness support, removed should/could criteria
Released: March 9, 2026 | PR: #986
- Round Evaluator Subagent Type: New
round_evaluatorsubagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment - Orchestrator Refactoring: Major orchestrator refactoring (+1,189 lines) to support the round evaluation workflow
- Evaluation Improvements: Improved evaluation prompts with task plan injection for context-aware assessment
- New Config:
round_evaluator_example.yamlfor easy adoption of the round evaluator paradigm - Fixes: Session resumption from already-resumed logs, SUBAGENT.md generality improvements, round evaluation prompt clarity
Released: March 6, 2026 | PR: #978
- Multimodal Tool Improvements: Rewritten
read_mediawith clearer schema, newMediaCallLedgerHookfor media call tracking - Subagent Enhancements:
inherit_spawning_agent_backend,final_answer_strategy, per-agentsubagent_agentsoverride - GPT-5.4 Support: New default OpenAI flagship model added to the model registry
- Decomp Mode Cooperates with Checklist: Decomposition mode now cooperates with checklist workflow for quality-gated subtask iteration
- Fixes: Checklist/proposal injection improvements, Codex prompt caching calculation fix, task plan refresh, skill prefix handling
Released: March 4, 2026 | PR: #969
- Planning Improvements: Auto-add improvements to task plan, plan review enhancements
- Checklist & Evaluation Enhancements: Better eval gen config, checklist fixes, Gemini tool name normalization for MCP
- Subagent Improvements: Adjusted subagent behavior, subagent manager enhancements, Docker skill write access fixes
- Media Generation Fixes: Video gen skill adjustments (no fallback to animated on errors), video understanding criticality, impact metric restoration
- Bug Fixes: Answer anonymization fix, quickstart and test updates, plan/Docker small fixes
1. Gemini CLI Backend (@ncrispino)
- Issue: #952
- Add Gemini CLI as a first-class backend
- Use Case: Enable Gemini CLI as a native backend option alongside Claude Code and Codex
2. Check Image/Video Edit Capabilities (@ncrispino)
- Issue: #959
- Investigate and support image and video editing capabilities across providers
- Multi-turn editing workflows with continuation IDs
- Use Case: Enable iterative media editing within multi-agent workflows
- ✅ Gemini CLI backend functional and tested
- ✅ Image editing capabilities documented and tested
- ✅ Video editing capabilities documented and tested
1. Support OpenAI Audio API for Audio Understanding (@ncrispino)
- Issue: #960
- OpenAI audio API integration for audio understanding tasks
- Integration with existing
read_mediatool for audio input - Use Case: Enable audio understanding via OpenAI's native audio API
- ✅ OpenAI audio API working via
read_media - ✅ Audio understanding integrated into multi-agent workflows
These features are being actively developed on separate parallel tracks and will ship incrementally on the MWF release schedule:
- PR: #283
- Unified agent interface for easier backend integration
- Shipping: Continuous improvements
- Human-in-the-loop approval system for dangerous operations
- LLM-based tool risk detection
- Target: v0.1.3 and beyond
- PR: #252
- Image, audio, video processing across backends
- Shipping: Incremental improvements each release
- Issues: #347, #348
- Short and long-term memory implementation with persistence
- Status: ✅ Completed in v0.1.5
- Agent task planning with dependency tracking
- Status: ✅ Completed in v0.1.7
- LLM agent automation with status tracking and silent execution
- MassGen running MassGen for self-improvement workflows
- Status: ✅ Completed in v0.1.8
- Case Study: Meta-level self-analysis demonstrating automation mode (
meta-self-analysis-automation-mode.md)
- Question paraphrasing for multi-agent diversity
- Semantic validation and caching system
- Status: ✅ Completed in v0.1.8
- PR: #462
- Real-time streaming for LangGraph and SmoLAgent intermediate steps
- Enhanced debugging and monitoring for external framework tools
- Status: ✅ Completed in v0.1.10
- PR: #383
- Multi-dimensional rate limiting for Gemini models
- Model-specific limits with sliding window tracking
- Status: ✅ Completed in v0.1.11
- Issue: #387
- Comprehensive user documentation and handbook at https://massgen.github.io/Handbook/
- Centralized policies and resources for development and research teams
- Status: ✅ Completed in v0.1.10
- PR: #402
- Browser and desktop automation with OpenAI, Claude, and Gemini integration
- Visual perception through screenshot processing and action execution
- Status: ✅ Completed in v0.1.9
- PR: #466
- Complete session state tracking and restoration
- Resume previous MassGen conversations with full context
- Status: ✅ Completed in v0.1.9
- PR: #515
- Semantic search capabilities via semtools (embedding-based similarity)
- Symbol-level code understanding via serena (LSP integration)
- Package as reusable skills within MassGen framework
- Status: ✅ Completed in v0.1.12
- PR: #515
- Complete refactoring of system prompt assembly
- Hierarchical structure with improved LLM attention management
- Skills system local execution support
- Status: ✅ Completed in v0.1.12
- PR: #513
- Enhanced Gemini computer use with Docker integration
- Multi-agent coordination for computer automation
- VNC visualization and debugging support
- Status: ✅ Completed in v0.1.12
- Issue: #414
- Tool integration via importable Python code instead of schema-based tools
- MCP server registry with auto-discovery
- Reduces token usage through on-demand tool loading
- Status: ✅ Completed in v0.1.13
- PR: #475
- Natural Language Integration Platform for advanced tool routing
- Multi-backend support across Claude, Gemini, and OpenAI
- Per-agent and orchestrator-level configuration
- Status: ✅ Completed in v0.1.13
- PR: #520
- Configurable concurrent tool execution across all backends
- Model-level and local execution controls
- Asyncio-based scheduling with semaphore limits
- Status: ✅ Completed in v0.1.14
- PR: #530
- Full integration for Google's Gemini 3 Pro model
- Function calling support with parallel tool capabilities
- Status: ✅ Completed in v0.1.14
- Issue: #441
- Increase parallelism of file read operations
- Standard efficiency evaluation and benchmarking methodology
- Status: ✅ Completed in v0.1.14
- PR: #547
- Automatic generation of diverse system messages for multi-agent configurations
- Multiple generation strategies: complementary, diverse, specialized, adversarial
- Status: ✅ Completed in v0.1.15
- PR: #545, #538
- GitHub Container Registry integration with ARM support
- MassGen pre-installed in Docker images for immediate use
- Status: ✅ Completed in v0.1.15
- Issue: #510
- Enable custom tools to run in isolated Docker containers
- Security isolation and portability for custom tool execution
- Status: ✅ Completed in v0.1.15
- Issue: #476
- PR: #553
- Self-evaluation and improvement of frontend/UI through terminal recording
- Automated video generation and case study creation using VHS
- Status: ✅ Completed in v0.1.16
- Issue: #543
- PR: #553
- Accurate cost calculation using LiteLLM's pricing database
- Integration with LiteLLM pricing for 500+ models with auto-updates
- Status: ✅ Completed in v0.1.16
- PR: #555
- Persistent memory with multi-turn session support
- Memory archiving for session persistence and continuity
- Status: ✅ Completed in v0.1.16
- Issue: #476
- Four new skills for MassGen to develop and maintain itself
- Self-documenting release workflows and configuration generation
- Status: ✅ Completed in v0.1.16
- Issue: #537
- Enhance consistency of memory retrieval across agents
- Improve tool reminder system for better agent awareness
- Standardize memory access patterns
- Status: ✅ Completed in v0.1.16
- Issue: #539
- PR: #482
- Rich terminal UI using Textual framework with dark/light themes
- Enhanced visualization for multi-agent coordination
- Status: ✅ Completed in v0.1.17
- Issue: #437
- PR: #569
- Enable agents to broadcast questions when facing implementation uncertainties
- Human-in-the-loop and agent-to-agent communication for clarification
- Status: ✅ Completed in v0.1.18
- PR: #568
- Programmatic tool calling from code execution sandbox
- Server-side tool search with deferred loading
- Status: ✅ Completed in v0.1.18
- PR: #580
- MassGen as a LiteLLM custom provider with
MassGenLLMclass - New
run()andbuild_config()functions for programmatic execution NoneDisplayfor silent output in programmatic/LiteLLM use- Status: ✅ Completed in v0.1.19
- PR: #572
enable_strict_tool_useconfig flag with recursive schema patchingoutput_schemaparameter for structured JSON outputs- Status: ✅ Completed in v0.1.19
- PR: #576
- Automatic retry mechanism for rate limit errors (429, 503)
- Jittered exponential backoff with
Retry-Afterheader support - Status: ✅ Completed in v0.1.19
- Issue: #552
- Automatic Docker container setup for Computer Use Agent
- Auto-detection of CUA configs with automatic container creation
- Status: ✅ Completed in v0.1.20
- PR: #588
- Browser-based real-time visualization for multi-agent coordination
- FastAPI server with WebSocket streaming and React frontend
- Status: ✅ Completed in v0.1.20
- Improved function call handling for multi-turn contexts
- Preserves function_call entries and generates stub outputs
- Status: ✅ Completed in v0.1.20
- Issue: #562
- Comprehensive documentation for computer use workflows
- Environment naming conventions and automatic setup instructions
- Status: ✅ Completed in v0.1.20
- PR: #596
- Ctrl+C saves partial progress during multi-agent coordination
- Session restoration for incomplete turns with
--continue - Multi-turn mode returns to prompt instead of exiting
- Status: ✅ Completed in v0.1.21
- PR: #600
- Shadow agents for non-blocking broadcast responses
- Full context inheritance (conversation history + current turn)
- Parallel spawning with asyncio.gather()
- Status: ✅ Completed in v0.1.22
- PR: #607
- Automation-friendly Web UI view with status header and session polling
- LOG_DIR and STATUS path output for programmatic monitoring
- Session persistence API for completed sessions
- Status: ✅ Completed in v0.1.23
- PR: #608
- Flag-based cancellation handling in multi-turn mode
- Terminal state restoration after Rich display cancellation
- Cancelled turns build proper history entries with partial results
- Status: ✅ Completed in v0.1.23
- Commit: 34279c88
- SessionMountManager for pre-mounting session directories to Docker containers
- Eliminates container recreation between turns (sub-second vs 2-5 second transitions)
- Status: ✅ Completed in v0.1.23
- Commits: 028f591d, 477423a6
- New
/inspectcommand for reviewing agent outputs from any turn /inspect allto list all turns with summaries- Interactive menu for viewing agent outputs, final answers, and coordination logs
- Status: ✅ Completed in v0.1.23
- PR: #608
- New
run_async_safely()helper for nested event loop handling - Fixed mem0 adapter async lifecycle issues
- Status: ✅ Completed in v0.1.23
- Expanded token counting and cost calculation across multiple providers
- Real-time token usage for OpenRouter, xAI/Grok, Gemini, Claude Code backends
- Per-agent token breakdown with cost inspection command
- Status: ✅ Completed in v0.1.24
- PR: #584
- New backend for ByteDance's UI-TARS-1.5-7B model for GUI automation
- OpenAI-compatible API via HuggingFace Inference Endpoints
- Tool implementation with Docker and browser automation examples
- Status: ✅ Completed in v0.1.25
- PR: #629
- Framework for creating and iterating on reusable workflow plans
- Skills capture steps, Python scripts, and learnings through iteration
- Support for loading skills from previous sessions
- Status: ✅ Completed in v0.1.25
- PR: #589
- Adaptive layout management for different terminal sizes
- Enhanced dark/light themes with modals and panels
- Improved agent coordination visualization
- Status: ✅ Completed in v0.1.25
- PR: #634
- Test-time compute scaling via
response_depthparameter (low/medium/high) - Controls solution complexity in shadow agent broadcast responses
- Status: ✅ Completed in v0.1.26
- PR: #634
- Comprehensive Docker error detection with platform-specific resolution
- Distinguishes binary not installed, daemon not running, permission denied, images missing
- Status: ✅ Completed in v0.1.26
- PR: #634
- Guided first-run setup with SetupPage, ConfigEditorModal, CoordinationStep
- API key management endpoints and environment checks
- Status: ✅ Completed in v0.1.26
- Commits: 598a32f8, dc920078
- Native multimodal understanding for Gemini and OpenAI backends
- Image, audio, video understanding via
read_mediawith backend-native APIs - Status: ✅ Completed in v0.1.28
- Commit: dc920078
- Unified
generate_mediatool with provider selection - New
generation/module for OpenAI (DALL-E, Sora, TTS), Google (Imagen, Veo), OpenRouter - Status: ✅ Completed in v0.1.28
- Commit: 598a32f8
- Preview workspace artifacts directly in web interface
- Support for PDF, DOCX, PPTX, XLSX, images, HTML, SVG, Markdown, Mermaid
- Status: ✅ Completed in v0.1.28
- Commit: bc7881d2
- New
min_answers_before_votingorchestrator configuration option - Integrated into CLI quickstart wizard and Web UI CoordinationStep
- Status: ✅ Completed in v0.1.28
- Commit: c71094ac
- Parameter filtering for unsupported Azure parameters
- Fixed tool_choice handling, message validation, and response format extraction
- Status: ✅ Completed in v0.1.28
- Commit: 40acf82c
- Model list filters to only show models supporting tool calling
- Checks
supported_parametersfor "tools" capability - Status: ✅ Completed in v0.1.28
- PR: #690
- Spawn parallel child MassGen processes for independent task execution
- Process isolation with independent workspaces per subagent
- New
spawn_subagentstool with result aggregation and token tracking - Status: ✅ Completed in v0.1.29
- PR: #801
- Linear: MAS-214
- Background subagent execution with
async_=Trueparameter - Poll for subagent completion and retrieve results
- Status: ✅ Completed in v0.1.41
- PR: #806
- Comprehensive visual overhaul with modern "Conversational AI" aesthetic
- Rounded corners, desaturated colors, edge-to-edge layouts, polished modals
- Human Input Queue for injecting messages to agents mid-stream
- Status: ✅ Completed in v0.1.42
- PR: #804
- Fixed coordination issues for single-agent AG2 setups
- Single agent can now vote for itself after producing its first answer
- Status: ✅ Completed in v0.1.42
- PR: #815
- Consecutive MCP tool calls grouped into collapsible tree views
- Shows 3 items by default with "+N more" indicator, click to expand
- Respects Timeline Chronology Rule: tools only batch when consecutive
- New
ToolBatchCardwidget andToolBatchTrackerstate machine - Status: ✅ Completed in v0.1.43
- PR: #812
- New documentation page with visual SVG comparisons (MassGen vs single-agent)
- Video tutorials section with Getting Started and Development videos
- Iterative refinement examples showing multi-round improvements
- Status: ✅ Completed in v0.1.43
- PR: #815
- Final presentation display fix (reasoning vs answer separation)
- Plan mode enhancements with PlanOptionsPopover
- Quoted path support for paths with spaces
- Various bug fixes (status bar, scrolling, mode buttons)
- Status: ✅ Completed in v0.1.43
- Commit: 30aca047
- Enhanced
get_tool_metrics_summary()with per-call averages - Output distribution stats (min/max/median) for bottleneck analysis
- Status: ✅ Completed in v0.1.29
- Commit: 78177372
- New mode for assigning different system messages per agent in quickstart
- Options: "Skip", "Same for all", "Different per agent"
- Status: ✅ Completed in v0.1.29
- PR: #685
- Fixed duplicate item errors when using
previous_response_id - Preserved function call ID for proper reasoning item pairing
- Status: ✅ Completed in v0.1.29
- PR: #693
- Native web search integration via OpenRouter's plugins array
- Maps
enable_web_searchto{"id": "web"}plugin format - Status: ✅ Completed in v0.1.30
- PR: #699
- Two diversity modes:
perspective(values/priorities) andimplementation(solution types) - Phase-based adaptation with softened personas for convergence
- Status: ✅ Completed in v0.1.30
- PR: #698
- Support both Azure-specific and OpenAI-compatible endpoints
- Environment variable expansion (
${VAR}) in config files - Status: ✅ Completed in v0.1.30
- PR: #688
- Comprehensive test fixes with xfail registry
- Fixed persistent memory retrieval and backend tool registration
- Status: ✅ Completed in v0.1.30
- PR: #708
- Comprehensive structured logging and tracing via Logfire (Pydantic team)
- Automatic LLM instrumentation for OpenAI, Anthropic Claude, and Google Gemini backends
- Tool execution tracing with timing metrics and agent coordination observability
- Enable via
--logfireCLI flag orMASSGEN_LOGFIRE_ENABLED=trueenvironment variable - Status: ✅ Completed in v0.1.31
- PR: #705
- Tool calls accumulated and yielded as structured
tool_callschunks - Fixed streaming behavior for Azure OpenAI tool calling
- Status: ✅ Completed in v0.1.31
- PR: #704
- Fixed logging output for web search operations
- Status: ✅ Completed in v0.1.31
- PR: #715
- Enhanced
massgen exportwith turn range selection and workspace options - Multi-turn file collection preserving turn/attempt structure
- Status: ✅ Completed in v0.1.32
- PR: #711
- Moved Logfire from required to optional
[observability]extra - Helpful error message when
--logfireused without Logfire installed - Status: ✅ Completed in v0.1.32
- Commit: a808d730
- Separate log files per orchestration restart attempt
- Handler reconfiguration via
set_log_attempt()function - Status: ✅ Completed in v0.1.32
- Commit: 7c7a32e3
- Automatic DOCX/PPTX/XLSX to PDF conversion for session sharing
- Docker + LibreOffice headless conversion with image fallback
- Status: ✅ Completed in v0.1.32
- Issue: #617
- PR: #697
- Automatic context compression when context length errors are detected
- Streaming buffer system for compression recovery
- Status: ✅ Completed in v0.1.33
- Issue: #645
- PR: #669
- Native model listing APIs for providers (Groq, Together, and others)
- Research third-party wrappers; document manual update processes
- Status: ✅ Completed in v0.1.34
- Issue: #612
- Ensure code-based tools work properly in Web UI
- Integration with new Web UI features
- Status: ✅ Completed in v0.1.34
- Issue: #686
- Verify and improve PPTX generation capabilities
- Test slide generation workflows and output quality
- Status: ✅ Completed in v0.1.34
- Issue: #647
- Restrict OpenRouter model list to only show models that support tool use
- Filter based on
supported_parameterscapability checks - Status: ✅ Completed in v0.1.34
- Issue: #739
- Use OpenAI's native
/compactendpoint instead of custom summarization - Leverage API-level context compression for better efficiency
- Status: ✅ Completed in v0.1.48
- Issue: #683
- PR: #761
- Enhanced logging for better debugging and observability via Logfire workflow attributes
- New
massgen logs analyzeCLI command with self-analysis mode - Status: ✅ Completed in v0.1.35
- Issue: #766
- Allow users to choose which model to use for
massgen logs analyzeself-analysis mode - Configurable model selection for different analysis requirements
- Status: ✅ Completed in v0.1.50
- Issue: #745
- PR: #769
- Extensible hook system for agent lifecycle events
- Enable custom actions at key orchestration points
- Status: ✅ Completed in v0.1.36
- PR: #794
- Complete plan-then-execute workflow separating "what to build" from "how to build it"
--plan-and-executeand--execute-planCLI options- Task verification workflow with
verifiedstatus and verification groups - Plan storage system in
.massgen/plans/with frozen snapshots - Status: ✅ Completed in v0.1.39
- Issue: #722
- Enhanced log sharing workflows
- Improved analysis tools and visualizations
- Target: v0.1.50+
- Issue: #773
- Plugin/extension for spawning MassGen agents directly from Claude Code interface
- Seamless integration with Claude Code workflows
- Target: v0.1.50+
- Issue: #809
- Support targeted queries to specific agents via subagent spawning
- Three modes: broadcast to all, selective broadcast, targeted ask
- Pass full
_streaming_bufferto shadow agents for improved context - Target: v0.1.52
- PR: #858
- New coordination mode that decomposes tasks into subtasks assigned to individual agents
- Task decomposer with presenter agent role for final synthesis
- TUI mode bar toggle, subtask assignment display, and generation modals
- Status: ✅ Completed in v0.1.48
- PR: #857
- Linear: MAS-272
- Git worktree-based isolation for agent file writes with review workflow
- Review modal for approving/rejecting changes before applying to original paths
- Shadow repo support for non-git directories
- Status: ✅ Completed in v0.1.48
- PR: #857
- Linear: MAS-267
- Docker setup step in quickstart wizard with animated pull progress
- Real-time stdout streaming for image downloads
- Status: ✅ Completed in v0.1.48
- PR: #869
- Prevents fast agents from dominating coordination rounds
- Configurable
fairness_lead_cap_answersandmax_midstream_injections_per_round - Status: ✅ Completed in v0.1.49
- PR: #869
- Persona easing toggle now accessible from TUI mode bar
- Status: ✅ Completed in v0.1.49
- PR: #869
- New
checklist_tools_server.pyMCP server for structured quality evaluation - Binary pass/fail scoring for objective quality assessment
- Status: ✅ Completed in v0.1.49
- PR: #869
- New "Analyzing" state in TUI mode bar for in-app run analysis
- Configurable analysis profiles with log directory and turn selection
- Status: ✅ Completed in v0.1.49
- PR: #869
- CI/CD workflow (
tests.yml), SVG snapshot baselines, 16+ new test files - Testing strategy specification and visual regression testing
- Status: ✅ Completed in v0.1.49
- PR: #861
- Fixed "[No response generated]" errors from incorrect chunk type comparison
- Status: ✅ Completed in v0.1.49
- PR: #877
- Plans divided into chunks executed one at a time with progress checkpoints
- Chunk browsing in TUI, frozen plan snapshots,
target_steps/target_chunksparameters - Iterative planning review modal with Continue/Edit/Finalize options
- Status: ✅ Completed in v0.1.50
- PR: #878
- New lifecycle modes (
create_or_update,create_new,consolidate) - Skill organizer for merging overlapping skills,
SKILL_REGISTRY.mdrouting guide - Previous-session skill loading with
load_previous_session_skillsconfig - Local Skills MCP for Docker/local execution contexts
- Status: ✅ Completed in v0.1.50
- PR: #877
- Branch accumulation across rounds, cross-agent diff visibility via
generate_branch_summaries() - Orphan worktree cleanup
- Status: ✅ Completed in v0.1.50
- PR: #877
- Vertical/horizontal adaptive layout with compact labels on narrow terminals
- TUI homescreen and theming improvements
- Status: ✅ Completed in v0.1.50
- PR: #955
- File-based delegation protocol for container-to-host subagent spawning
- SubagentLaunchWatcher with atomic JSON request/response exchange
- Workspace path validation against allowlist for security
- Status: ✅ Completed in v0.1.57
- PR: #955
- Issue: #874
- Substantiveness tracking (transformative/structural/incremental) for convergence decisions
- Builder subagent type for large artifact generation with fresh context
- Diagnostic report gating and per-agent checklist scoring
- Claude Code reasoning parameters for updated SDK
- Status: ✅ Completed in v0.1.57
- Issues: #942, #951
- ElevenLabs TTS & STT integration for high-quality voice synthesis and transcription
- Nano Banana 2 as default image generation model
- Grok Imagine image/video generation (#958) via xAI API
- Media generation skills (image, video, audio) and multi-turn image editing with continuation IDs
- Status: ✅ Completed in v0.1.58
- PR: #962
- First-class provider integration for NVIDIA Inference Microservices
- Support for NVIDIA-hosted models via NIM API
- Status: ✅ Completed in v0.1.58
- PR: #964
- New
quality_rethinkingsubagent type for targeted per-element craft improvements - Explicit improve/preserve listings in checklists with better label refresh ordering
- Subagent hardening: better '@' parsing, error handling for multiple submit_checklist calls
- Status: ✅ Completed in v0.1.58
- PR: #251
- Enhanced file operations and workspace management
- Shipping: Continuous improvement
Advanced Orchestration Patterns
- Advanced task decomposition strategies and parallel coordination
- Assignment of agents to specific tasks and increasing of diversity
- Improvement in voting as tasks continue
Self-Learning & Adaptation
- Agents learn from past executions to improve future performance
- Automatic skill acquisition from successful task completions
- Feedback loops for continuous improvement
- Memory systems for retaining learned patterns across sessions
Visual Workflow Designer
- No-code multi-agent workflow creation
- Drag-and-drop agent configuration
- Real-time testing and debugging
Enterprise Features
- Role-based access control (RBAC)
- Audit logs and compliance reporting
- Multi-user collaboration
- Advanced analytics and cost tracking
Additional Framework Integrations
- LangChain agent support
- CrewAI compatibility
- Custom framework adapters
Complete Multimodal Pipeline
- End-to-end audio processing (speech-to-text, text-to-speech)
- Video understanding and generation
- Advanced document processing (PDF, Word, Excel)
Track development progress:
Interested in contributing? You have two options:
Option 1: Join an Existing Track
- See Contributors & Contact table above for active tracks
- Contact the track owner via Discord to discuss your ideas
- Follow CONTRIBUTING.md for development process
Option 2: Create Your Own Track
- Have a significant feature idea? Propose a new track!
- Reach out via the #massgen channel on Discord
- Work with the MassGen dev team to integrate your track into the roadmap
- Become a track owner and guide other contributors
See CONTRIBUTING.md for development setup, code standards, testing, and documentation requirements.
- CHANGELOG.md - Complete release history
- CONTRIBUTING.md - Contribution guidelines
- Documentation - Full user guide
This roadmap is community-driven. Releases ship on Mondays, Wednesdays, Fridays @ 9am PT. Timelines may shift based on priorities and feedback. Open an issue to suggest changes!
Last Updated: March 11, 2026 Maintained By: MassGen Team