You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
New general-purpose MassGen Skill with 4 modes (general, evaluate, plan, spec) for use from Claude Code and other AI agents. Session viewer for real-time observation. Backend improvements for Claude Code, Codex, and Copilot. Headless and web quickstart modes.
New round evaluator subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment. Major orchestrator refactoring with improved evaluation prompts, task plan injection, and subagent fixes.
14
17
@@ -18,11 +21,33 @@ Rewritten read_media with clearer schema and MediaCallLedgerHook for media call
Planning improvements with auto-added improvements to task plan and plan review enhancements. Checklist and evaluation enhancements with better eval gen config and Gemini tool name normalization. Subagent behavior adjustments and media generation fixes.
Comprehensive multimodal revamp with ElevenLabs TTS/STT, Nano Banana 2 image generation, and Grok multimedia. Nvidia NIM backend for NVIDIA Inference Microservices. Quality rethinking subagent for per-element craft improvements. Smarter checklists with improve/preserve listings. Logging architecture refactor and CLI mode flags.
23
-
24
24
---
25
25
26
+
## [0.1.62] - 2026-03-11
27
+
28
+
### Added
29
+
-**MassGen Skill** ([#992](https://github.com/massgen/MassGen/pull/992)): New general-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents
30
+
-**Session Viewer** ([#992](https://github.com/massgen/MassGen/pull/992)): New `massgen viewer` command for real-time observation of automation sessions with interactive session picker and web mode
31
+
-**Headless Quickstart** ([#992](https://github.com/massgen/MassGen/pull/992)): Non-interactive setup via `--quickstart --headless` for CI/CD integration
32
+
-**Web Quickstart** ([#992](https://github.com/massgen/MassGen/pull/992)): Browser-based setup flow via `--web-quickstart`
33
+
-**Skill Auto-Sync** ([#992](https://github.com/massgen/MassGen/pull/992)): GitHub Actions workflow to auto-sync MassGen Skill to separate repository for easy installation
34
+
35
+
### Changed
36
+
-**Claude Code Backend** ([#992](https://github.com/massgen/MassGen/pull/992)): Background task execution support and SDK MCP integration
37
+
-**Codex Backend** ([#992](https://github.com/massgen/MassGen/pull/992)): Native filesystem access, JSONL event streaming, and MCP tool support
38
+
-**Copilot Model Discovery** ([#992](https://github.com/massgen/MassGen/pull/992)): Runtime model fetching with metadata caching
39
+
-**Planning & Evaluation** ([#992](https://github.com/massgen/MassGen/pull/992)): Better planning prompts with thoroughness support, removed should/could criteria to reduce output similarity
Copy file name to clipboardExpand all lines: CONTRIBUTING.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -359,7 +359,7 @@ Create a `.env` file in the `massgen` directory as described in [README](README.
359
359
360
360
## 🔧 Development Workflow
361
361
362
-
> **Important**: Our next version is v0.1.62. If you want to contribute, please contribute to the `dev/v0.1.62` branch (or `main` if dev/v0.1.62 doesn't exist yet).
362
+
> **Important**: Our next version is v0.1.63. If you want to contribute, please contribute to the `dev/v0.1.63` branch (or `main` if dev/v0.1.63 doesn't exist yet).
363
363
364
364
### 1. Create Feature Branch
365
365
@@ -368,7 +368,7 @@ Create a `.env` file in the `massgen` directory as described in [README](README.
368
368
git fetch upstream
369
369
370
370
# Create feature branch from dev/v0.1.60 (or main if dev branch doesn't exist yet)
Copy file name to clipboardExpand all lines: README.md
+38-33Lines changed: 38 additions & 33 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -45,7 +45,7 @@ MassGen is a cutting-edge multi-agent framework that coordinates AI agents to so
45
45
This project started with the "threads of thought" and "iterative refinement" ideas presented in [The Myth of Reasoning](https://docs.ag2.ai/latest/docs/blog/2025/04/16/Reasoning/), and extends the classic "multi-agent conversation" idea in [AG2](https://github.com/ag2ai/ag2). Here is a [video recording](https://www.youtube.com/watch?v=xM2Uguw1UsQ) of the background context introduction presented at the Berkeley Agentic AI Summit 2025.
46
46
47
47
<palign="center">
48
-
<b>🤖 For LLM Agents:</b> <ahref="AI_USAGE.md">AI_USAGE.md</a> - Complete automation guide to run MassGen inside an LLM
48
+
<b>🧩 Use MassGen as a Skill:</b> <code>npx skills add massgen/skills --all</code> — then type invoke the skill in Claude Code, Cursor, Copilot, or 40+ other agents. <ahref="https://github.com/massgen/skills">Learn more →</a>
49
49
</p>
50
50
51
51
<palign="center">
@@ -69,7 +69,7 @@ This project started with the "threads of thought" and "iterative refinement" id
69
69
<detailsopen>
70
70
<summary><h3>🆕 Latest Features</h3></summary>
71
71
72
-
-[v0.1.61 Features](#-latest-features-v0161)
72
+
-[v0.1.62 Features](#-latest-features-v0162)
73
73
</details>
74
74
75
75
<detailsopen>
@@ -122,15 +122,15 @@ This project started with the "threads of thought" and "iterative refinement" id
@@ -155,22 +155,24 @@ This project started with the "threads of thought" and "iterative refinement" id
155
155
156
156
---
157
157
158
-
## 🆕 Latest Features (v0.1.61)
158
+
## 🆕 Latest Features (v0.1.62)
159
159
160
-
**🎉 Released: March 9, 2026**
160
+
**🎉 Released: March 11, 2026**
161
161
162
-
**What's New in v0.1.61:**
163
-
-**🔄 Round Evaluator Paradigm** - New `round_evaluator` subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment.
164
-
-**📝 Evaluation Improvements** - Improved evaluation prompts with task plan injection for context-aware assessment.
165
-
-**🔧 Orchestrator Refactoring** - Major orchestrator refactoring (+1,189 lines) to support the round evaluation workflow.
162
+
**What's New in v0.1.62:**
163
+
-**🧩 MassGen Skill** - New general-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents.
164
+
-**👁️ Session Viewer** - New `massgen viewer` command for real-time observation of automation sessions with interactive picker and web mode.
165
+
-**⚡ Backend & Quickstart** - Claude Code/Codex/Copilot backend improvements, headless and web quickstart modes.
166
166
167
-
**Try v0.1.61 Features:**
167
+
**Try v0.1.62 Features:**
168
168
```bash
169
-
# Install or upgrade
170
-
pip install --upgrade massgen
169
+
# Install the MassGen Skill for your AI agent
170
+
npx skills add massgen/skills --all
171
+
# Then in Claude Code, Cursor, Copilot, etc.:
172
+
# /massgen "Your complex task"
171
173
172
-
# Try the round evaluator paradigm
173
-
uv run massgen --config @examples/features/round_evaluator_example.yaml "Create a website for an AI startup with polished visuals and interactive elements"
174
+
# Try the Session Viewer
175
+
uv run massgen viewer --pick
174
176
```
175
177
176
178
→ [See full release history and examples](massgen/configs/README.md#release-history--examples)
@@ -1242,25 +1244,27 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
1242
1244
1243
1245
⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.
1244
1246
1245
-
### Recent Achievements (v0.1.61)
1247
+
### Recent Achievements (v0.1.62)
1246
1248
1247
-
**🎉 Released: March 9, 2026**
1249
+
**🎉 Released: March 11, 2026**
1248
1250
1249
-
#### Round Evaluator Paradigm
1250
-
-**Round Evaluator Subagent Type** ([#986](https://github.com/massgen/MassGen/pull/986)): New `round_evaluator` subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment
1251
-
-**Orchestrator Refactoring**: Major orchestrator refactoring (+1,189 lines) to support the round evaluation workflow
-**General-Purpose Skill** ([#992](https://github.com/massgen/MassGen/pull/992)): New multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents
1253
+
-**Auto-Sync**: GitHub Actions workflow to auto-sync skill to separate repository for easy installation
1254
+
-**Reference Docs**: Comprehensive workflow guides and prompt templates for each mode
1253
1255
1254
-
#### Evaluation Improvements
1255
-
-**Improved Evaluation Prompts** ([#986](https://github.com/massgen/MassGen/pull/986)): Clearer, more actionable feedback with task plan injection
1256
-
-**Simplified Config**: Simplified config handling for evaluation parameters
1257
-
-**SUBAGENT.md Generality**: Improved SUBAGENT.md for broader subagent compatibility
1256
+
#### Session Viewer
1257
+
-**Viewer Command** ([#992](https://github.com/massgen/MassGen/pull/992)): New `massgen viewer` for real-time observation of automation sessions
1258
+
-**Interactive Picker**: `--pick` flag for session selection, `--web` for browser-based viewing
1258
1259
1259
-
#### Fixes
1260
-
-**Session Resumption** ([#986](https://github.com/massgen/MassGen/pull/986)): Fixed resumption from already-resumed logs
✅ **Round Evaluator Paradigm (v0.1.61)**: New round evaluator subagent type that automatically spawns evaluator subagents after each new answer to provide detailed feedback as input to the next round. Major orchestrator refactoring with improved evaluation prompts, task plan injection, and subagent fixes.
1264
1268
1265
1269
✅ **Multimodal Tools, Subagent Enhancements & GPT-5.4 (v0.1.60)**: Rewritten read_media with clearer schema and MediaCallLedgerHook. Subagent enhancements with inherit_spawning_agent_backend, final_answer_strategy, per-agent subagent_agents. GPT-5.4 as default OpenAI flagship. Decomp mode cooperates with checklist workflow. Codex prompt caching fix.
1266
1270
@@ -1527,12 +1531,13 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
1527
1531
1528
1532
We welcome community contributions to achieve these goals.
1529
1533
1530
-
### v0.1.62 Roadmap
1534
+
### v0.1.63 Roadmap
1531
1535
1532
-
Version 0.1.62 focuses on improving skill use and exploration:
1536
+
Version 0.1.63 focuses on adding a Gemini CLI backend and image/video editing capabilities:
1533
1537
1534
1538
#### Planned Features
1535
-
-**Improve Skill Use and Exploration** ([#873](https://github.com/massgen/MassGen/issues/873)): Local skill execution, skill registry with hierarchical organization, and skill consolidation workflow
1539
+
-**Gemini CLI Backend** ([#952](https://github.com/massgen/MassGen/issues/952)): Gemini CLI as a first-class backend option
1540
+
-**Image/Video Edit Capabilities** ([#959](https://github.com/massgen/MassGen/issues/959)): Check and support image/video editing capabilities across providers
0 commit comments