Skip to content

Commit 77f907f

Browse files
authored
Merge pull request #993 from massgen/docs_for_v0.1.62
docs: docs for v0.1.62
2 parents 59927a6 + a6d9848 commit 77f907f

File tree

13 files changed

+312
-192
lines changed

13 files changed

+312
-192
lines changed

CHANGELOG.md

Lines changed: 28 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,9 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
99

1010
## Recent Releases
1111

12+
**v0.1.62 (March 11, 2026)** - MassGen Skill & Viewer
13+
New general-purpose MassGen Skill with 4 modes (general, evaluate, plan, spec) for use from Claude Code and other AI agents. Session viewer for real-time observation. Backend improvements for Claude Code, Codex, and Copilot. Headless and web quickstart modes.
14+
1215
**v0.1.61 (March 9, 2026)** - Round Evaluator Paradigm
1316
New round evaluator subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment. Major orchestrator refactoring with improved evaluation prompts, task plan injection, and subagent fixes.
1417

@@ -18,11 +21,33 @@ Rewritten read_media with clearer schema and MediaCallLedgerHook for media call
1821
**v0.1.59 (March 4, 2026)** - Quality Round Improvements
1922
Planning improvements with auto-added improvements to task plan and plan review enhancements. Checklist and evaluation enhancements with better eval gen config and Gemini tool name normalization. Subagent behavior adjustments and media generation fixes.
2023

21-
**v0.1.58 (March 2, 2026)** - Multimodal Revamp, Nvidia NIM Backend & Quality Rethinking
22-
Comprehensive multimodal revamp with ElevenLabs TTS/STT, Nano Banana 2 image generation, and Grok multimedia. Nvidia NIM backend for NVIDIA Inference Microservices. Quality rethinking subagent for per-element craft improvements. Smarter checklists with improve/preserve listings. Logging architecture refactor and CLI mode flags.
23-
2424
---
2525

26+
## [0.1.62] - 2026-03-11
27+
28+
### Added
29+
- **MassGen Skill** ([#992](https://github.com/massgen/MassGen/pull/992)): New general-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents
30+
- **Session Viewer** ([#992](https://github.com/massgen/MassGen/pull/992)): New `massgen viewer` command for real-time observation of automation sessions with interactive session picker and web mode
31+
- **Headless Quickstart** ([#992](https://github.com/massgen/MassGen/pull/992)): Non-interactive setup via `--quickstart --headless` for CI/CD integration
32+
- **Web Quickstart** ([#992](https://github.com/massgen/MassGen/pull/992)): Browser-based setup flow via `--web-quickstart`
33+
- **Skill Auto-Sync** ([#992](https://github.com/massgen/MassGen/pull/992)): GitHub Actions workflow to auto-sync MassGen Skill to separate repository for easy installation
34+
35+
### Changed
36+
- **Claude Code Backend** ([#992](https://github.com/massgen/MassGen/pull/992)): Background task execution support and SDK MCP integration
37+
- **Codex Backend** ([#992](https://github.com/massgen/MassGen/pull/992)): Native filesystem access, JSONL event streaming, and MCP tool support
38+
- **Copilot Model Discovery** ([#992](https://github.com/massgen/MassGen/pull/992)): Runtime model fetching with metadata caching
39+
- **Planning & Evaluation** ([#992](https://github.com/massgen/MassGen/pull/992)): Better planning prompts with thoroughness support, removed should/could criteria to reduce output similarity
40+
- **CLI Enhancements** ([#992](https://github.com/massgen/MassGen/pull/992)): `--print-backends` table, viewer subcommand, multi-agent quickstart via `--quickstart-agent`
41+
42+
### Fixed
43+
- **Skill Viewer** ([#992](https://github.com/massgen/MassGen/pull/992)): Fixed skill viewer display and added convenience shell script
44+
- **Correctness Prompts** ([#992](https://github.com/massgen/MassGen/pull/992)): Updated correctness prompts for improved accuracy
45+
46+
### Technical Details
47+
- **Major Focus**: MassGen Skill & Viewer — general-purpose skill, session observation, backend improvements
48+
- **PRs Merged**: [#992](https://github.com/massgen/MassGen/pull/992) (evaluator-skill)
49+
- **Contributors**: @ncrispino (6 commits), @HenryQi (2 commits) and the MassGen team
50+
2651
## [0.1.61] - 2026-03-09
2752

2853
### Added

CONTRIBUTING.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -359,7 +359,7 @@ Create a `.env` file in the `massgen` directory as described in [README](README.
359359

360360
## 🔧 Development Workflow
361361

362-
> **Important**: Our next version is v0.1.62. If you want to contribute, please contribute to the `dev/v0.1.62` branch (or `main` if dev/v0.1.62 doesn't exist yet).
362+
> **Important**: Our next version is v0.1.63. If you want to contribute, please contribute to the `dev/v0.1.63` branch (or `main` if dev/v0.1.63 doesn't exist yet).
363363
364364
### 1. Create Feature Branch
365365

@@ -368,7 +368,7 @@ Create a `.env` file in the `massgen` directory as described in [README](README.
368368
git fetch upstream
369369

370370
# Create feature branch from dev/v0.1.60 (or main if dev branch doesn't exist yet)
371-
git checkout -b feature/your-feature-name upstream/dev/v0.1.62
371+
git checkout -b feature/your-feature-name upstream/dev/v0.1.63
372372
```
373373

374374
### 2. Make Your Changes
@@ -507,7 +507,7 @@ git push origin feature/your-feature-name
507507
```
508508

509509
Then create a pull request on GitHub:
510-
- Base branch: `dev/v0.1.62` (or `main` if dev branch doesn't exist yet)
510+
- Base branch: `dev/v0.1.63` (or `main` if dev branch doesn't exist yet)
511511
- Compare branch: `feature/your-feature-name`
512512
- Add clear description of changes
513513
- Link any related issues
@@ -617,7 +617,7 @@ Have a significant feature idea not covered by existing tracks?
617617
- [ ] Tests pass locally
618618
- [ ] Documentation is updated if needed
619619
- [ ] Commit messages follow convention
620-
- [ ] PR targets `dev/v0.1.62` branch (or `main` if dev branch doesn't exist yet)
620+
- [ ] PR targets `dev/v0.1.63` branch (or `main` if dev branch doesn't exist yet)
621621

622622
### PR Description Should Include
623623

README.md

Lines changed: 38 additions & 33 deletions
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@ MassGen is a cutting-edge multi-agent framework that coordinates AI agents to so
4545
This project started with the "threads of thought" and "iterative refinement" ideas presented in [The Myth of Reasoning](https://docs.ag2.ai/latest/docs/blog/2025/04/16/Reasoning/), and extends the classic "multi-agent conversation" idea in [AG2](https://github.com/ag2ai/ag2). Here is a [video recording](https://www.youtube.com/watch?v=xM2Uguw1UsQ) of the background context introduction presented at the Berkeley Agentic AI Summit 2025.
4646

4747
<p align="center">
48-
<b>🤖 For LLM Agents:</b> <a href="AI_USAGE.md">AI_USAGE.md</a> - Complete automation guide to run MassGen inside an LLM
48+
<b>🧩 Use MassGen as a Skill:</b> <code>npx skills add massgen/skills --all</code> — then type invoke the skill in Claude Code, Cursor, Copilot, or 40+ other agents. <a href="https://github.com/massgen/skills">Learn more →</a>
4949
</p>
5050

5151
<p align="center">
@@ -69,7 +69,7 @@ This project started with the "threads of thought" and "iterative refinement" id
6969
<details open>
7070
<summary><h3>🆕 Latest Features</h3></summary>
7171

72-
- [v0.1.61 Features](#-latest-features-v0161)
72+
- [v0.1.62 Features](#-latest-features-v0162)
7373
</details>
7474

7575
<details open>
@@ -122,15 +122,15 @@ This project started with the "threads of thought" and "iterative refinement" id
122122
<details open>
123123
<summary><h3>🗺️ Roadmap</h3></summary>
124124

125-
- [Recent Achievements (v0.1.61)](#recent-achievements-v0161)
126-
- [Previous Achievements (v0.0.3 - v0.1.60)](#previous-achievements-v003---v0160)
125+
- [Recent Achievements (v0.1.62)](#recent-achievements-v0162)
126+
- [Previous Achievements (v0.0.3 - v0.1.61)](#previous-achievements-v003---v0161)
127127
- [Key Future Enhancements](#key-future-enhancements)
128128
- Bug Fixes & Backend Improvements
129129
- Advanced Agent Collaboration
130130
- Expanded Model, Tool & Agent Integrations
131131
- Improved Performance & Scalability
132132
- Enhanced Developer Experience
133-
- [v0.1.62 Roadmap](#v0162-roadmap)
133+
- [v0.1.63 Roadmap](#v0163-roadmap)
134134
</details>
135135

136136
<details open>
@@ -155,22 +155,24 @@ This project started with the "threads of thought" and "iterative refinement" id
155155

156156
---
157157

158-
## 🆕 Latest Features (v0.1.61)
158+
## 🆕 Latest Features (v0.1.62)
159159

160-
**🎉 Released: March 9, 2026**
160+
**🎉 Released: March 11, 2026**
161161

162-
**What's New in v0.1.61:**
163-
- **🔄 Round Evaluator Paradigm** - New `round_evaluator` subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment.
164-
- **📝 Evaluation Improvements** - Improved evaluation prompts with task plan injection for context-aware assessment.
165-
- **🔧 Orchestrator Refactoring** - Major orchestrator refactoring (+1,189 lines) to support the round evaluation workflow.
162+
**What's New in v0.1.62:**
163+
- **🧩 MassGen Skill** - New general-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents.
164+
- **👁️ Session Viewer** - New `massgen viewer` command for real-time observation of automation sessions with interactive picker and web mode.
165+
- **⚡ Backend & Quickstart** - Claude Code/Codex/Copilot backend improvements, headless and web quickstart modes.
166166

167-
**Try v0.1.61 Features:**
167+
**Try v0.1.62 Features:**
168168
```bash
169-
# Install or upgrade
170-
pip install --upgrade massgen
169+
# Install the MassGen Skill for your AI agent
170+
npx skills add massgen/skills --all
171+
# Then in Claude Code, Cursor, Copilot, etc.:
172+
# /massgen "Your complex task"
171173

172-
# Try the round evaluator paradigm
173-
uv run massgen --config @examples/features/round_evaluator_example.yaml "Create a website for an AI startup with polished visuals and interactive elements"
174+
# Try the Session Viewer
175+
uv run massgen viewer --pick
174176
```
175177

176178
[See full release history and examples](massgen/configs/README.md#release-history--examples)
@@ -1242,25 +1244,27 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
12421244

12431245
⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.
12441246

1245-
### Recent Achievements (v0.1.61)
1247+
### Recent Achievements (v0.1.62)
12461248

1247-
**🎉 Released: March 9, 2026**
1249+
**🎉 Released: March 11, 2026**
12481250

1249-
#### Round Evaluator Paradigm
1250-
- **Round Evaluator Subagent Type** ([#986](https://github.com/massgen/MassGen/pull/986)): New `round_evaluator` subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment
1251-
- **Orchestrator Refactoring**: Major orchestrator refactoring (+1,189 lines) to support the round evaluation workflow
1252-
- **New Config**: `round_evaluator_example.yaml` for easy adoption
1251+
#### MassGen Skill
1252+
- **General-Purpose Skill** ([#992](https://github.com/massgen/MassGen/pull/992)): New multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents
1253+
- **Auto-Sync**: GitHub Actions workflow to auto-sync skill to separate repository for easy installation
1254+
- **Reference Docs**: Comprehensive workflow guides and prompt templates for each mode
12531255

1254-
#### Evaluation Improvements
1255-
- **Improved Evaluation Prompts** ([#986](https://github.com/massgen/MassGen/pull/986)): Clearer, more actionable feedback with task plan injection
1256-
- **Simplified Config**: Simplified config handling for evaluation parameters
1257-
- **SUBAGENT.md Generality**: Improved SUBAGENT.md for broader subagent compatibility
1256+
#### Session Viewer
1257+
- **Viewer Command** ([#992](https://github.com/massgen/MassGen/pull/992)): New `massgen viewer` for real-time observation of automation sessions
1258+
- **Interactive Picker**: `--pick` flag for session selection, `--web` for browser-based viewing
12581259

1259-
#### Fixes
1260-
- **Session Resumption** ([#986](https://github.com/massgen/MassGen/pull/986)): Fixed resumption from already-resumed logs
1261-
- **Round Evaluation Prompts**: Improved round evaluation prompt clarity
1260+
#### Backend & Quickstart
1261+
- **Backend Improvements** ([#992](https://github.com/massgen/MassGen/pull/992)): Claude Code background task execution, Codex native filesystem and MCP support, Copilot runtime model discovery
1262+
- **Quickstart Modes**: Headless quickstart (`--quickstart --headless`) for CI/CD, web quickstart (`--web-quickstart`) for browser-based setup
1263+
- **Evaluation & Planning**: Better planning prompts with thoroughness support, removed should/could criteria
12621264

1263-
### Previous Achievements (v0.0.3 - v0.1.60)
1265+
### Previous Achievements (v0.0.3 - v0.1.61)
1266+
1267+
**Round Evaluator Paradigm (v0.1.61)**: New round evaluator subagent type that automatically spawns evaluator subagents after each new answer to provide detailed feedback as input to the next round. Major orchestrator refactoring with improved evaluation prompts, task plan injection, and subagent fixes.
12641268

12651269
**Multimodal Tools, Subagent Enhancements & GPT-5.4 (v0.1.60)**: Rewritten read_media with clearer schema and MediaCallLedgerHook. Subagent enhancements with inherit_spawning_agent_backend, final_answer_strategy, per-agent subagent_agents. GPT-5.4 as default OpenAI flagship. Decomp mode cooperates with checklist workflow. Codex prompt caching fix.
12661270

@@ -1527,12 +1531,13 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
15271531

15281532
We welcome community contributions to achieve these goals.
15291533

1530-
### v0.1.62 Roadmap
1534+
### v0.1.63 Roadmap
15311535

1532-
Version 0.1.62 focuses on improving skill use and exploration:
1536+
Version 0.1.63 focuses on adding a Gemini CLI backend and image/video editing capabilities:
15331537

15341538
#### Planned Features
1535-
- **Improve Skill Use and Exploration** ([#873](https://github.com/massgen/MassGen/issues/873)): Local skill execution, skill registry with hierarchical organization, and skill consolidation workflow
1539+
- **Gemini CLI Backend** ([#952](https://github.com/massgen/MassGen/issues/952)): Gemini CLI as a first-class backend option
1540+
- **Image/Video Edit Capabilities** ([#959](https://github.com/massgen/MassGen/issues/959)): Check and support image/video editing capabilities across providers
15361541

15371542
---
15381543

0 commit comments

Comments
 (0)