You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
New round evaluator subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment. Major orchestrator refactoring with improved evaluation prompts, task plan injection, and subagent fixes.
File-based delegation protocol for container-to-host subagent spawning. New builder subagent type for large artifact generation with fresh context. Claude Code reasoning parameters for updated SDK. Smarter convergence with substantiveness tracking and diagnostic report gating.
23
-
24
24
---
25
25
26
+
## [0.1.61] - 2026-03-09
27
+
28
+
### Added
29
+
-**Round Evaluator Subagent Type** ([#986](https://github.com/massgen/MassGen/pull/986)): New `round_evaluator` subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment
30
+
-**`round_evaluator_example.yaml` Config** ([#986](https://github.com/massgen/MassGen/pull/986)): New example config for the round evaluator paradigm
31
+
32
+
### Changed
33
+
-**Orchestrator Refactoring** ([#986](https://github.com/massgen/MassGen/pull/986)): Major orchestrator refactoring (+1,189 lines) to support the round evaluation workflow
34
+
-**Evaluation Prompts** ([#986](https://github.com/massgen/MassGen/pull/986)): Improved evaluation prompts for clearer, more actionable feedback with task plan injection
35
+
-**Simplified Config** ([#986](https://github.com/massgen/MassGen/pull/986)): Simplified config handling for evaluation parameters
36
+
-**SUBAGENT.md Generality** ([#986](https://github.com/massgen/MassGen/pull/986)): Improved SUBAGENT.md for broader subagent compatibility
37
+
38
+
### Fixed
39
+
-**Session Resumption** ([#986](https://github.com/massgen/MassGen/pull/986)): Fixed resumption from already-resumed logs
Copy file name to clipboardExpand all lines: CONTRIBUTING.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -359,7 +359,7 @@ Create a `.env` file in the `massgen` directory as described in [README](README.
359
359
360
360
## 🔧 Development Workflow
361
361
362
-
> **Important**: Our next version is v0.1.61. If you want to contribute, please contribute to the `dev/v0.1.61` branch (or `main` if dev/v0.1.61 doesn't exist yet).
362
+
> **Important**: Our next version is v0.1.62. If you want to contribute, please contribute to the `dev/v0.1.62` branch (or `main` if dev/v0.1.62 doesn't exist yet).
363
363
364
364
### 1. Create Feature Branch
365
365
@@ -368,7 +368,7 @@ Create a `.env` file in the `massgen` directory as described in [README](README.
368
368
git fetch upstream
369
369
370
370
# Create feature branch from dev/v0.1.60 (or main if dev branch doesn't exist yet)
@@ -155,23 +155,22 @@ This project started with the "threads of thought" and "iterative refinement" id
155
155
156
156
---
157
157
158
-
## 🆕 Latest Features (v0.1.60)
158
+
## 🆕 Latest Features (v0.1.61)
159
159
160
-
**🎉 Released: March 6, 2026**
160
+
**🎉 Released: March 9, 2026**
161
161
162
-
**What's New in v0.1.60:**
163
-
-**🛠️ Multimodal Tool Improvements** - Rewritten `read_media` with clearer schema and `MediaCallLedgerHook` for tracking media calls.
164
-
-**🤖 Subagent Enhancements** - `inherit_spawning_agent_backend` for automatic backend inheritance, `final_answer_strategy` for child orchestrator policy, per-agent `subagent_agents` override.
165
-
-**🧠 GPT-5.4** - New default OpenAI flagship model across all coordination modes.
166
-
-**🔄 Decomp + Checklist Cooperation** - Decomp mode works with checklist workflow for quality-gated subtask iteration.
162
+
**What's New in v0.1.61:**
163
+
-**🔄 Round Evaluator Paradigm** - New `round_evaluator` subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment.
164
+
-**📝 Evaluation Improvements** - Improved evaluation prompts with task plan injection for context-aware assessment.
165
+
-**🔧 Orchestrator Refactoring** - Major orchestrator refactoring (+1,189 lines) to support the round evaluation workflow.
167
166
168
-
**Try v0.1.60 Features:**
167
+
**Try v0.1.61 Features:**
169
168
```bash
170
169
# Install or upgrade
171
170
pip install --upgrade massgen
172
171
173
-
#Choose backend 'openai' with model 'gpt-5.4' in the setup wizard to start using GPT-5.4
174
-
uv run massgen --quickstart
172
+
#Try the round evaluator paradigm
173
+
uv run massgen --config @examples/features/round_evaluator_example.yaml "Create a website for an AI startup with polished visuals and interactive elements"
175
174
```
176
175
177
176
→ [See full release history and examples](massgen/configs/README.md#release-history--examples)
@@ -1233,31 +1232,27 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
1233
1232
1234
1233
⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.
-**Per-Agent `subagent_agents`**: Per-agent override for subagent agent configs; orchestrator config file support with robust JSON parsing
1239
+
#### Round Evaluator Paradigm
1240
+
-**Round Evaluator Subagent Type** ([#986](https://github.com/massgen/MassGen/pull/986)): New `round_evaluator` subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment
1241
+
-**Orchestrator Refactoring**: Major orchestrator refactoring (+1,189 lines) to support the round evaluation workflow
✅ **Multimodal Tools, Subagent Enhancements & GPT-5.4 (v0.1.60)**: Rewritten read_media with clearer schema and MediaCallLedgerHook. Subagent enhancements with inherit_spawning_agent_backend, final_answer_strategy, per-agent subagent_agents. GPT-5.4 as default OpenAI flagship. Decomp mode cooperates with checklist workflow. Codex prompt caching fix.
1261
1256
1262
1257
✅ **Quality Round Improvements (v0.1.59)**: Auto-add improvements to task plan, plan review enhancements. Better eval gen config, checklist fixes, Gemini tool name normalization for MCP. Subagent behavior adjustments, Docker skill write access fixes. Video gen skill adjustments and impact metric restoration.
1263
1258
@@ -1522,9 +1517,9 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
1522
1517
1523
1518
We welcome community contributions to achieve these goals.
1524
1519
1525
-
### v0.1.60 Roadmap
1520
+
### v0.1.62 Roadmap
1526
1521
1527
-
Version 0.1.60 focuses on improving skill use and exploration:
1522
+
Version 0.1.62 focuses on improving skill use and exploration:
1528
1523
1529
1524
#### Planned Features
1530
1525
-**Improve Skill Use and Exploration** ([#873](https://github.com/massgen/MassGen/issues/873)): Local skill execution, skill registry with hierarchical organization, and skill consolidation workflow
@@ -154,23 +154,22 @@ This project started with the "threads of thought" and "iterative refinement" id
154
154
155
155
---
156
156
157
-
## 🆕 Latest Features (v0.1.60)
157
+
## 🆕 Latest Features (v0.1.61)
158
158
159
-
**🎉 Released: March 6, 2026**
159
+
**🎉 Released: March 9, 2026**
160
160
161
-
**What's New in v0.1.60:**
162
-
-**🛠️ Multimodal Tool Improvements** - Rewritten `read_media` with clearer schema and `MediaCallLedgerHook` for tracking media calls.
163
-
-**🤖 Subagent Enhancements** - `inherit_spawning_agent_backend` for automatic backend inheritance, `final_answer_strategy` for child orchestrator policy, per-agent `subagent_agents` override.
164
-
-**🧠 GPT-5.4** - New default OpenAI flagship model across all coordination modes.
165
-
-**🔄 Decomp + Checklist Cooperation** - Decomp mode works with checklist workflow for quality-gated subtask iteration.
161
+
**What's New in v0.1.61:**
162
+
-**🔄 Round Evaluator Paradigm** - New `round_evaluator` subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment.
163
+
-**📝 Evaluation Improvements** - Improved evaluation prompts with task plan injection for context-aware assessment.
164
+
-**🔧 Orchestrator Refactoring** - Major orchestrator refactoring (+1,189 lines) to support the round evaluation workflow.
166
165
167
-
**Try v0.1.60 Features:**
166
+
**Try v0.1.61 Features:**
168
167
```bash
169
168
# Install or upgrade
170
169
pip install --upgrade massgen
171
170
172
-
#Choose backend 'openai' with model 'gpt-5.4' in the setup wizard to start using GPT-5.4
173
-
uv run massgen --quickstart
171
+
#Try the round evaluator paradigm
172
+
uv run massgen --config @examples/features/round_evaluator_example.yaml "Create a website for an AI startup with polished visuals and interactive elements"
174
173
```
175
174
176
175
→ [See full release history and examples](massgen/configs/README.md#release-history--examples)
@@ -1232,31 +1231,27 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
1232
1231
1233
1232
⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.
-**Per-Agent `subagent_agents`**: Per-agent override for subagent agent configs; orchestrator config file support with robust JSON parsing
1238
+
#### Round Evaluator Paradigm
1239
+
-**Round Evaluator Subagent Type** ([#986](https://github.com/massgen/MassGen/pull/986)): New `round_evaluator` subagent type that delegates evaluation to specialized evaluator subagents for deeper quality assessment
1240
+
-**Orchestrator Refactoring**: Major orchestrator refactoring (+1,189 lines) to support the round evaluation workflow
✅ **Multimodal Tools, Subagent Enhancements & GPT-5.4 (v0.1.60)**: Rewritten read_media with clearer schema and MediaCallLedgerHook. Subagent enhancements with inherit_spawning_agent_backend, final_answer_strategy, per-agent subagent_agents. GPT-5.4 as default OpenAI flagship. Decomp mode cooperates with checklist workflow. Codex prompt caching fix.
1260
1255
1261
1256
✅ **Quality Round Improvements (v0.1.59)**: Auto-add improvements to task plan, plan review enhancements. Better eval gen config, checklist fixes, Gemini tool name normalization for MCP. Subagent behavior adjustments, Docker skill write access fixes. Video gen skill adjustments and impact metric restoration.
1262
1257
@@ -1521,9 +1516,9 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
1521
1516
1522
1517
We welcome community contributions to achieve these goals.
1523
1518
1524
-
### v0.1.60 Roadmap
1519
+
### v0.1.62 Roadmap
1525
1520
1526
-
Version 0.1.60 focuses on improving skill use and exploration:
1521
+
Version 0.1.62 focuses on improving skill use and exploration:
1527
1522
1528
1523
#### Planned Features
1529
1524
-**Improve Skill Use and Exploration** ([#873](https://github.com/massgen/MassGen/issues/873)): Local skill execution, skill registry with hierarchical organization, and skill consolidation workflow
0 commit comments