[nightshift] Parallel sub-agents, automerge, and reviewer selection by rjpower · Pull Request #4391 · marin-community/marin

rjpower · 2026-04-03T15:55:01Z

Rewrite nightshift_cleanup.py to spawn one scout agent per subproject in
parallel git worktrees, then merge results into a single PR. Each scout
gets 400 turns to find one meaty improvement in its assigned subproject,
then a merge agent cherry-picks all results, runs lint+tests, and opens
one unified PR.

Also add to both cleanup and doc-drift workflows: rebase on origin/main
before pushing, enable automerge via gh pr merge --auto --squash, and
select a reviewer from git log on changed files. Cleanup workflow timeout
bumped from 240 to 360 minutes for parallel agent overhead.

Rewrite nightshift_cleanup.py to spawn one scout agent per subproject in parallel git worktrees, then merge results into a single PR. Add rebase on origin/main, gh pr merge --auto --squash, and git-log-based reviewer selection to both cleanup and doc-drift workflows. Bump cleanup timeout to 360 minutes.

claude · 2026-04-03T15:55:18Z

Claude finished @rjpower's task in 1m 37s —— View job

PR Review: Parallel sub-agents, automerge, and reviewer selection

Read changed files and gather context
Review code for correctness and issues
Post review feedback

Summary

Good architectural change — splitting into parallel scout agents per subproject with a merge agent is a solid design that should improve coverage and reduce wasted turns. The doc-drift automerge/reviewer additions are clean. I left 7 inline comments on nightshift_cleanup.py; here's the high-level:

Issues to address

tempfile.mktemp is deprecated (line 156) — TOCTOU race. Use NamedTemporaryFile(delete=False) instead.
Redundant git fetch in each scout (line 149) — main() already fetches at line 236 before spawning scouts. Running 4 concurrent fetches of the same ref is wasteful.
Unhandled scout exceptions kill the entire run (lines 246-248) — If one scout's worktree setup fails (check=True), future.result() re-raises and aborts all other scouts. Wrap with try/except so surviving scouts still produce results.
Unhandled json.JSONDecodeError (line 184) — If a scout writes malformed JSON, the run crashes. Needs a try/except.
.nightshift-worktrees not in .gitignore (line 141) — If cleanup fails partway, this directory could end up tracked.
cleanup_worktrees fragile on partial failure (lines 221-227) — If git worktree remove fails for one child, rmdir() raises OSError. Consider shutil.rmtree as fallback.
Reviewer selection uses email, but --add-reviewer expects username (line 128) — git log --format='%ae' returns emails, not GitHub usernames. This will fail for most contributors. Need an email→username mapping step (e.g. via gh api).

Design observations

400 turns per scout × 4 scouts is substantial. The workflow timeout bump to 360 minutes should cover it, but worth monitoring actual runtimes.
The merge agent max-turns dropped from 800 → 200, which seems reasonable for cherry-pick + PR creation.
The doc-drift changes are straightforward prompt additions — no concerns there.

claude · 2026-04-03T15:56:32Z

+        cwd=repo_root,
+    )
+
+    result_file = tempfile.mktemp(suffix=".json", prefix=f"nightshift-{worktree_name}-")


tempfile.mktemp is deprecated and has a TOCTOU race — the path could be claimed between creation and the scout writing to it. Use tempfile.NamedTemporaryFile(delete=False) instead:

Suggested change

result_file = tempfile.mktemp(suffix=".json", prefix=f"nightshift-{worktree_name}-")

with tempfile.NamedTemporaryFile(suffix=".json", prefix=f"nightshift-{worktree_name}-", delete=False) as f:

result_file = f.name

claude · 2026-04-03T15:56:35Z

+    subprocess.run(["git", "branch", "-D", branch_name], capture_output=True, cwd=repo_root)
+
+    # Create worktree on a fresh branch from origin/main
+    subprocess.run(["git", "fetch", "origin", "main"], check=True, cwd=repo_root, capture_output=True)


Each of the 4 scouts runs git fetch origin main independently. Since they execute in parallel via ThreadPoolExecutor, this creates 4 concurrent fetches of the same ref. The fetch already happens once in main() at line 236 before scouts are launched — this one could be removed (or guarded with a note that the worktree creation depends on origin/main being fresh).

claude · 2026-04-03T15:56:39Z

+        futures = {pool.submit(run_scout, sp, date, repo_root): sp for sp in SUBPROJECTS}
+        for future in as_completed(futures):
+            subproject, result, worktree_path = future.result()


If any scout raises an exception (e.g. the worktree git checkout fails with check=True), future.result() will re-raise it here and kill the entire run — including any scouts still in-flight. Consider wrapping with try/except to let other scouts finish:

for future in as_completed(futures): try: subproject, result, worktree_path = future.result() except Exception: sp = futures[future] logger.exception("Scout %s failed", sp) scout_results.append({"subproject": sp, "status": "error", "summary": "Scout raised an exception"}) continue ...

claude · 2026-04-03T15:56:42Z

+    )
+
+    result = {"subproject": subproject, "status": "error", "summary": "Scout did not produce a result file"}
+    if Path(result_file).exists():


Malformed JSON from the scout will crash with an unhandled json.JSONDecodeError. Worth wrapping:

Suggested change

if Path(result_file).exists():

if Path(result_file).exists():

try:

result = json.loads(Path(result_file).read_text())

except json.JSONDecodeError:

result = {"subproject": subproject, "status": "error", "summary": "Scout produced invalid JSON"}

Path(result_file).unlink()

claude · 2026-04-03T15:56:44Z

+def run_scout(subproject: str, date: str, repo_root: Path) -> tuple[str, dict, str]:
+    """Run a single scout agent in a git worktree. Returns (subproject, result_dict, worktree_path)."""
+    worktree_name = f"nightshift-scout-{subproject.replace('/', '-')}"
+    worktree_path = repo_root / ".nightshift-worktrees" / worktree_name


.nightshift-worktrees is not in .gitignore. If a scout or the cleanup function fails partway through, this directory (or its git metadata) could end up tracked. Consider adding it to .gitignore.

claude · 2026-04-03T15:56:47Z

+def cleanup_worktrees(repo_root: Path) -> None:
+    """Remove all nightshift scout worktrees."""
+    worktrees_dir = repo_root / ".nightshift-worktrees"
+    if worktrees_dir.exists():
+        for child in worktrees_dir.iterdir():
+            subprocess.run(["git", "worktree", "remove", "--force", str(child)], capture_output=True, cwd=repo_root)
+        worktrees_dir.rmdir()


If any child directory inside .nightshift-worktrees was not a worktree (or was already removed), worktrees_dir.rmdir() will silently succeed. But if git worktree remove fails for one child, rmdir() will raise OSError because the directory isn't empty. Consider using shutil.rmtree as a fallback, or at least logging failures from the worktree-remove loop.

claude · 2026-04-03T15:56:50Z

+6. Pick a reviewer by finding who recently touched the changed files:
+   ```
+   git log --format='%ae' -20 -- <changed_files> | sort | uniq -c | sort -rn | head -5
+   ```


The reviewer selection uses --format='%ae' (email), but gh pr edit --add-reviewer expects a GitHub username, not an email. This will work only if the email happens to match the username. You'd need to map emails → GitHub usernames (e.g. via gh api or by using --format='%aN' and matching against known contributors).

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 8aad18d3c7

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-04-03T15:58:51Z

+
+6. Pick a reviewer by finding who recently touched the changed files:
+   ```
+   git log --format='%ae' -20 -- <changed_files> | sort | uniq -c | sort -rn | head -5


Use GitHub logins for reviewer assignment

This workflow instructs the agent to select reviewers from git log --format='%ae', which yields email addresses, but gh pr edit --add-reviewer accepts reviewer logins (per gh pr edit help: --add-reviewer <login>). In practice this causes reviewer assignment to fail (or pick invalid identities), so the new auto-reviewer step in cleanup can break PR automation when the selected value is an email.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-03T15:58:51Z

+
+    result = {"subproject": subproject, "status": "error", "summary": "Scout did not produce a result file"}
+    if Path(result_file).exists():
+        result = json.loads(Path(result_file).read_text())


Handle malformed scout JSON instead of crashing merge run

run_scout is intended to tolerate scout failures (check=False on the Claude call), but it unconditionally parses the result file with json.loads(...) and does not catch decode/read errors. If any scout writes partial/invalid JSON, future.result() raises and aborts the whole cleanup run before merge, which defeats the fault-tolerant parallel design.

Useful? React with 👍 / 👎.

Fix tempfile.mktemp TOCTOU race with NamedTemporaryFile(delete=False). Remove redundant git fetch from scouts (main() fetches once). Wrap future.result() to isolate scout failures. Handle malformed scout JSON. Use shutil.rmtree as fallback in cleanup_worktrees. Switch reviewer selection from git log emails to gh api GitHub logins. Add .nightshift-worktrees to .gitignore.

…4391) Rewrite nightshift_cleanup.py to spawn one scout agent per subproject in parallel git worktrees, then merge results into a single PR. Each scout gets 400 turns to find one meaty improvement in its assigned subproject, then a merge agent cherry-picks all results, runs lint+tests, and opens one unified PR. Also add to both cleanup and doc-drift workflows: rebase on origin/main before pushing, enable automerge via gh pr merge --auto --squash, and select a reviewer from git log on changed files. Cleanup workflow timeout bumped from 240 to 360 minutes for parallel agent overhead.

rjpower added the agent-generated Created by automation/agent label Apr 3, 2026

claude Bot reviewed Apr 3, 2026

View reviewed changes

chatgpt-codex-connector Bot reviewed Apr 3, 2026

View reviewed changes

rjpower added 2 commits April 3, 2026 09:03

Move nightshift worktrees under .claude/ which is already gitignored

d7029a3

rjpower merged commit d3b7c64 into main Apr 3, 2026
36 checks passed

rjpower deleted the claude/suspicious-hertz branch April 3, 2026 16:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[nightshift] Parallel sub-agents, automerge, and reviewer selection#4391

[nightshift] Parallel sub-agents, automerge, and reviewer selection#4391
rjpower merged 3 commits intomainfrom
claude/suspicious-hertz

rjpower commented Apr 3, 2026

Uh oh!

claude Bot commented Apr 3, 2026 •

edited

Loading

Uh oh!

claude Bot Apr 3, 2026

Uh oh!

claude Bot Apr 3, 2026

Uh oh!

claude Bot Apr 3, 2026

Uh oh!

claude Bot Apr 3, 2026

Uh oh!

claude Bot Apr 3, 2026

Uh oh!

claude Bot Apr 3, 2026

Uh oh!

claude Bot Apr 3, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Apr 3, 2026

Uh oh!

chatgpt-codex-connector Bot Apr 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	result_file = tempfile.mktemp(suffix=".json", prefix=f"nightshift-{worktree_name}-")
	with tempfile.NamedTemporaryFile(suffix=".json", prefix=f"nightshift-{worktree_name}-", delete=False) as f:
	result_file = f.name

Conversation

rjpower commented Apr 3, 2026

Uh oh!

claude Bot commented Apr 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review: Parallel sub-agents, automerge, and reviewer selection

Summary

Issues to address

Design observations

Uh oh!

claude Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

claude Bot commented Apr 3, 2026 •

edited

Loading