fix(snap): snap build filter + post-release follow-up by lalalune · Pull Request #7830 · elizaOS/eliza

lalalune · 2026-05-20T05:25:14Z

Cherry-pick of snap turbo filter fix to main for v2.0.2 release builds.

Greptile Summary

This PR cherry-picks the snap turbo filter fix (removing the incorrect /schema subpath from --filter=@elizaos/scenario-runner) to main for the v2.0.2 release, bundled with a broad post-release follow-up touching benchmarks, training scripts, QA tooling, and generated app-core files.

Snap fix: snapcraft.yaml corrects the turbo workspace filter from @elizaos/scenario-runner/schema (invalid) to @elizaos/scenario-runner, ensuring the package is included in snap builds.
Benchmark/orchestrator hardening: adapters.py extracts a retrying _docker_info_available() helper (3 attempts, 20 s timeout each), expands vision-language bundle manifest detection to three schema locations, and runner.py adds failed_scenarios/interrupted_run quarantine reasons plus overall_score to the score-extraction lookup chain.
Path and naming corrections: run.mjs fixes the repo-root path (2→3 levels up) and renames app-lifeops/app-training references to plugin-lifeops/plugin-training; check-mobile-artifacts.test.ts updates Android paths from packages/app to packages/app-core/platforms/android.

Confidence Score: 3/5

The snap fix is minimal and correct, but the breadth of post-release follow-up across 65 files warrants careful review before merging to main.

The e2e test file calls discoverDocsRoutes() at module load time with no guard around readdirSync — if the content/ directory is absent in any CI job, every test in the file fails as a collection error rather than being skipped. The remaining changes are well-scoped with accompanying tests.

packages/cloud-frontend/tests/e2e/cloud-routes.spec.ts needs a try/catch or existence check around walk(CONTENT_DIR) before the call at module scope.

Important Files Changed

Filename	Overview
packages/app-core/packaging/snap/snapcraft.yaml	Core fix: removes `/schema` subpath from turbo filter so `@elizaos/scenario-runner` workspace is correctly included in snap builds.
packages/cloud-frontend/tests/e2e/cloud-routes.spec.ts	Adds docs route discovery and two new dashboard routes with API mocks; `discoverDocsRoutes()` called at module scope without error handling — ENOENT on missing content dir fails all tests.
packages/benchmarks/orchestrator/adapters.py	Extracts `_docker_info_available()` with 3-attempt retry logic; expands vision-language bundle detection to accept `kernels` and `files` schema fields in addition to `runtime`.
packages/benchmarks/orchestrator/runner.py	Adds `failed_scenarios` and `interrupted_run` quarantine reasons; adds `overall_score` key to score extraction lookups in three places.
packages/feed/packages/agents/src/rubrics/index.ts	New file defining archetype rubrics and hash utilities; `getAllRubricsHash` double-counts `DEFAULT_RUBRIC` in its hash input.
packages/scripts/launch-qa/run.mjs	Fixes repo-root path (2→3 levels up), removes stale test file references, and renames `app-lifeops`/`app-training` plugin paths to `plugin-lifeops`/`plugin-training`.
packages/training/scripts/convert_hermes_to_eliza.py	Rewrites glaive format parsing to handle ASSISTANT: speaker, extract tools from system prompt JSON, and preserve intermediate assistant turns.
packages/chip/rtl/interconnect/axi4/e1_axi4_interconnect.sv	Inlines `w_active[m].slave` directly into signal assignments, removing the `automatic` local variable; semantically equivalent.
packages/benchmarks/eliza-adapter/eliza_adapter/woobench.py	Drops `"system"` from the role allowlist when building bridge message history; system-role turns in `recent_history` are now silently excluded.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[_has_terminal_bench_docker_backend] --> B[_docker_info_available]
    C[_has_hermes_sandbox_backend] -->|no MODAL tokens| B
    B --> D{docker in PATH?}
    D -->|no| E[return False]
    D -->|yes| F[attempt loop\nmax 3 tries]
    F --> G[docker info --format ServerVersion\ntimeout=20s]
    G -->|returncode == 0| H[return True]
    G -->|error / timeout| I{more attempts?}
    I -->|yes| J[sleep 0.25s]
    J --> F
    I -->|no| K[return False]

Comments Outside Diff (1)

packages/cloud-frontend/tests/e2e/cloud-routes.spec.ts, line 76-80 (link)

discoverDocsRoutes() called at module scope without error handling

walk(CONTENT_DIR) calls readdirSync with no try-catch. If CONTENT_DIR (../../content relative to the test file) doesn't exist in the CI environment — e.g., docs aren't checked out in an agent-focused workflow — readdirSync throws ENOENT at module initialization, preventing the entire test file from loading and failing every test in it as a collection error rather than a skipped test.

_{Reviews (1): Last reviewed commit: "fix(snap): use base package name instead..." | Re-trigger Greptile}

Greptile also left 2 inline comments on this PR.

…ompts - packages/agent/src/api/registry-service.ts: comment 'Babylon-compatible' → 'Feed-compatible' - packages/cloud-shared/src/lib/services/referrals.ts: comment 'Babylon's pattern' → 'Feed's pattern' - packages/core/src/features/trajectories/export.ts: comment 'Babylon-specific' → 'Feed-specific' - packages/docs/changelog.mdx: 'Babylon agent terminal' / 'Babylon prediction market game' → Feed - packages/training/scripts/rl/attacker_trainer.py: DEFENDER_SYSTEM_PROMPT 'Babylon agent' → 'Feed agent' Remaining 'babylon' refs left intentionally: - plugins/plugin-babylon/* and apps/babylon.json (per user instruction: Babylon.js 3D plugin shell) - BabylonOperatorSurface/BabylonTuiView (Babylon.js 3D UI) - elizaOS/prr@babylon branch ref in .github/workflows/run-prr.yml (real upstream branch) - BabylonSocial/scambench import URLs in packages/benchmarks/scambench/ (external dataset) - 'babylon layout' historical comment in tests/rl/conftest.py Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

The snap build was failing with 'No package found with name @elizaos/scenario-runner/schema'. That's a TypeScript subpath import — not a workspace package name. Use the base name '@elizaos/scenario-runner' which actually resolves in the workspace. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

coderabbitai · 2026-05-20T05:25:21Z

Important

Review skipped

Auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: a3269e4d-506d-4f2d-b61f-ed8763e387eb

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch develop

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

greptile-apps · 2026-05-20T05:29:21Z

+export function getAllRubricsHash(): string {
+  return createHash("sha256")
+    .update(`${Object.values(RUBRICS).sort().join("::")}${DEFAULT_RUBRIC}`)
+    .digest("hex")
+    .substring(0, 16);
+}


getAllRubricsHash() appends DEFAULT_RUBRIC twice: once because every archetype in RUBRICS maps to DEFAULT_RUBRIC (so Object.values(RUBRICS).sort().join("::") already contains it 12 times), and again via the explicit concatenation. If per-archetype rubrics are ever added, the appended DEFAULT_RUBRIC would still inflate the hash, making it inconsistent with the intent of hashing "all unique rubric content".

Suggested change

export function getAllRubricsHash(): string {

return createHash("sha256")

.update(`${Object.values(RUBRICS).sort().join("::")}${DEFAULT_RUBRIC}`)

.digest("hex")

.substring(0, 16);

}

export function getAllRubricsHash(): string {

return createHash("sha256")

.update([...new Set(Object.values(RUBRICS))].sort().join("::"))

.digest("hex")

.substring(0, 16);

}

greptile-apps · 2026-05-20T05:29:22Z

                "content": turn["content"],
            }
            for turn in recent_history
-            if turn["role"] in {"system", "user", "assistant", "agent"}
+            if turn["role"] in {"user", "assistant", "agent"}
            and str(turn["content"]).strip() != system_hint


Silent drop of "system" turns in conversation history

The filter now excludes turns with role == "system" from recent_history. Previously, system-role turns that didn't match system_hint would pass through to the bridge. If any adapter or history builder emits mid-conversation system turns (e.g., injected context updates), they will now be silently dropped without warning, potentially leaving the agent with incomplete context. Worth confirming this is intentional for all adapters feeding recent_history.

claude · 2026-05-20T05:38:36Z

Claude encountered an error after 0s —— View job

I'll analyze this and get back to you.

github-actions · 2026-05-20T06:04:21Z

LifeOps Multi-Tier Benchmark

Suite: smoke — Tiers requested: large,frontier

`large`

LifeOps Multi-Tier Benchmark

Tier: large
Suite: smoke

`frontier`

LifeOps Multi-Tier Benchmark

Tier: frontier
Suite: smoke

Artifacts: lifeops-multi-tier-large-26143213496, lifeops-multi-tier-frontier-26143213496

Shaw and others added 10 commits May 19, 2026 22:14

fix(ci): update launch qa and rtl checks

e73bc52

chore(rename): Babylon→Feed in macos README bundled-app list

88762f1

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

chore: checkpoint latest local updates

eb58046

chore: commit latest feed and training updates

68b94cf

chore: commit latest audit and training test updates

5f1d39e

chore: commit latest package and training state

f6f1669

chore: commit benchmark training updates

51fa258

chore: commit package update batch

150480b

lalalune merged commit 9c77824 into main May 20, 2026
53 of 71 checks passed

greptile-apps Bot reviewed May 20, 2026

View reviewed changes

github-actions Bot added build Docs Tests core labels May 20, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(snap): snap build filter + post-release follow-up#7830

fix(snap): snap build filter + post-release follow-up#7830
lalalune merged 10 commits into
mainfrom
develop

lalalune commented May 20, 2026 •

edited by greptile-apps Bot

Loading

Uh oh!

coderabbitai Bot commented May 20, 2026

Review skipped

Uh oh!

Uh oh!

greptile-apps Bot May 20, 2026

Uh oh!

greptile-apps Bot May 20, 2026

Uh oh!

claude Bot commented May 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

lalalune commented May 20, 2026 • edited by greptile-apps Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Flowchart

Comments Outside Diff (1)

Uh oh!

coderabbitai Bot commented May 20, 2026

Review skipped

Uh oh!

Uh oh!

greptile-apps Bot May 20, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps Bot May 20, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented May 20, 2026

LifeOps Multi-Tier Benchmark

large

LifeOps Multi-Tier Benchmark

frontier

LifeOps Multi-Tier Benchmark

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

lalalune commented May 20, 2026 •

edited by greptile-apps Bot

Loading

claude Bot commented May 20, 2026 •

edited

Loading

`large`

`frontier`