[codex] Improve runtime reliability and validation workflows by huisezhiyin · Pull Request #7 · huisezhiyin/agent-experience-capitalization

huisezhiyin · 2026-05-09T08:07:06Z

Summary

This PR extends the ongoing runtime reliability and validation workflow effort with two targeted resilience fixes:

- auto-finish now survives read-only user-cache asset paths when writing activation feedback back into assets.
- Milvus lock diagnosis now safely self-heals stale dead-pid metadata instead of repeatedly surfacing the same stale-lock warning.

It also includes a focused validation pass over candidate backlog and unproven assets so the new prior/constraint assets have real help signals, not just stored inventory.

What changed

runtime/cli/main.py
- _apply_activation_feedback() now returns both activation_feedback and any write_warnings.
- asset effectiveness writes now use the same workspace fallback path logic as other user-cache outputs.
- auto-finish and explicit feedback both surface fallback warnings instead of failing on PermissionError.

runtime/storage/milvus_store.py
- milvus_lock_summary() now clears stale dead-pid lock metadata when no real flock owner exists.
- lock summaries expose stale_metadata_cleared so doctor/status can distinguish self-healed stale metadata from live locks.

tests/test_cli_flow.py
- adds a regression test for read-only primary user-cache asset writes during auto-finish feedback persistence.

tests/test_milvus_store.py
- adds stale-lock self-heal coverage and verifies live lock metadata is preserved.

Why

The recent review loop exposed two recurring runtime quality gaps:

- save/feedback flows could still fail even after fallback support existed for trace/episode/dashboard outputs.
- Milvus stale lock files could keep polluting diagnostics long after the owning process was gone.

Fixing both makes the runtime more robust and makes daily review output more trustworthy.
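
The stale-lock gap can be sketched as follows. This is not the code of milvus_lock_summary(): the file layout (a lock file plus a JSON metadata file holding a pid), the function names, and the summary keys other than stale_metadata_cleared are assumptions for illustration. The sketch is POSIX-only because it relies on fcntl.flock.

```python
import fcntl
import json
import os
from pathlib import Path


def pid_is_alive(pid: int) -> bool:
    """Probe a pid with signal 0, which checks existence without signalling."""
    if pid <= 0:
        return False
    try:
        os.kill(pid, 0)
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def flock_is_held(lock_path: Path) -> bool:
    """Return True if some other open file description holds the flock."""
    with open(lock_path, "a") as handle:
        try:
            fcntl.flock(handle, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except BlockingIOError:
            return True
        fcntl.flock(handle, fcntl.LOCK_UN)
        return False


def lock_summary(lock_path: Path, meta_path: Path) -> dict:
    """Hypothetical summary: clear metadata only if pid is dead and no flock owner exists."""
    summary = {"locked": False, "owner_pid": None, "stale_metadata_cleared": False}
    if lock_path.exists() and flock_is_held(lock_path):
        summary["locked"] = True
    if not meta_path.exists():
        return summary
    pid = int(json.loads(meta_path.read_text(encoding="utf-8")).get("pid", 0))
    if summary["locked"] or pid_is_alive(pid):
        summary["owner_pid"] = pid  # live lock: metadata is preserved
        return summary
    meta_path.unlink()  # dead pid and no flock owner: self-heal
    summary["stale_metadata_cleared"] = True
    return summary
```

Checking both the pid and the flock keeps the self-heal conservative: metadata is removed only when neither signal points to a live owner.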

Validation

python3 -m unittest \
  tests.test_cli_flow.CliFlowTests.test_cli_auto_finish_falls_back_when_feedback_asset_write_hits_primary_user_cache_root \
  tests.test_cli_flow.CliFlowTests.test_cli_auto_finish_falls_back_when_primary_memory_root_is_unwritable \
  tests.test_cli_flow.CliFlowTests.test_cli_auto_finish_records_activation_help_feedback_for_later_runs \
  tests.test_cli_flow.CliFlowTests.test_cli_auto_finish_persists_asset_effectiveness_summary
python3 -m unittest tests.test_milvus_store.MilvusStoreLockTests
EXPCAP_STORAGE_PROFILE=user-cache EXPCAP_HOME=$HOME/.expcap scripts/expcap doctor --workspace "$PWD"

Runtime impact

- Milvus remains healthy after the self-heal change.
- Proof coverage improved through the follow-up validation pass.
- The remaining doctor warning is operational backlog only: candidate_review_queue has 20 pending items.

huisezhiyin added 3 commits May 9, 2026 16:06

Trigger progressive recall from failed post-tool hooks

83d196c

Improve runtime reliability and unproven validation workflow

6b19fde

Harden feedback fallback and Milvus lock recovery

0f932e4

huisezhiyin added the codex and codex-automation labels May 12, 2026

huisezhiyin changed the title ~~[codex] Trigger progressive recall from failed post-tool hooks~~ [codex] Improve runtime reliability and validation workflows May 12, 2026

huisezhiyin marked this pull request as ready for review May 12, 2026 03:40

huisezhiyin merged commit 66472d3 into main May 12, 2026

huisezhiyin deleted the codex/continuous-runtime-recall-hook branch May 12, 2026 03:40

huisezhiyin merged 3 commits into main from codex/continuous-runtime-recall-hook
