📘 Maestro Orchestrator — Orchestration Framework (fail-closed + HITL)

If uncertain, stop. If risky, escalate.
Research / educational governance simulations for agentic workflows.

⚠️ Purpose & Disclaimer (Research & Education)

This is a research/educational reference implementation (prototype).
Do not use it to execute or facilitate harmful actions (e.g., exploitation, intrusion, surveillance, impersonation, destruction, or data theft), or to violate any applicable terms/policies, laws, or internal rules of your services or execution environment.

This project focuses on education/research and defensive verification (e.g., log growth mitigation and validating fail-closed + HITL behavior).
It is not intended to publish exploitation tactics or facilitate wrongdoing.

Risk / Warranty / Liability

Use at your own risk: verify relevant terms/policies.
Isolated environment first: start with local smoke tests (no external networks; no real systems/data).
AS IS / no warranty: provided without warranty of any kind.
Limitation of liability: to the maximum extent permitted by applicable law, the author assumes no liability for damages arising from use of the code, documentation, or generated artifacts (including misuse by third parties).

Codebook disclaimer

The included codebook is a demo/reference artifact. Do not use it as-is in real deployments; create your own based on your requirements, threat model, and applicable policies/terms.
The codebook is for compact encoding/decoding of log fields and is NOT encryption (no confidentiality).

Testing & results disclaimer

Smoke tests and stress runs validate only the scenarios executed under specific runtime conditions.
They do not guarantee correctness, security, safety, or fitness for any purpose in real-world deployments. Results may vary depending on OS/Python versions, hardware, configuration, and operational use.

🇯🇵 Japanese version: README.ja.md

⚡ TL;DR

Fail-closed + HITL gating benches for negotiation/mediation-style workflows (research/education).
Reproducibility-first: seeded runs + pytest contract checks (vocabulary/invariants).
Audit-ready: minimal ARL logs; optional incident-only ARL indexing (INC#...) to avoid log bloat.

Overview

Maestro Orchestrator is a research / educational orchestration framework that prioritizes:

Fail-closed
If uncertain, unstable, or risky → do not continue silently.
HITL (Human-in-the-Loop)
Decisions that require human judgment are explicitly escalated.
Traceability
Decision flows are audit-ready and reproducible via minimal ARL logs.
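
The two rules above ("if uncertain, stop; if risky, escalate") can be pictured as a single gate function. The following is a minimal sketch, not the repository's implementation: the `gate` function, the `RUN` and `STOPPED` decision names, and the threshold values are hypothetical; only `PAUSE_FOR_HITL` appears in this README.

```python
from dataclasses import dataclass
from enum import Enum


class Decision(Enum):
    RUN = "RUN"                        # hypothetical name
    PAUSE_FOR_HITL = "PAUSE_FOR_HITL"  # name used in this README
    STOPPED = "STOPPED"                # hypothetical name


@dataclass(frozen=True)
class GateResult:
    decision: Decision
    reason: str  # kept so the decision can be written to an audit log


def gate(confidence, risk, min_confidence=0.8, max_risk=0.3):
    """Fail-closed gate: anything missing or uncertain never continues silently."""
    if confidence is None or risk is None:
        # A missing signal is treated as uncertainty, so the gate stops.
        return GateResult(Decision.STOPPED, "missing_signal")
    if risk > max_risk:
        # Risky cases are escalated to a human instead of being decided here.
        return GateResult(Decision.PAUSE_FOR_HITL, "risk_above_threshold")
    if confidence < min_confidence:
        return GateResult(Decision.STOPPED, "low_confidence")
    return GateResult(Decision.RUN, "ok")
```

The only path to `RUN` is the last line, so every new failure mode added later defaults to a stop or an escalation unless it is explicitly allowed.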

This repository contains implementation references (doc orchestrators) and simulation benches for negotiation, mediation, governance-style workflows, and gating behavior.

Latest update (what changed in this repo)

This update adds a packaged zip bundle for the emergency contract simulator.

Added: docs/mediation_emergency_contract_sim_pkg.zip (v5.1.2 convenience bundle)
Why: quick download/run for reproducible smoke/stress runs (seeded), without changing entrypoints
Canonical source of truth (authoritative logic):
- mediation_emergency_contract_sim_v5_1_2.py
- tests/test_v5_1_codebook_consistency.py (run with pytest -q)
CI impact: none (docs artifact; not an entrypoint)
Note: zip bundles are generated/convenience artifacts (reviewable evidence, not authoritative logic). Review before use.

Fixes in this update (old draft → current)

This update fixes two scale issues found in the previous v5.1.2 draft:

Incident-only persistence mismatch
- Problem: some non-incident events were forced to persist as FULL ARL rows (e.g., evaluation/reward), which could bloat logs even on normal runs.
- Fix: evaluation/reward events are now emitted as SUMMARY (no forced persistence). ARL persistence remains incident-only.
Pre-context candidate growth under unique run_id
- Problem: when full_context_n > 0 and each run uses a unique run_id, in-memory candidate buffers could accumulate across large runs.
- Fix: per-run candidate buffers are explicitly dropped at the end of each run (drop_candidates_for_run(run_id)).
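
To illustrate the second fix, here is a minimal sketch of a pre-context buffer keyed by run_id. It assumes the buffer is a bounded deque of the last `full_context_n` events; the `CandidateBuffer` class and its `add` / `candidates` methods are hypothetical, and only `drop_candidates_for_run`, `full_context_n`, and `run_id` come from the text above.

```python
from collections import deque


class CandidateBuffer:
    """Keeps the last `full_context_n` events per run as pre-context candidates."""

    def __init__(self, full_context_n):
        self.full_context_n = full_context_n
        self._by_run = {}  # run_id -> deque of recent events

    def add(self, run_id, event):
        if self.full_context_n <= 0:
            return  # pre-context disabled: nothing is buffered
        buf = self._by_run.setdefault(run_id, deque(maxlen=self.full_context_n))
        buf.append(event)

    def candidates(self, run_id):
        return list(self._by_run.get(run_id, ()))

    def drop_candidates_for_run(self, run_id):
        # Without this call, every unique run_id leaves one deque behind.
        self._by_run.pop(run_id, None)

    def __len__(self):
        return len(self._by_run)
```

Each deque is bounded, but the number of deques is not: with a unique run_id per run, the dictionary grows by one entry per run until the entry is dropped explicitly at the end of that run.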

Quickstart (recommended path)

v5.1.x is recommended for reproducibility + contract checks; v4.x is kept as a legacy stable bench.
Start with one script, confirm behavior and logs, then expand.

1) Run the recommended emergency contract simulator (v5.1.2)

Optional bundle: docs/mediation_emergency_contract_sim_pkg.zip (convenience only)

python mediation_emergency_contract_sim_v5_1_2.py --runs 100

2) Run the contract tests (v5.1.x: simulator + codebook consistency)

pytest -q tests/test_v5_1_codebook_consistency.py

3) Inspect / pin the demo codebook (v5.1-demo.1)

log_codebook_v5_1_demo_1.json (demo codebook; pin the version when exchanging artifacts)
Note: codebook is NOT encryption (no confidentiality).
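
As a rough picture of what "compact encoding/decoding of log fields" can mean, the sketch below maps field values to short codes and back. The structure of `DEMO_CODEBOOK`, the short codes, the `cb` version field, and the `encode` / `decode` helpers are all hypothetical and do not reflect the layout of log_codebook_v5_1_demo_1.json.

```python
# Hypothetical codebook layout; the real JSON file may be organized differently.
DEMO_CODEBOOK = {
    "version": "example-0",
    "fields": {
        "layer": {"ethics_gate": "L1", "acc_gate": "L2", "relativity_gate": "L3"},
        "decision": {"RUN": "D1", "PAUSE_FOR_HITL": "D2", "STOPPED": "D3"},
    },
}


def encode(row, codebook):
    """Replace known field values with short codes; unknown values are rejected."""
    out = {"cb": codebook["version"]}  # pin the codebook version in every row
    for key, value in row.items():
        table = codebook["fields"].get(key)
        if table is None:
            out[key] = value  # field is not covered by the codebook
        elif value in table:
            out[key] = table[value]
        else:
            raise KeyError(f"{key}={value!r} is not in codebook {codebook['version']}")
    return out


def decode(row, codebook):
    """Reverse of encode; refuses rows written with a different codebook version."""
    if row.get("cb") != codebook["version"]:
        raise ValueError("codebook version mismatch")
    out = {}
    for key, value in row.items():
        if key == "cb":
            continue
        table = codebook["fields"].get(key)
        if table is None:
            out[key] = value
        else:
            reverse = {code: name for name, code in table.items()}
            out[key] = reverse[value]
    return out
```

Anyone holding the codebook can decode every row, which is why this is compression of vocabulary and not confidentiality. Carrying the version in each row is one way to make a pinned codebook checkable when artifacts are exchanged.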

4) Optional: run the legacy stable bench (v4.8)

python mediation_emergency_contract_sim_v4_8.py
pytest -q tests/test_mediation_emergency_contract_sim_v4_8_smoke_metrics.py

5) Optional: inspect evidence bundle (v4.8 generated artifact)

docs/artifacts/v4_8_artifacts_bundle.zip

Evidence bundles (zip) are generated artifacts produced by tests/runs. The canonical source of truth is the generator scripts + tests.

Stress tests (safe-by-default)

v5.1.2 is designed to avoid memory blow-ups by default:

Aggregation-only mode (keep_runs=False default): no full per-run results kept in memory.
Optional: save ARL only on abnormal runs (incident indexing with INC#...).
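
The idea behind aggregation-only mode can be sketched as follows. This is an illustration under assumptions, not the simulator's code: `simulate_one`, `run_many`, and the outcome labels are hypothetical, while `keep_runs`, the seed, and the fabricate rate mirror the options named in this README.

```python
import random
from collections import Counter


def simulate_one(rng, fabricate_rate):
    # Hypothetical stand-in for one simulated run.
    return "STOPPED" if rng.random() < fabricate_rate else "RUN"


def run_many(runs, seed, fabricate_rate=0.1, keep_runs=False):
    """Aggregate outcomes; per-run results are kept only when keep_runs=True."""
    rng = random.Random(seed)  # one seeded generator makes the whole batch repeatable
    counters = Counter()
    kept = [] if keep_runs else None
    for i in range(runs):
        outcome = simulate_one(rng, fabricate_rate)
        counters[outcome] += 1  # memory use stays constant in the number of runs
        if kept is not None:
            kept.append((i, outcome))
    return counters, kept
```

With `keep_runs=False`, memory depends on the number of distinct outcomes, not on `--runs`, and the same seed reproduces the same counters.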

A) Lightweight smoke → medium stress (recommended ramp)

# 1) Smoke
python mediation_emergency_contract_sim_v5_1_2.py --runs 200

# 2) Medium stress (still aggregation-only)
python mediation_emergency_contract_sim_v5_1_2.py --runs 10000 --seed 42

B) Force incidents (example: fabricate-rate 10% over 200 runs)

With a 10% fabricate rate over 200 seeded runs, some runs should end as abnormal. INC# files are written only when --save-arl-on-abnormal is set:

python mediation_emergency_contract_sim_v5_1_2.py \
  --runs 200 \
  --fabricate-rate 0.1 \
  --seed 42 \
  --save-arl-on-abnormal \
  --arl-out-dir arl_out \
  --max-arl-files 1000

Outputs (when abnormal runs occur):

arl_out/INC#000001__SIM#B000xx.arl.jsonl (incident ARL)
arl_out/incident_index.jsonl (one line per incident)
arl_out/incident_counter.txt (persistent counter)

Tip: keep --max-arl-files set to cap disk growth.
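
The output layout above can be reproduced with a few lines of file handling. This sketch is an assumption about how such indexing could work, not the simulator's code: the `save_incident` and `next_incident_id` helpers and the index row fields are hypothetical, and the cap is assumed to simply stop writing once `max_arl_files` is reached.

```python
import json
from pathlib import Path


def next_incident_id(out_dir):
    """Read, increment, and persist the counter so IDs survive across processes."""
    counter_path = out_dir / "incident_counter.txt"
    last = int(counter_path.read_text().strip()) if counter_path.exists() else 0
    current = last + 1
    counter_path.write_text(str(current))
    return f"INC#{current:06d}"


def save_incident(out_dir, run_id, arl_rows, max_arl_files=1000):
    """Write one incident ARL file plus one index line; return None when capped."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    if len(list(out_dir.glob("*.arl.jsonl"))) >= max_arl_files:
        return None  # assumed cap behavior: skip the write, keep disk use bounded
    incident_id = next_incident_id(out_dir)
    arl_path = out_dir / f"{incident_id}__{run_id}.arl.jsonl"
    with arl_path.open("w", encoding="utf-8") as fh:
        for row in arl_rows:
            fh.write(json.dumps(row) + "\n")
    index_row = {"incident_id": incident_id, "run_id": run_id, "arl_path": arl_path.name}
    with (out_dir / "incident_index.jsonl").open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(index_row) + "\n")  # append-only: one line per incident
    return incident_id
```

Normal runs never call `save_incident`, so disk growth is proportional to the number of incidents and bounded by the cap.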

Diagrams & docs

Browse all diagrams and bundles here: docs/README.md

Key diagrams:

Emergency contract overview (v5.1.2): docs/architecture_v5_1_2_emergency_contract_overview.png
Architecture (code-aligned): docs/architecture_code_aligned.png
Unknown-progress + HITL diagram: docs/architecture_unknown_progress.png
Multi-agent hierarchy: docs/multi_agent_hierarchy_architecture.png
Sentiment context flow: docs/sentiment_context_flow.png

Architecture (high level)

Audit-ready and fail-closed control flow:

agents
  → mediator (risk / pattern / fact)
  → evidence verification
  → HITL (pause / reset / ban)
  → audit logs (ARL)

Architecture (overview, v5.1.2)

Diagram: docs/architecture_v5_1_2_emergency_contract_overview.png. Documentation-only; no logic changes.

Architecture (code-aligned diagrams)

The diagram at docs/architecture_code_aligned.png is aligned with the current code vocabulary. Documentation-only; no logic changes.

v5.0.1 → v5.1.2: What changed (delta)

v5.1.2 strengthens the simulator toward large-run stability and incident-only persistence.

Index + aggregation-only by default
- No per-run results kept in memory (prevents memory blow-ups on large --runs)
- Outputs focus on counters + HITL summary (optional items)
Incident indexing (optional)
- Abnormal runs are assigned INC#000001...
- Abnormal ARL saved as {arl_out_dir}/{incident_id}__{run_id}.arl.jsonl
- Index appended to {arl_out_dir}/incident_index.jsonl
- Persistent counter stored at {arl_out_dir}/incident_counter.txt

Still preserved:

Abnormal-only ARL persistence (pre-context + incident + post-context)
Tamper-evident ARL hash chaining (the OSS demo uses a demo key by default)
Fabricate-rate mixing + deterministic seeding (--fabricate-rate / --seed)
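
For readers unfamiliar with hash chaining, the sketch below shows one common way to make a log tamper-evident. It assumes HMAC-SHA256 over the previous hash plus a canonical JSON form of each row; the actual construction, field names (`prev_hash`, `hash`), and key handling in the simulator may differ, and `DEMO_KEY` here is a placeholder.

```python
import hashlib
import hmac
import json

DEMO_KEY = b"demo-key-not-secret"  # placeholder; a public demo key protects nothing
GENESIS = "0" * 64                 # assumed starting value for the chain


def _digest(key, prev_hash, row):
    # Canonical JSON (sorted keys, fixed separators) keeps the digest stable.
    body = json.dumps(row, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return hmac.new(key, prev_hash.encode("ascii") + body, hashlib.sha256).hexdigest()


def chain_rows(rows, key=DEMO_KEY):
    """Return copies of rows, each carrying the previous hash and its own hash."""
    prev, out = GENESIS, []
    for row in rows:
        current = _digest(key, prev, row)
        out.append({**row, "prev_hash": prev, "hash": current})
        prev = current
    return out


def verify_chain(chained, key=DEMO_KEY):
    """Recompute every link; any edited, removed, or reordered row breaks the chain."""
    prev = GENESIS
    for entry in chained:
        row = {k: v for k, v in entry.items() if k not in ("prev_hash", "hash")}
        if entry.get("prev_hash") != prev:
            return False
        expected = _digest(key, prev, row)
        if not hmac.compare_digest(entry.get("hash", ""), expected):
            return False
        prev = expected
    return True
```

Chaining makes edits detectable, not impossible: with a demo key, anyone can recompute a valid chain, so real deployments would need their own key management.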

Core invariants:

sealed may be set only by ethics_gate / acc_gate
relativity_gate is never sealed (PAUSE_FOR_HITL, overrideable=True, sealed=False)
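
The two invariants can be expressed as a small checker over log rows. This is a sketch, assuming each row is a dict with `layer`, `decision`, `sealed`, and `overrideable` keys; the `check_invariants` helper and the violation labels are hypothetical.

```python
SEALING_LAYERS = {"ethics_gate", "acc_gate"}  # the only layers allowed to seal


def check_invariants(row):
    """Return a list of violated invariants for one log row (empty list = ok)."""
    violations = []
    if row.get("sealed") and row.get("layer") not in SEALING_LAYERS:
        violations.append("sealed_by_unauthorized_layer")
    if row.get("layer") == "relativity_gate":
        if row.get("sealed"):
            violations.append("relativity_gate_sealed")
        if row.get("decision") == "PAUSE_FOR_HITL" and not row.get("overrideable"):
            violations.append("relativity_gate_pause_not_overrideable")
    return violations
```

Run over every emitted row in a test, a checker like this turns the invariants from prose into something that fails loudly when the vocabulary drifts.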

V1 → V4: What actually changed (conceptual)

mediation_emergency_contract_sim_v1.py demonstrates the minimum viable pipeline: a linear, event-driven workflow with fail-closed stops and minimal audit logs.

mediation_emergency_contract_sim_v4.py turns that pipeline into a repeatable governance bench by adding early rejection and controlled automation.

Added in v4:

Evidence gate (invalid/irrelevant/fabricated evidence triggers fail-closed stops)
Draft lint gate (draft-only semantics and scope boundaries)
Trust system (score + streak + cooldown)
AUTH HITL auto-skip (safe friction reduction via trust + grant, with ARL reasons)
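
One way to picture "score + streak + cooldown" with a grant-based auto-skip is the sketch below. Every number, field, and reason string here is hypothetical; the real v4 trust rules are defined in the simulator source, not in this README.

```python
from dataclasses import dataclass


@dataclass
class Trust:
    score: int = 0     # 0..100, integer to avoid float drift
    streak: int = 0    # consecutive successful outcomes
    cooldown: int = 0  # outcomes still to wait after a failure

    def record(self, ok):
        if ok:
            self.streak += 1
            self.score = min(100, self.score + 10)
            self.cooldown = max(0, self.cooldown - 1)
        else:
            # A single failure resets the streak and starts a cooldown.
            self.streak = 0
            self.score = max(0, self.score - 50)
            self.cooldown = 3

    def auth_auto_skip(self, grant, min_score=70, min_streak=3):
        """Return (skip, reason); the reason is meant to be written to the ARL."""
        if not grant:
            return False, "no_grant"
        if self.cooldown > 0:
            return False, "cooldown_active"
        if self.score < min_score or self.streak < min_streak:
            return False, "trust_too_low"
        return True, "trust_and_grant_ok"
```

The skip requires both an explicit grant and earned trust, and every refusal carries a reason, so the friction reduction stays auditable and falls back to HITL by default.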

V4 → V5: What changed (conceptual)

v4 focuses on a stable “emergency contract” governance bench with smoke tests and stress runners. v5 extends that bench toward artifact-level reproducibility and contract-style compatibility checks.

Added / strengthened in v5:

Log codebook (demo) + contract tests
- Enforces emitted vocabularies (layer/decision/final_decider/reason_code) via pytest.
Reproducibility surface (pin what matters)
- Pin simulator version, test version, and codebook version.
Tighter invariant enforcement
- Explicit tests/contracts around invariants reduce silent drift.
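
A contract test of this kind can be very small. The sketch below is not the content of tests/test_v5_1_codebook_consistency.py; the `ALLOWED` vocabulary, the sample rows, and the `vocabulary_violations` helper are hypothetical and only show the shape of a closed-vocabulary check that pytest can collect.

```python
# Hypothetical closed vocabularies; the real ones live in the pinned codebook.
ALLOWED = {
    "layer": {"ethics_gate", "acc_gate", "relativity_gate"},
    "decision": {"RUN", "PAUSE_FOR_HITL", "STOPPED"},
}


def vocabulary_violations(rows, allowed):
    """List (row_index, field, value) for every value outside the vocabulary."""
    bad = []
    for i, row in enumerate(rows):
        for field, vocab in allowed.items():
            value = row.get(field)  # a missing field counts as a violation
            if value not in vocab:
                bad.append((i, field, value))
    return bad


def test_emitted_vocabulary_is_closed():
    # In a real test, the rows would come from a seeded simulator run.
    rows = [
        {"layer": "relativity_gate", "decision": "PAUSE_FOR_HITL"},
        {"layer": "acc_gate", "decision": "STOPPED"},
    ]
    assert vocabulary_violations(rows, ALLOWED) == []
```

Because the check fails on any value the codebook does not list, adding a new decision or layer forces a deliberate codebook and test update instead of a silent drift.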

What did NOT change (still true in v5):

Research / educational intent
Fail-closed + HITL semantics
Use synthetic data only and run in isolated environments
No security guarantees (codebook is not encryption; tests do not guarantee safety in real-world deployments)

Execution examples

Doc orchestrator (reference implementation)

python ai_doc_orchestrator_kage3_v1_2_4.py

Emergency contract (recommended: v5.1.2) + contract tests

python mediation_emergency_contract_sim_v5_1_2.py
pytest -q tests/test_v5_1_codebook_consistency.py

Emergency contract (legacy stable bench: v4.8)

python mediation_emergency_contract_sim_v4_8.py
pytest -q tests/test_mediation_emergency_contract_sim_v4_8_smoke_metrics.py

Emergency contract (v4.4 stress)

python mediation_emergency_contract_sim_v4_4_stress.py --runs 10000 --out stress_results_v4_4_10000.json

Project intent / non-goals

Intent:

Reproducible safety and governance simulations
Explicit HITL semantics (pause/reset/ban)
Audit-ready decision traces (minimal ARL)

Non-goals:

Production-grade autonomous deployment
Unbounded self-directed agent control
Safety claims beyond what is explicitly tested

Data & safety notes

Use synthetic/dummy data only.
Avoid committing runtime logs; keep evidence artifacts minimal and reproducible.
Treat generated bundles (zip) as reviewable evidence, not canonical source.

License

Apache License 2.0 (see LICENSE)
