This repository uses a multi-agent workflow to build a family-safe 3D game engine (Godot + Ollama) with hexagonal architecture.
- Product: child-focused sandbox/tycoon creation engine (ages 6–8 with parent co-creation)
- Runtime stack: Godot 4.x + GDScript
- AI runtime: Ollama (tool-calling)
- Team model: Orchestrator + specialist agents + mandatory cross-agent review
- Git
- Godot 4.x
- Ollama (installed and running locally)
- At least one coding agent CLI (Codex / Claude / Copilot / Mistral)
Optional but recommended:
- Claude Code CLI (
claude) - GitHub Copilot in VS Code
- Mistral Vibe CLI
.ai/tasks/backlog.yaml(source of truth for work).ai/README.md(team process)AGENTS.md(Codex role + hard rules)CLAUDE.md(architecture/review role).github/copilot-instructions.md(Copilot implementation rules).ai/workflows/definition-of-done.mdand.ai/workflows/handoff-checklist.md
- Keep domain logic isolated from engine and LLM adapters (hexagonal boundaries).
- AI mutations must be reversible.
- Child mode defaults to safe presets and bounded choices.
- Parent mode unlocks advanced actions via approvals.
- Every completed task needs cross-agent review.
- Open the repo and read
backlog.yaml. - Pick one task where
status: todoand dependencies are satisfied. - Set task status to
in_progress. - Dispatch work to the right specialist:
./scripts/agent-dispatch.sh codex TASK-00X./scripts/agent-dispatch.sh claude TASK-00X./scripts/agent-dispatch.sh copilot TASK-00X./scripts/agent-dispatch.sh mistral TASK-00X
- Implement only the scoped task and acceptance criteria.
- Request cross-agent review using
.ai/contracts/review.schema.json. - Verify Definition of Done, then mark task
doneinbacklog.yaml.
- Codex: orchestrator + implementation/refactoring + safety-heavy implementation
- Claude: architecture decomposition and boundary/safety review
- Copilot: implementation acceleration (small, testable units)
- Mistral: systems/economy/gameplay-loop analysis
- Plan in small tasks (single specialty).
- Implement with narrow file scope.
- Produce handoff summary:
- files changed
- assumptions/risks
- exact verification commands
- Perform cross-agent review.
- Resolve disagreements by checking
docs/requirements/*and documenting orchestrator decision.
Use these as your functional and architectural source docs:
docs/requirements/architecture-requirements.mddocs/requirements/functionality-requirements.mddocs/requirements/technology-requirements.mddocs/requirements/ui-ux-requirements.md
The setup in this repo aligns with:
- GitHub Copilot repository instructions and
AGENTS.mdbehavior (GitHub Docs) - Claude Code CLI +
CLAUDE.mdinstruction-driven workflows (Claude Code Docs) - Ollama local-first runtime and API-first integration model (Ollama Docs)
Helpful links:
- https://docs.github.com/en/copilot/how-tos/configure-custom-instructions/add-repository-instructions
- https://code.claude.com/docs/en/overview
- https://code.claude.com/docs/en/cli-reference
- https://docs.ollama.com/quickstart
- https://docs.ollama.com/api
A task is complete only when:
- acceptance criteria in backlog are met,
- hexagonal boundaries are preserved,
- kid safety + parent control impact are reviewed,
- AI/tool-calling behavior is documented when relevant,
- cross-agent review is recorded,
- relevant tests/checks are run.
Local quality-gate entrypoints:
./scripts/ci/check-domain-isolation.sh./scripts/ci/run-prompt-regression.sh./scripts/ci/run-safety-gates.sh./scripts/ci/run-safety-compliance-regression.sh./scripts/ci/run-abuse-regression.sh./scripts/ci/run-persistence-resilience.sh./scripts/ci/run-mvp-acceptance-suite.sh./scripts/ci/run-kid-parent-ai-regression-matrix.sh./scripts/ci/run-usability-kpi-pipeline.sh./scripts/ci/run-performance-gates.sh./scripts/run-quality-gates.sh
Performance gate defaults:
./scripts/ci/run-performance-gates.shruns all configured tiers (tier1,tier2) by default.- For faster local iteration, run a single tier:
CHOYCE_PERF_RUN_ALL_TIERS=0 CHOYCE_HARDWARE_TIER=tier1 ./scripts/ci/run-performance-gates.sh
Release-readiness docs for all deployment modes:
docs/release/launch-checklist.mddocs/release/runbook-local-only.mddocs/release/runbook-family-cloud.mddocs/release/runbook-classroom.mddocs/release/release-exit-criteria.md
Manual evidence completeness check:
./scripts/ci/validate-manual-qa-artifacts.sh
Use focused suites during active implementation/review:
godot4 --headless --path . --script tests/contracts/run_task_027_tests.gdgodot4 --headless --path . --script tests/contracts/run_task_032_tests.gdgodot4 --headless --path . --script tests/contracts/run_task_044_tests.gdgodot4 --headless --path . --script tests/contracts/run_task_047_tests.gd
If you need the full contract run:
./scripts/run-contract-tests.sh
Note:
- Some legacy contract files may still emit known baseline parse/runtime noise; use
./scripts/run-quality-gates.shas the primary gate for merge decisions.
Track what each agent should work on next:
./scripts/agent-watch.sh codex./scripts/agent-watch.sh claude./scripts/agent-watch.sh copilot./scripts/agent-watch.sh mistral
Continuous monitoring:
- Linux with
watch:watch -n 15 ./scripts/agent-watch.sh codex - macOS fallback:
while true; do ./scripts/agent-watch.sh codex; sleep 15; clear; done
Testing wave currently includes:
TASK-057automated inbound shell regression (owner: copilot)TASK-058automated persistence resilience regression (owner: codex)TASK-059manual pre-network kid-parent gameplay/trust charter (owner: mistral)TASK-060manual safety/compliance operations drill (owner: claude)
See .ai/tasks/backlog.yaml for authoritative status/dependencies.