ZeroClaw Operations Runbook

This runbook is for operators who maintain availability, security posture, and incident response.

Last verified: February 18, 2026.

Scope

Use this document for day-2 operations:

starting and supervising runtime
health checks and diagnostics
safe rollout and rollback
incident triage and recovery

For first-time installation, start from one-click-bootstrap.md.

Runtime Modes

Mode	Command	When to use
Foreground runtime	`zeroclaw daemon`	local debugging, short-lived sessions
Foreground gateway only	`zeroclaw gateway`	webhook endpoint testing
User service	`zeroclaw service install && zeroclaw service start`	persistent operator-managed runtime

Baseline Operator Checklist

Validate configuration:

zeroclaw status

Verify diagnostics:

zeroclaw doctor
zeroclaw channel doctor

Start runtime:

zeroclaw daemon

For persistent user session service:

zeroclaw service install
zeroclaw service start
zeroclaw service status

Health and State Signals

Signal	Command / File	Expected
Config validity	`zeroclaw doctor`	no critical errors
Channel connectivity	`zeroclaw channel doctor`	configured channels healthy
Runtime summary	`zeroclaw status`	expected provider/model/channels
Daemon heartbeat/state	`~/.zeroclaw/daemon_state.json`	file updates periodically

Logs and Diagnostics

macOS / Windows (service wrapper logs)

~/.zeroclaw/logs/daemon.stdout.log
~/.zeroclaw/logs/daemon.stderr.log

Linux (systemd user service)

journalctl --user -u zeroclaw.service -f

Incident Triage Flow (Fast Path)

Snapshot system state:

zeroclaw status
zeroclaw doctor
zeroclaw channel doctor

Check service state:

zeroclaw service status

If service is unhealthy, restart cleanly:

zeroclaw service stop
zeroclaw service start

If channels still fail, verify allowlists and credentials in ~/.zeroclaw/config.toml.
If gateway is involved, verify bind/auth settings ([gateway]) and local reachability.

Secret Leak Incident Response (CI Gitleaks)

When sec-audit.yml reports a gitleaks finding or uploads SARIF alerts:

Confirm whether the finding is a true credential leak or a test/doc false positive:
- review gitleaks.sarif + gitleaks-summary.json artifacts
- inspect changed commit range in the workflow summary
If true positive:
- revoke/rotate the exposed secret immediately
- remove leaked material from reachable history when required by policy
- open an incident record and track remediation ownership
If false positive:
- prefer narrowing detection scope first
- only add allowlist entries with explicit governance metadata (owner, reason, ticket, expires_on)
- ensure the related governance ticket is linked in the PR
Re-run Sec Audit and confirm:
- gitleaks lane green
- governance guard green
- SARIF upload succeeds

Safe Change Procedure

Before applying config changes:

backup ~/.zeroclaw/config.toml
apply one logical change at a time
run zeroclaw doctor
restart daemon/service
verify with status + channel doctor

Rollback Procedure

If a rollout regresses behavior:

restore previous config.toml
restart runtime (daemon or service)
confirm recovery via doctor and channel health checks
document incident root cause and mitigation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ZeroClaw Operations Runbook

Scope

Runtime Modes

Baseline Operator Checklist

Health and State Signals

Logs and Diagnostics

macOS / Windows (service wrapper logs)

Linux (systemd user service)

Incident Triage Flow (Fast Path)

Secret Leak Incident Response (CI Gitleaks)

Safe Change Procedure

Rollback Procedure

Related Docs

FilesExpand file tree

operations-runbook.md

Latest commit

History

operations-runbook.md

File metadata and controls

ZeroClaw Operations Runbook

Scope

Runtime Modes

Baseline Operator Checklist

Health and State Signals

Logs and Diagnostics

macOS / Windows (service wrapper logs)

Linux (systemd user service)

Incident Triage Flow (Fast Path)

Secret Leak Incident Response (CI Gitleaks)

Safe Change Procedure

Rollback Procedure

Related Docs