Date: 2026-02-07 Owner: Platform Team
- Phase 1 implementation completed on branch
feat/phase1-node-auth-lifecycle(PR #66). - Phase 2 implementation completed on branch
feat/phase2-runtime-validation-v2(stacked on Phase 1 branch). - Ran
npm audit+npm audit fixon branchfeat/phase2-runtime-validation-v2; safe lockfile updates applied. - [~] Phase 3 implementation started on branch
feat/phase3-shared-protocol-adoption.
- Re-assess dependency risk with fresh
npm audit --json. - Apply non-breaking lockfile remediation (
npm audit fix --package-lock-only). - Confirm current audit state has no high/critical findings.
- Document remediation strategy, ownership, and exception process in
docs/security/dependency-remediation-plan.md. - Add CI security gate (
npm run security:audit) to fail on high/critical vulnerabilities. - Link dependency triage ownership/cadence policy in
docs/DEPENDENCY_TRIAGE_WORKFLOW.md.
- Create and approve ADRs for token transport, shared protocol package, and command reliability.
- Maintain
docs/compatibility.mdwith every release. - Enforce CI gates for lint, tests, build, and typecheck.
- Add contract-test placeholder in CI for shared protocol adoption.
Definition of done:
- All docs merged and linked from README.
- CI blocks PRs when lint/test/build/typecheck fail.
- Implement short-lived session token acquire/refresh flow.
- Move WS auth to header or subprotocol.
- Disable query-token auth in production mode.
- Add reconnect logic with token refresh.
- Add tests for expired, revoked, and unavailable auth scenarios.
Definition of done:
- Token rotation works without manual intervention.
- Query-token usage blocked in production.
- Validate all inbound commands before dispatch.
- Validate outbound telemetry/events before send.
- Add strict unknown-command handling.
- Add structured error logging with correlation IDs.
Definition of done:
- Invalid payload paths have deterministic test coverage.
- Replace local protocol types with
@kaonis/woly-protocol. - Remove duplicate protocol declarations.
- Add protocol version negotiation at connect.
- Add cross-repo contract tests.
- Add CI workflow to enforce protocol compatibility checks.
- Define external publish readiness + rollback workflow for
@kaonis/woly-protocolindocs/PROTOCOL_PUBLISH_WORKFLOW.md. - [~] Publish
@kaonis/woly-protocolwhen an external consumer release requires it (followdocs/PROTOCOL_PUBLISH_WORKFLOW.md).
Definition of done:
- Protocol compatibility is enforced in CI.
- Add idempotency guard for duplicate command delivery.
- Track local command lifecycle for diagnostics.
- Add timeout and bounded retry policies.
- Ensure acknowledgment retry semantics are safe.
Definition of done:
- Duplicate deliveries do not cause duplicate side effects.
- Add event sampling/debounce strategy.
- Add payload size caps/chunking strategy.
- Define queue-and-flush policy during C&C outage.
- Add stale-host data detection.
Definition of done:
- Node remains stable under event spikes and reconnect storms.
- Emit reconnect/auth/schema/latency metrics.
- Add startup diagnostics with build and protocol version.
- Publish incident runbooks.
- Execute canary to staged rollout policy.
Definition of done:
- On-call can isolate auth, protocol, command, or network failures quickly.