Context
Hits the on-device memory / latency budget.
Scope
- Per-tensor symmetric int8 for predictor + action encoder with calibration over 1k reference windows.
- Q4_K_M for Carbon via llama.cpp toolchain.
- Evaluate quality drop against RFC-0016 §3.3 budget.
Out of Scope
- AWQ / GPTQ alternatives (Phase 4 if needed).
Design Reference
- RFC: rfcs/0010-on-device-personal-genome-deployment.md §3.3
- RFC: rfcs/0016-performance-budget.md §3.3
Acceptance Criteria
Parent tracking issue: #14
Context
Hits the on-device memory / latency budget.
Scope
Out of Scope
Design Reference
Acceptance Criteria
Parent tracking issue: #14