64 lines (52 loc) · 2.48 KB

Changelog

All notable changes to Agent SRE will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[Unreleased]

Added

MeasurementStore ABC with InMemoryMeasurementStore (thread-safe, default) and SQLiteMeasurementStore (durable, survives agent restarts) backends for SLI measurement persistence — closes #645.
CalibrationDeltaSLI: new built-in SLI that tracks the running gap between an agent's stated confidence and its empirical success rate (calibration drift). Registered in SLIRegistry by default. Reference: PDR DOI 10.5281/zenodo.19339987.
SLI.__init__ and all built-in SLI subclasses now accept an optional store keyword argument. Omitting it preserves identical backward-compatible behaviour.
_validate_db_path() utility function rejects non-file URI schemes (e.g. http://) passed to SQLiteMeasurementStore.
26 new tests in tests/unit/test_sli_persistence.py covering both stores, thread-safety, SQLite durability, input validation, and CalibrationDeltaSLI.

Changed

InMemoryMeasurementStore is now thread-safe (uses threading.Lock).
SLI._measurements is preserved as a backward-compatible alias pointing into the in-memory store's row list when the default backend is used.

[0.3.0] - 2026-02-19

Added

ARCHITECTURE.md documenting 7-engine architecture
OpenTelemetry integration for distributed tracing
SLO-as-Code YAML definitions with error budgets
Incident runbook templates for common agent failures
Golden signal traces for agent observability
Chaos scheduling engine with 9 fault templates
Blue-green deployment support for agent rollouts
Cost optimization engine with budget guardrails
Prometheus/Grafana dashboards for SLO monitoring
GitHub Actions canary deployment action

Changed

Improved burn rate alert thresholds
Enhanced error budget calculation precision

[0.2.0] - 2026-02-01

Added

Core SLO Engine with 7 SLI types
Replay Engine for deterministic capture/replay
Progressive Delivery engine (shadow, canary, rollback)
Chaos Engineering engine with fault injection
Cost Guard engine with anomaly detection
Incident Manager with auto-detection and postmortem
Full test suite

[0.1.0] - 2026-01-26

Added

Initial release
Basic SLO definitions and evaluation
Error budget tracking
Agent OS and AgentMesh integration