Skip to content

Latest commit

 

History

History
64 lines (52 loc) · 2.48 KB

File metadata and controls

64 lines (52 loc) · 2.48 KB

Changelog

All notable changes to Agent SRE will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[Unreleased]

Added

  • MeasurementStore ABC with InMemoryMeasurementStore (thread-safe, default) and SQLiteMeasurementStore (durable, survives agent restarts) backends for SLI measurement persistence — closes #645.
  • CalibrationDeltaSLI: new built-in SLI that tracks the running gap between an agent's stated confidence and its empirical success rate (calibration drift). Registered in SLIRegistry by default. Reference: PDR DOI 10.5281/zenodo.19339987.
  • SLI.__init__ and all built-in SLI subclasses now accept an optional store keyword argument. Omitting it preserves identical backward-compatible behaviour.
  • _validate_db_path() utility function rejects non-file URI schemes (e.g. http://) passed to SQLiteMeasurementStore.
  • 26 new tests in tests/unit/test_sli_persistence.py covering both stores, thread-safety, SQLite durability, input validation, and CalibrationDeltaSLI.

Changed

  • InMemoryMeasurementStore is now thread-safe (uses threading.Lock).
  • SLI._measurements is preserved as a backward-compatible alias pointing into the in-memory store's row list when the default backend is used.

[0.3.0] - 2026-02-19

Added

  • ARCHITECTURE.md documenting 7-engine architecture
  • OpenTelemetry integration for distributed tracing
  • SLO-as-Code YAML definitions with error budgets
  • Incident runbook templates for common agent failures
  • Golden signal traces for agent observability
  • Chaos scheduling engine with 9 fault templates
  • Blue-green deployment support for agent rollouts
  • Cost optimization engine with budget guardrails
  • Prometheus/Grafana dashboards for SLO monitoring
  • GitHub Actions canary deployment action

Changed

  • Improved burn rate alert thresholds
  • Enhanced error budget calculation precision

[0.2.0] - 2026-02-01

Added

  • Core SLO Engine with 7 SLI types
  • Replay Engine for deterministic capture/replay
  • Progressive Delivery engine (shadow, canary, rollback)
  • Chaos Engineering engine with fault injection
  • Cost Guard engine with anomaly detection
  • Incident Manager with auto-detection and postmortem
  • Full test suite

[0.1.0] - 2026-01-26

Added

  • Initial release
  • Basic SLO definitions and evaluation
  • Error budget tracking
  • Agent OS and AgentMesh integration