release: v2.3.0 — plugin governance, developer tooling, security hardening

imran-siddique · Copilot · imran-siddique · commit 8897ff5723a3 · 2026-03-25T21:13:59.000-07:00
- Bump all 7 core packages to v2.3.0
- Add RELEASE_NOTES_v2.3.0.md
- Update CHANGELOG.md with v2.3.0 entry

Co-authored-by: Copilot &lt;223556219+Copilot@users.noreply.github.com&gt;
diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -12,21 +12,44 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 
 ## [Unreleased]
 
+## [2.3.0] - 2026-03-26
+
 ### Added
-- Demo `--include-attacks` flag for adversarial scenario testing (prompt injection, tool alias bypass, SQL bypass).
-- .NET `SagaStep.MaxAttempts` property replacing deprecated `MaxRetries`.
-- `ContentHashInterceptor` for SHA-256 tool identity verification at intercept time.
-- `ToolRegistry` content hashing — computes and verifies handler integrity at registration and execution.
-- `PolicyEngine.freeze()` method with `MappingProxyType` immutability and mutation audit log.
-- `QuorumConfig` for M-of-N approval requirements in `EscalationHandler`.
-- Escalation fatigue detection — auto-DENY when agents exceed configurable rate threshold.
-- `EscalationRequest.votes` field for per-approver vote tracking.
+- MCP server allowlist/blocklist and plugin trust tiers (#425, #426)
+- Plugin schema adapters and batch evaluation (#424, #429)
+- Governance policy linter CLI command (#404)
+- Pre-commit hooks for plugin manifest validation (#428)
+- GitHub Actions action for governance verification (#423)
+- Event bus, task outcomes, diff policy, and sandbox provider (#398, #396, #395, #394)
+- Graceful degradation, budget policies, and audit logger (#410, #409, #400)
+- JSON schema validation for governance policies (#305, #367)
+- 14 launch-ready tutorials (07–20) covering all toolkit features
+- Tutorials landing page README with learning paths (#422)
+- Copilot instructions with PR review checklist (#413)
+- Pytest markers for slow and integration tests (#375)
+- Reference integration example for plugin marketplace governance (#427)
+
+### Changed
+- Renamed PyPI package `agent-runtime` → `agentmesh-runtime` (name collision with AutoGen) (#444)
+- Renamed PyPI package `agent-marketplace` → `agentmesh-marketplace` (#439)
+
+### Fixed
+- ESRP pipeline `each` directive syntax in Verify stages
+- ESRP pipelines updated to use `ESRP_CERT_IDENTIFIER` secret
+- Hardcoded service connection name (ADO compile-time requirement) (#421)
+- License format updated to SPDX string (setuptools deprecation) in agent-compliance and agent-lightning
+- Corrected license reference in AgentMesh README from Apache 2.0 to MIT (#436)
+- .NET GovernanceMetrics test isolation — flush listener before baseline (#417)
+- Dependency confusion + pydantic dependency fix (#412)
+- Enforced maintainer approval for all external PRs (#392)
 
 ### Security
-- Replaced XOR placeholder encryption with AES-256-GCM in DMZ module.
-- Added Security Model & Limitations section to README.
-- Added security advisories to SECURITY.md for CostGuard and thread safety fixes.
-- Hardened against agent sandbox escape vectors (tool aliasing, runtime policy self-modification, approval fatigue).
+- Moved all ESRP config to pipeline secrets (#370)
+
+### Documentation
+- Standardized package README badges (#373)
+- Added README files to example and skill integration directories (#371, #372, #390)
+- Added requirements for example directories (#372)
 
 ## [2.2.0] - 2026-03-17
 
diff --git a/RELEASE_NOTES_v2.3.0.md b/RELEASE_NOTES_v2.3.0.md
@@ -0,0 +1,270 @@
+# Agent Governance Toolkit v2.3.0
+
+> [!IMPORTANT]
+> **Community Preview Release** — All packages published from this repository (PyPI, npm, NuGet)
+> are **community preview releases** for testing and evaluation purposes only. They are **not**
+> official Microsoft-signed releases. Official Microsoft-signed packages published via ESRP
+> Release will be available in a future release.
+
+**Plugin governance, developer tooling, and hardened security — 97 commits since v2.2.0.**
+
+This release introduces a full plugin governance layer (MCP server allowlist/blocklist, schema
+adapters, trust tiers), developer-facing tooling (policy linter CLI, pre-commit hooks, GitHub
+Actions action), runtime reliability primitives (event bus, task outcomes, graceful degradation,
+budget policies), and 14 new tutorials. It also includes significant security hardening across the
+entire codebase and two PyPI package renames to avoid namespace collisions.
+
+## 🚀 What's New
+
+### Plugin Governance & MCP Server Controls
+
+- **MCP server allowlist/blocklist** — Enforces marketplace-level policies on which MCP servers
+  plugins can use through `MCPServerPolicy` with allowlist/blocklist modes. Validates plugin
+  manifests and rejects non-compliant plugins during registration (#425, #426, #434)
+- **Plugin trust tiers** — Classify plugins into trust levels (e.g., verified, community,
+  untrusted) with tier-based policy enforcement (#434)
+- **Plugin schema adapters** — Auto-detects and adapts Copilot-style and Claude-style plugin
+  manifest formats to the canonical `PluginManifest` schema, enabling multi-format plugin
+  support with capability extraction (#424, #429, #433)
+- **Batch plugin evaluation** — Evaluate multiple plugins against governance policies in a single
+  call for marketplace-scale validation (#429, #433)
+- **Reference integration example** — Complete example showing plugin marketplace governance
+  integration end-to-end (#427, #435)
+
+### Developer Tooling
+
+- **Governance policy linter CLI** — New `agent-compliance lint-policy <path>` command validates
+  YAML policy files for required fields, unknown operators/actions, deprecated names, and
+  conflicting rules with JSON/text output options (#404, #432)
+- **Pre-commit hooks** — Two new hooks for local development: `validate-plugin-manifest` (checks
+  plugin.json schema compliance) and `evaluate-plugin-policy` (evaluates manifests against
+  governance policies before commit) (#428, #431)
+- **GitHub Actions action** — Composite action at `action/action.yml` wrapping governance
+  verification commands (`governance-verify`, `marketplace-verify`, `policy-evaluate`, `all`)
+  with configurable inputs, structured outputs, and support for plugin marketplace PR
+  workflows (#423, #430)
+- **JSON schema validation** — Governance policy files are now validated against a formal JSON
+  schema, catching structural errors before runtime (#305, #367)
+
+### Runtime Reliability & Observability
+
+- **Event bus** — Cross-gate publish/subscribe system (`GovernanceEventBus`) enabling loose
+  coupling between governance gates (PolicyEvaluator, TrustGate, CircuitBreaker) with standard
+  event types for policy violations, trust changes, circuit state, and budget overages
+  (#398, #415)
+- **Task outcomes** — `TaskOutcomeRecorder` tracks agent task successes/failures with
+  severity-based scoring, diminishing returns on success boosts, time-based score recovery,
+  and per-agent trust state management (#396, #415)
+- **Diff policy** — Evaluate only the delta between previous and current policy state to reduce
+  overhead on incremental policy updates (#395, #415)
+- **Sandbox provider** — Pluggable sandbox provider abstraction for swapping isolation backends
+  (#394, #415)
+- **Graceful degradation** — `agent_os.compat` module provides no-op fallbacks
+  (`NoOpPolicyEvaluator`, `NoOpGovernanceMiddleware`) allowing consumers to optionally depend
+  on the toolkit without try/except boilerplate (#410, #414)
+- **Budget policies** — `BudgetPolicy` dataclass defines resource consumption limits (max tokens,
+  tool calls, cost, duration) with `BudgetTracker` for monitoring usage and detecting overages
+  with detailed violation reasons (#409, #414)
+- **Audit logger** — Structured audit logging for governance decisions with pluggable backends
+  (#400, #414)
+- **Policy evaluation heatmap** — Visual heatmap added to the SRE dashboard showing policy
+  evaluation patterns and hotspots (#309, #326)
+- **Compliance grading** — `compliance_grade()` method added to `GovernanceAttestation` for
+  calculating compliance scores (#346)
+
+### Tutorials & Learning Paths
+
+- **14 new tutorials (07–20)** — Launch-ready tutorials covering all toolkit features including
+  plugin governance, budget policies, event bus, graceful degradation, MCP server controls,
+  and more
+- **Tutorials landing page** — New README with structured learning paths guiding users from
+  beginner to advanced topics (#422)
+
+### CI/CD & ESRP
+
+- **PR review orchestrator** — Collapses multiple agent review comments into a single unified
+  summary on pull requests (#345)
+- **Dependency confusion pre-commit hook** — Detects unregistered package names before commit,
+  plus weekly CI audit job (#350)
+- **Markdown link checker** — CI workflow to catch broken links in documentation (#323)
+- **ESRP NuGet signing** — Updated NuGet signing config with Client ID and Key Vault
+  integration (#359, #361, #363, #365)
+
+## ⚠️ Breaking Changes
+
+### PyPI Package Renames
+
+Two PyPI packages have been renamed to avoid namespace collisions:
+
+| Old Name | New Name | Reason |
+|----------|----------|--------|
+| `agent-runtime` | `agentmesh-runtime` | Name collision with AutoGen team's `agent-runtime` package (#444) |
+| `agent-marketplace` | `agentmesh-marketplace` | Consistent `agentmesh` namespace alignment (#439) |
+
+**Migration:** Update your `requirements.txt` or `pyproject.toml`:
+
+```diff
+- agent-runtime
++ agentmesh-runtime
+
+- agent-marketplace
++ agentmesh-marketplace
+```
+
+## 🔒 Security
+
+- **Fork RCE hardening** — Hardened `pull_request_target` workflows against fork-based remote
+  code execution [MSRC-111178] (#353)
+- **Dependency confusion** — Comprehensive remediation across the entire codebase: replaced all
+  unregistered PyPI package names, added weekly audit CI, added pre-commit detection hook
+  (#325, #328, #349, #350, #351, #352)
+- **MD5 → SHA-256 migration** — All cryptographic hash usage migrated from MD5 to SHA-256
+  (#349, #351)
+- **ESRP secrets** — Moved all ESRP configuration values to pipeline secrets (#370)
+- **Maintainer approval enforcement** — All external PRs now require maintainer approval (#392)
+- **SECURITY.md** — Added security policy files to all packages (#354)
+- **LangChain crypto hardening** — Hardened cryptographic fallback in LangChain integration (#354)
+- **24 security findings addressed** — Comprehensive sweep across codebase (#303)
+- **Agent sandbox escape hardening** — Strengthened isolation boundaries against escape
+  vectors (#297)
+- **OWASP Agentic AI hardening** — Proactive hardening against OWASP Agentic AI Top 10
+  themes
+- **47 negative security tests** — Adversarial scenario test suite added
+- **101 additional tests** — CA security, MCP integration, and audit stub coverage
+- **OpenSSF Scorecard fixes** — Dangerous-workflow, signed-releases, and pinned-deps
+  improvements (#356)
+
+## 🐛 Bug Fixes
+
+- Corrected license reference in AgentMesh README from Apache 2.0 to MIT (#436)
+- Hardcoded service connection name in ESRP pipelines (ADO compile-time requirement) (#421)
+- ESRP pipeline fixes for `each` directive syntax in Verify stages and `ESRP_CERT_IDENTIFIER`
+  secret usage
+- Fixed .NET `GovernanceMetrics` test isolation — flush listener before baseline assertion (#417)
+- Fixed dependency confusion + pydantic dependency issues (#411, #412)
+- Followup cleanup for recently merged community PRs (#393)
+- Bumped `cryptography` package, migrated `PyPDF2` → `pypdf`, scoped workflow permissions (#355)
+- Filled community PR gaps — replaced bare excepts, `print` → `logging`, added `py.typed`
+  markers, LICENSE fixes (#344)
+- Improved CLI error messages in `register` and `policy` commands (#314)
+- `SagaStep.MaxRetries` rename + behavioral fault injection + lint fix (#295)
+- Pre-announcement security hardening and demo improvements (#296)
+- Restored `read-all` at workflow level for Scorecard verification (#327)
+- Reverted unsafe merged PRs #357 and #362 (#391)
+
+## 📚 Documentation
+
+- Added copilot-instructions.md with PR review checklist (#413)
+- Standardized package README badges across all packages (#373)
+- Added README files to example directories and skill integrations (#371, #372, #390)
+- Added requirements files for example directories (#372)
+- Refreshed all design proposals — updated status, added 5 new proposals (#348)
+- Added inline comments to Helm chart `values.yaml` (#341)
+- Updated framework integration star counts to current values (#329)
+- Added comprehensive docstrings to `mcp_adapter.py` classes (#324)
+- Added testing guide for external testers and customers (#313)
+- Added integration author guide for contributors (#311)
+
+## 📦 Dependencies
+
+### GitHub Actions
+
+| Package | From | To |
+|---------|------|----|
+| `actions/attest-sbom` | 2.2.0 | 4.1.0 |
+| `actions/attest-build-provenance` | 2.4.0 | 4.1.0 |
+| `actions/github-script` | 7.0.1 | 8.0.0 |
+| `actions/setup-node` | 4.4.0 | 6.3.0 |
+| `actions/stale` | 9.1.0 | 10.2.0 |
+| `actions/upload-artifact` | 4.6.2 | 7.0.0 |
+| `anchore/sbom-action` | 0.23.1 | 0.24.0 |
+| `ossf/scorecard-action` | 2.4.0 | 2.4.3 |
+| `sigstore/gh-action-sigstore-python` | 3.0.0 | 3.2.0 |
+
+### npm Dev Dependencies
+
+- Bumped `eslint` (#387)
+- Bumped `typescript` (#385, #386)
+- Bumped `yaml` (#384)
+- Bumped `@typescript-eslint/eslint-plugin` (#381, #292)
+- Bumped `@typescript-eslint/parser` (#286, #288)
+- Bumped `@vitest/coverage-v8` (#289, #380)
+- Bumped `@types/node` (#283, #291)
+
+### Python
+
+- Bumped `cryptography` (#355)
+- Migrated `PyPDF2` → `pypdf` (#355)
+
+## 🧹 Internal
+
+- Removed unused imports with autoflake in a2a-protocol (#340)
+- Added pytest markers for slow and integration tests (#375)
+- Added 10 AI-powered GitHub Actions workflows (#294)
+
+## Packages
+
+### Python (PyPI)
+
+| Package | Version | Status |
+|---------|---------|--------|
+| `agent-os-kernel` | 2.3.0 | Community Preview |
+| `agentmesh-platform` | 2.3.0 | Community Preview |
+| `agent-hypervisor` | 2.3.0 | Community Preview |
+| `agentmesh-runtime` | 2.3.0 | Community Preview _(renamed from `agent-runtime`)_ |
+| `agentmesh-marketplace` | 2.3.0 | Community Preview _(renamed from `agent-marketplace`)_ |
+| `agent-sre` | 2.3.0 | Community Preview |
+| `agent-governance-toolkit` | 2.3.0 | Community Preview |
+| `agent-lightning` | 2.3.0 | Community Preview |
+
+### npm
+
+| Package | Version | Status |
+|---------|---------|--------|
+| `@microsoft/agentmesh-sdk` | 1.0.0 | Community Preview |
+| `@microsoft/agentmesh-mcp-proxy` | 1.0.0 | Community Preview |
+| `@microsoft/agentos-mcp-server` | 1.0.1 | Community Preview |
+| `@microsoft/agentmesh-copilot-governance` | 0.1.0 | Community Preview |
+| `@microsoft/agentmesh-mastra` | 0.1.0 | Community Preview |
+| `@microsoft/agentmesh-api` | 0.1.0 | Community Preview |
+| `@microsoft/agent-os-copilot-extension` | 1.0.0 | Community Preview |
+
+### NuGet
+
+| Package | Version | Status |
+|---------|---------|--------|
+| `Microsoft.AgentGovernance` | 2.3.0 | Community Preview |
+
+## Contributors
+
+- @imran-siddique
+- @dependabot
+- @matt-van-horn
+- @jhawpetoss6-collab
+- @Bob
+- @AuthorPrime
+- @Copilot
+- @parsa-faraji-alamouti
+- @umesh-pal
+- @xavier-garceau-aranda
+- @zeel-desai
+- @aryan
+- @sharath-k
+- @yuchengpersonal
+
+## What's Coming
+
+- Official Microsoft-signed releases via ESRP Release (pending onboarding approval)
+- PyPI package ownership transfer to `microsoft` account
+- npm `@microsoft` scope activation via ESRP
+- NuGet Authenticode + NuGet package signing
+
+## Full Changelog
+
+See [CHANGELOG.md](CHANGELOG.md) for the complete list of changes.
+
+**Full Changelog:** https://github.com/microsoft/agent-governance-toolkit/compare/v2.2.0...v2.3.0
+
+## License
+
+[MIT](LICENSE) — © Microsoft Corporation
diff --git a/packages/agent-compliance/pyproject.toml b/packages/agent-compliance/pyproject.toml
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 
 [project]
 name = "agent-governance-toolkit"
-version = "2.2.0"
+version = "2.3.0"
 description = "Community Edition — Unified installer and runtime policy enforcement for the Agent Governance Toolkit"
 readme = "README.md"
 license = "MIT"
diff --git a/packages/agent-hypervisor/pyproject.toml b/packages/agent-hypervisor/pyproject.toml
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 
 [project]
 name = "agent-hypervisor"
-version = "2.2.0"
+version = "2.3.0"
 description = "Community Edition — Agent Hypervisor: Runtime supervisor for multi-agent Shared Sessions with Execution Rings, Joint Liability, Saga Orchestration, and hash-chained audit trails"
 readme = "README.md"
 license = "MIT"
diff --git a/packages/agent-lightning/pyproject.toml b/packages/agent-lightning/pyproject.toml
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 
 [project]
 name = "agent-lightning"
-version = "2.2.0"
+version = "2.3.0"
 description = "Community Edition — Agent-Lightning RL integration for the Agent Governance Toolkit: governed training with policy enforcement"
 readme = "README.md"
 license = "MIT"
diff --git a/packages/agent-mesh/pyproject.toml b/packages/agent-mesh/pyproject.toml
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 
 [project]
 name = "agentmesh-platform"
-version = "2.2.0"
+version = "2.3.0"
 description = "Community Edition — The Secure Nervous System for Cloud-Native Agent Ecosystems - Identity, Trust, Reward, Governance"
 readme = "README.md"
 license = {text = "MIT"}
diff --git a/packages/agent-os/pyproject.toml b/packages/agent-os/pyproject.toml
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 
 [project]
 name = "agent-os-kernel"
-version = "2.2.0"
+version = "2.3.0"
 description = "Community Edition — A kernel architecture for governing autonomous AI agents with Nexus Trust Exchange"
 readme = "README.md"
 license = "MIT"
diff --git a/packages/agent-runtime/pyproject.toml b/packages/agent-runtime/pyproject.toml
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 
 [project]
 name = "agentmesh-runtime"
-version = "2.2.0"
+version = "2.3.0"
 description = "Community Edition — AgentMesh Runtime: Execution supervisor for multi-agent sessions with privilege rings, saga orchestration, and audit trails"
 readme = "README.md"
 license = "MIT"
diff --git a/packages/agent-sre/pyproject.toml b/packages/agent-sre/pyproject.toml
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 
 [project]
 name = "agent-sre"
-version = "2.2.0"
+version = "2.3.0"
 description = "Community Edition — Reliability Engineering for AI Agent Systems"
 readme = "README.md"
 license = "MIT"