All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
Important
All releases are currently public preview releases. They are Microsoft-signed and production-quality but may have breaking changes before GA.
- Unified
agtCLI with plugin discovery, doctor command, and 79 tests (#924) - Governance Dashboard — real-time agent fleet visibility (#925)
- Agent Lifecycle Management — provisioning to decommission (#923)
- Agent Discovery Package — shadow AI discovery & inventory (#921)
- Quantum-Safe Signing — ML-DSA-65 alongside Ed25519 (#927)
- Vendor Independence Enforcement across all core packages
- OWASP ASI 2026 Taxonomy Migration with reference architecture
- PromptDefenseEvaluator — 12-vector prompt audit (#854)
- EU AI Act Risk Classifier (
agentmesh.governance.EUAIActRiskClassifier) — structured risk classification per Article 6 and Annex III, with Art. 6(1) Annex I safety-component path, Art. 6(3) exemptions, GDPR Art. 4(4) profiling override, and configurable YAML categories for regulatory updates (#756)
- Patched dependency verification bypass and trust handshake DID forgery (#920)
- Hardened CLI Error Handling — standardized sanitized JSON error output across all 7 ecosystem tools to prevent internal information disclosure (CWE-209)
- Audit Log Whitelisting — implemented strict key-whitelisting in
agentmesh auditJSON output to prevent accidental leakage of sensitive agent internal state - CLI Input Validation — added regex-based validation for agent identifiers (DIDs/names) in registration and verification commands to prevent injection attacks
- Repo hygiene: MIT headers, compliance disclaimers, dependency confusion, network bindings (#926)
- CI: pyyaml added to agent-compliance direct dependencies
- Code samples updated to v3 API
- Various dependency bumps (cryptography, path-to-regexp, etc.)
- Modern Agent Architecture overview for enterprise sharing
- NIST AI RMF 1.0 alignment assessment
- MCP governance consolidated into docs/compliance/
- Policy-as-code tutorial chapter 4
- Added
EUAIActRiskClassifierusage example and API docs topackages/agent-mesh/README.md - Updated
QUICKSTART.mdandTutorial 04 — Audit & Compliancewith secure JSON error handling examples and schema details - Added "Secure Error Handling" sections to primary documentation to guide users on interpreting sanitized machine-readable outputs
- Comprehensive security audit remediation (29 findings fixed)
- CI injection prevention: moved all github.event expressions to env blocks
- Supply chain hardening: dependency confusion fixes, npm lockfiles, Dockerfile pinning
- Docker/infra: removed hardcoded passwords, wildcard CORS, added .dockerignore exclusions
- Code quality: XSS prevention in VS Code webviews, Rust panic safety
- Version pinning compliance across all pyproject.toml and Cargo.toml files
- Extended dependency confusion detection script coverage
- Rust SDK (
agentmeshcrate) for native governance integration - Go SDK module for policy, trust, audit, and identity
- Trust report CLI command (
agentmesh trust report) - Secret scanning workflow (Gitleaks)
- 4 new fuzz targets (prompt injection, MCP scanner, sandbox, trust scoring)
- Dependabot coverage expanded to 13 ecosystems (+ cargo, gomod, nuget, docker)
- 7 new tutorials (Rust SDK, Go SDK, delegation chains, budgets, security, SBOM, MCP scan)
- ESRP Release publishing for Rust crates (crates.io)
- Entra Agent ID adapter for managed identity integration
- Secure code generation templates with AST validation
- SBOM generation (SPDX/CycloneDX) with Ed25519 artifact signing
- Tenant isolation checklist and private endpoint deployment examples
- ADO build failures: shebang position (TS18026), Express 5 type narrowing (TS2345)
- NuGetCommand@2 → DotNetCoreCLI@2 for Ubuntu 24.04 compatibility
- path-to-regexp ReDoS vulnerability (8.3.0 → 8.4.0)
- Python 3.10 CI matrix exclusions for packages requiring >=3.11
- TypeScript eslint peer dependency conflicts resolved
- Rust crate dependency pins (rand 0.8, sha2 0.10, thiserror 1)
- Ruff lint errors in agent-sre (E741, F401, E401)
- Policy provider test mock contract alignment
- Dify integration removed from CI (archived package)
- Notebook dependency scanner regex hardened
- PUBLISHING.md rewritten with full Microsoft compliance policies (MCR, ESRP, Conda, PMC)
- Branch protection: 13 required status checks, dismiss stale reviews, squash-only merges
- README updated with 5 SDK languages, 20+ framework integrations, security tooling table
- Official Microsoft-Signed Public Preview — all packages are now published via ESRP Release with Microsoft signing
- All package descriptions updated from "Community Edition" to "Public Preview"
- All Development Status classifiers standardized to "4 - Beta"
- Package
agent-lightningrenamed toagentmesh-lightningon PyPI - All personal author references replaced with Microsoft Corporation
- Contact email updated to agentgovtoolkit@microsoft.com
- Removed all merge conflict markers from docs
- Updated all old PyPI package name references (agent-runtime → agentmesh-runtime, agent-lightning → agentmesh-lightning) across README, QUICKSTART, tutorials, workflows, and scripts
- ESRP pipeline service connection hardcoded for ADO compile-time requirement
- ESRP pipeline
eachdirective syntax fixed in Verify stages - License format updated to SPDX string (setuptools deprecation fix)
- MCP server allowlist/blocklist and plugin trust tiers (#425, #426)
- Plugin schema adapters and batch evaluation (#424, #429)
- Governance policy linter CLI command (#404)
- Pre-commit hooks for plugin manifest validation (#428)
- GitHub Actions action for governance verification (#423)
- Event bus, task outcomes, diff policy, and sandbox provider (#398, #396, #395, #394)
- Graceful degradation, budget policies, and audit logger (#410, #409, #400)
- JSON schema validation for governance policies (#305, #367)
- 14 launch-ready tutorials (07–20) covering all toolkit features
- Tutorials landing page README with learning paths (#422)
- Copilot instructions with PR review checklist (#413)
- Pytest markers for slow and integration tests (#375)
- Reference integration example for plugin marketplace governance (#427)
- Renamed PyPI package
agent-runtime→agentmesh-runtime(name collision with AutoGen) (#444) - Renamed PyPI package
agent-marketplace→agentmesh-marketplace(#439) - Renamed PyPI package
agent-lightning→agentmesh-lightning(name collision on PyPI)
- ESRP pipeline
eachdirective syntax in Verify stages - ESRP pipelines updated to use
ESRP_CERT_IDENTIFIERsecret - Hardcoded service connection name (ADO compile-time requirement) (#421)
- License format updated to SPDX string (setuptools deprecation) in agent-compliance and agent-lightning
- Corrected license reference in AgentMesh README from Apache 2.0 to MIT (#436)
- .NET GovernanceMetrics test isolation — flush listener before baseline (#417)
- Dependency confusion + pydantic dependency fix (#412)
- Enforced maintainer approval for all external PRs (#392)
- Moved all ESRP config to pipeline secrets (#370)
- Standardized package README badges (#373)
- Added README files to example and skill integration directories (#371, #372, #390)
- Added requirements for example directories (#372)
- ESRP Release ADO pipeline for PyPI publishing (
pipelines/pypi-publish.yml) - ESRP Release ADO pipeline for npm publishing (
pipelines/npm-publish.yml) - npm build + pack job in GitHub Actions publish workflow
- Community preview disclaimers across all READMEs, release notes, and package descriptions
PUBLISHING.mdguide covering PyPI, npm, and NuGet publishing requirementsagent-runtimere-export wrapper package (src/agent_runtime/__init__.py)RELEASE_NOTES_v2.2.0.mdcreate_policies_from_config()API — load security policies from YAML config filesSQLPolicyConfigdataclass andload_sql_policy_config()for structured policy loading- 10 sample policy configs in
examples/policies/(sql-safety, sql-strict, sql-readonly, sandbox-safety, prompt-injection-safety, mcp-security, semantic-policy, pii-detection, conversation-guardian, cli-security-rules) - Configurable security rules across 7 modules: sandbox, prompt injection, MCP security, semantic policy, PII detection, conversation guardian, CLI checker
- GitHub Actions
publish.ymlno longer publishes to PyPI (build + attest only) - Python package author updated to
Microsoft Corporationwith team DL (all 7 packages) - npm packages renamed to
@microsoftscope (from@agentmesh,@agent-os, unscoped) - npm package author set to
Microsoft Corporation(all 9 packages) - All package descriptions prefixed with
Community Edition - License corrected to MIT where mismatched (agent-mesh classifier, 2 npm packages)
create_default_policies()— emits runtime warning directing users tocreate_policies_from_config()with explicit YAML configs
- Expanded SQL policy deny-list to block GRANT, REVOKE, CREATE USER, EXEC xp_cmdshell, UPDATE without WHERE, MERGE INTO
- Externalized all hardcoded security rules to YAML configuration across 7 modules
agent-runtimebuild failure (invalid parent-directory hatch reference)- Missing
License :: OSI Approved :: MIT Licenseclassifier in 3 Python packages - Incorrect repository URLs in 2 npm packages
2.1.0 - 2026-03-15
Multi-language SDK readiness, TypeScript full parity, .NET NuGet hardening, 70+ commits since v1.1.0. This release makes the toolkit a true polyglot governance layer — Python, TypeScript, and .NET are all first-class citizens with install instructions, quickstarts, and package metadata ready for registry publishing.
- TypeScript SDK full parity (— PolicyEngine + AgentIdentity) — rich policy evaluation with 4 conflict resolution strategies, expression evaluator, rate limiting, YAML/JSON policy documents, Ed25519 identity with lifecycle/delegation/JWK/JWKS/DID export, IdentityRegistry with cascade revocation. 136 tests passing. (#269)
- @agentmesh/sdk 1.0.0 — TypeScript package now publish-ready with
exportsfield,prepublishOnlybuild hook, correctrepository.directory, MIT license. - Multi-language README — root README now surfaces Python (PyPI), TypeScript (npm), and .NET (NuGet) install sections, badges, quickstart code, and a multi-SDK packages table.
- Multi-language QUICKSTART — getting started guide now covers all three SDKs with code examples.
- Semantic Kernel + Azure AI Foundry added to framework integration table.
- 5 standalone framework quickstarts — one-file runnable examples for LangChain, CrewAI, AutoGen, OpenAI Agents, Google ADK.
- Competitive comparison page — vs NeMo Guardrails, Guardrails AI, LiteLLM, Portkey (
docs/COMPARISON.md). - GitHub Copilot Extension — agent governance code review extension for Copilot.
- Observability integrations — Prometheus, OpenTelemetry, PagerDuty, Grafana (#49).
- NIST RFI mapping — question-by-question mapping to NIST AI Agent Security RFI 2026-00206 (#29).
- Performance benchmarks — published BENCHMARKS.md with p50/p99 latency, throughput at 50 concurrent agents (#231).
- 6 comprehensive governance tutorials — policy engine, trust & identity, framework integrations, audit & compliance, agent reliability, execution sandboxing (#187).
- Azure deployment guides — AKS, Azure AI Foundry, Container Apps, OpenClaw sidecar.
- agent-governance (formerly
ai-agent-compliance): Renamed PyPI package for better discoverability. - README architecture disclaimer reframed from apology to confidence — leads with enforcement model, composes with container isolation (#240).
- README tagline updated for OWASP 10/10 discoverability.
- .NET NuGet metadata enhanced — Authors, License, RepositoryUrl, Tags, ReadmeFile in csproj.
- All example install strings updated from
ai-agent-compliance[full]toagent-governance[full]. - Demo fixed: legacy
agent-hypervisorpath →agent-runtime. - BENCHMARKS.md: fixed stale "VADP version" reference.
- Demo fixed: legacy
agent-hypervisorpath →agent-runtime. - BENCHMARKS.md: fixed stale "VADP version" reference.
- .NET bug sweep — thread safety, error surfacing, caching, disposal fixes (#252).
- Behavioral anomaly detection implemented in RingBreachDetector.
- CLI edge case tests and input validation for agent-compliance (#234).
- Cross-package import errors breaking CI resolved (#222).
- OWASP-COMPLIANCE.md broken link fix + Copilot extension server hardening (#270).
- CostGuard org kill switch bypass — crafted IEEE 754 inputs (NaN/Inf/negative) could bypass organization-level kill switch. Fixed with input validation + persistent
_org_killedflag (#272). - CostGuard thread safety — bound breach history + Lock for concurrent access (#253).
- ErrorBudget._events bounded with
deque(maxlen=N)to prevent unbounded growth (#172). - VectorClock thread safety + integrity type hints (#243).
- Block
importlibdynamic imports in sandbox (#189). - Centralize hardcoded ring thresholds and constants (#188).
- Phase 3 architecture rename propagated across 52 files (#221).
- Deferred architecture extractions — slim OS init, marketplace, lightning (#207).
- Architecture naming review and layer consolidation (#206).
- agentmesh-integrations migrated into monorepo (#138).
- CI test matrix updated with agentmesh-integrations packages (#226).
- OpenSSF Scorecard improved from 5.3 to ~7.7 (#113, #137).
# Python
pip install agent-governance-toolkit[full]
# TypeScript
npm install @agentmesh/sdk
# .NET
dotnet add package Microsoft.AgentGovernance- agent-runtime: Version bump to align with mono-repo versioning
- Block
importlibdynamic imports in sandbox (#189)
- agent-runtime: Centralize hardcoded ring thresholds and constants (#188)
1.1.0 - 2026-03-08
15 issues closed, 339+ tests added, 12 architectural features shipped — in 72 hours from first analysis to merged code. This release transforms the toolkit from a well-structured v1.0 into an enterprise-hardened governance layer with real adversarial durability.
- Policy conflict resolution engine — 4 declared strategies (
DENY_OVERRIDES,ALLOW_OVERRIDES,PRIORITY_FIRST_MATCH,MOST_SPECIFIC_WINS) with 3-tier policy scope model (global → tenant → agent) and auditable resolution trace. Answers the question every security architect will ask: "if two policies conflict, which wins?" (#91) - Session policy pinning —
create_context()now deep-copies policy so running sessions get immutable snapshots. Mid-flight policy mutations no longer leak into active sessions. (#92) - Tool alias registry — Canonical capability mapping for 7 tool families (30+ aliases) prevents policy bypass via tool renaming.
bing_searchcan no longer dodge aweb_searchblock. (#94) - Human-in-the-loop escalation —
EscalationPolicywithESCALATEtier,InMemoryApprovalQueue, andWebhookApprovalBackend. Adds the suspend-and-route-to-human path required by regulated industries (healthcare, finance, legal). (#81)
- Inter-package version compatibility matrix —
doctor()function with runtime compatibility checking across all 5 packages. Detects silent version skew before it causes trust handshake failures. (#83) - Credential lifecycle management — Wired
RevocationListintoCardRegistry.is_verified()so revoked credentials are actually rejected. Key rotation now has a kill path. (#82) - File-backed trust persistence —
FileTrustStorewith JSON persistence, atomic writes, and thread safety. Trust scores survive agent restarts — misbehaving agents can no longer reset reputation by crashing. (#86) - Policy schema versioning —
apiVersionfield with validation, migration tooling, and deprecation warnings. Schema evolution in v1.2+ won't silently break existing policy files. (#87)
- Bootstrap integrity verification —
IntegrityVerifierhashes 15 governance module source files and 4 critical function bytecodes (SHA-256) against a publishedintegrity.jsonmanifest. Detects supply chain tampering before any policy evaluation occurs. (#95) - Governance certification CLI —
agent-governance verifychecks all 10 OWASP ASI 2026 controls, generates signed attestations, and outputs shields.io badges for README embedding.agent-governance integrity --generatecreates baseline manifests for release signing.
- SIGKILL-analog process isolation — Real
os.kill(SIGKILL)for Linux,TerminateProcessfor Windows, with PID tracking and cgroup integration. Not a simulated kill — actual process-level termination. (#77) - OpenTelemetry observability —
GovernanceTracerwith distributed traces, span events for policy checks, custom metrics (policy evaluations, violations, latency histograms), and OTLP exporter integration. (#76) - Async concurrency safety —
asyncio.Lockguards on shared state,ConcurrencyStatstracking, deadlock detection with configurable timeouts. Concurrent agent evaluations no longer corrupt trust scores. (#75) - Policy-as-code CI pipeline —
PolicyCIclass with YAML linting, schema validation, conflict detection, and dry-run simulation. Integrates with GitHub Actions for PR-time policy validation. (#74) - Deep framework integrations —
LangChainGovernanceCallback,CrewAIGovernanceMiddleware,AutoGenGovernanceHookwith framework-specific lifecycle hooks, not just wrapper-level interception. (#73) - External audit trail integrity —
SignedAuditEntrywith Ed25519 signatures,HashChainVerifierfor tamper detection,FileAuditSinkfor append-only external storage. Cryptographic proof that audit logs haven't been modified. (#72) - Behavioral anomaly detection — Statistical anomaly detection for agent behavior patterns (tool call frequency, response time, error rate) with configurable sensitivity. Catches rogue agents before they violate explicit rules. (#71)
- Copilot auto-review workflow — Automated PR review on every pull request. (#70)
- 7 production module ports — Episodic Memory Kernel, CMVK, Self-Correcting Agent Kernel, Context-as-a-Service, Agent Control Plane, Trust Engine, Mute Agent infrastructure — ported from internal production with full test coverage. (#63–#69)
- 44 code scanning alerts resolved — CodeQL SAST findings across the entire repository including CWE-209 (error information exposure), CWE-116 (improper encoding), and CWE-20 (improper input validation). (#79)
- All cryptographic operations use real Ed25519 primitives (not placeholder/XOR).
- Prompt injection defense verified:
prompt_injection.py+ LlamaFirewall +OutputValidationMiddleware. - SLO alerting verified:
AlertManagerwith Slack, PagerDuty, Teams, and OpsGenie channels.
- 339+ new tests across all features with full assertion coverage.
- All 5 packages pass CI independently.
pip install agent-governance-toolkit[full]1.0.1 - 2026-03-06
- CODEOWNERS — Default and per-package code ownership for review routing.
- SBOM workflow — Generates SPDX-JSON and CycloneDX-JSON on every release
with GitHub attestation via
actions/attest-sbom.
- Microsoft org release — First publish from
microsoft/agent-governance-toolkit - Added MIT license headers to 1,159 source files across all packages.
- Migrated all 215 documentation URLs from personal repos to Microsoft org.
- Replaced personal email references with team alias (
agentgovtoolkit@microsoft.com). - Enhanced README with hero section, CI badge, navigation links, CLA/Code of Conduct sections.
- Bumped all 5 package versions from 1.0.0 to 1.0.1.
- Fixed
agentmeshPyPI link toagentmesh-platform(correct package name). - Removed internal feed reference from providers.py.
- Secret scan verified clean — no keys, tokens, or credentials in repository.
pip-auditverified 0 known vulnerabilities across all packages.- All 43 OSV vulnerabilities from v1.0.0 confirmed resolved.
- Archived 6 personal repos with deprecation banners and migration notices.
- Closed 83 open issues and annotated 596 closed items with migration links.
- Posted migration announcements to 89 stargazers.
- Enabled GitHub Discussions, 12 topic tags, OpenSSF Scorecard.
1.0.0 - 2026-03-04
- Agent OS Kernel (
agent-os-kernel) — Policy-as-code enforcement engine with syscall-style interception, OWASP ASI 2026 compliance, and Microsoft Agent Framework (MAF) native middleware adapter. - AgentMesh (
agentmesh) — Zero-trust inter-agent identity mesh with SPIFFE-based identity, DID-linked credentials, Microsoft Entra Agent ID adapter, and AI-BOM v2.0 supply-chain provenance. - Agent Runtime (
agent-runtime) — Runtime sandboxing with capability-based isolation, resource quotas, and Docker/Firecracker execution environments. - Agent SRE (
agent-sre) — Observability toolkit with chaos-engineering probes, canary deployment framework, and automated incident response. - Agent Compliance (
agent-governance, formerlyai-agent-compliance) — Unified compliance installer mapping OWASP ASI 2026 (10/10), NIST AI RMF, EU AI Act, and CSA Agentic Trust Framework. - Mono-repo CI/CD: lint (ruff) × 5 packages, test matrix (3 Python versions × 4 packages), security scanning (safety), CodeQL SAST (Python + JavaScript).
- Dependabot configuration for 8 ecosystems.
- OpenSSF Best Practices badge and Scorecard integration.
- Comprehensive governance proposal documents for standards bodies (OWASP, CoSAI, LF AI & Data).
- CVE-2025-27520 — Bumped
python-multipartto ≥0.0.20 (arbitrary file write). - CVE-2024-53981 — Bumped
python-multipartto ≥0.0.20 (DoS via malformed boundary). - CVE-2024-47874 — Bumped
python-multipartto ≥0.0.20 (Content-Type ReDoS). - CVE-2024-5206 — Bumped
scikit-learnto ≥1.6.1 (sensitive data leakage). - CVE-2023-36464 — Replaced deprecated
PyPDF2withpypdf≥4.0.0 (infinite loop). - Removed exception details from HTTP error responses (CWE-209).
- Redacted PII (patient IDs, SSNs) from example log output (CWE-532).
- Fixed ReDoS patterns in policy library regex (CWE-1333).
- Fixed incomplete URL validation in Chrome extension (CWE-20).
- Pinned all GitHub Actions by SHA hash.
- Pinned all Docker base images by SHA256 digest.
- Removed
gradle-wrapper.jarbinary artifact.