aegis
Policy-based governance for AI agent tool calls. YAML policies, approval gates, risk assessment, and audit logging across LangChain, OpenAI, Anthropic, and MCP.
README
<!-- mcp-name: io.github.Acacian/aegis --> <p align="center"> <h1 align="center">Aegis</h1> <p align="center"> <strong>The simplest way to govern AI agent actions. No infra. No lock-in. Just Python.</strong> </p> <p align="center"> <code>pip install agent-aegis</code> → YAML policy → governance in 5 minutes.<br/> <strong>Works with LangChain, CrewAI, OpenAI, Anthropic, MCP, and more.</strong> </p> </p>
<p align="center"> <a href="https://github.com/Acacian/aegis/actions/workflows/ci.yml"><img src="https://github.com/Acacian/aegis/actions/workflows/ci.yml/badge.svg" alt="CI"></a> <a href="https://pypi.org/project/agent-aegis/"><img src="https://img.shields.io/pypi/v/agent-aegis?color=blue&cacheSeconds=3600" alt="PyPI"></a> <a href="https://pypi.org/project/langchain-aegis/"><img src="https://img.shields.io/pypi/v/langchain-aegis?label=langchain-aegis&color=blue&cacheSeconds=3600" alt="langchain-aegis"></a> <a href="https://pypi.org/project/agent-aegis/"><img src="https://img.shields.io/pypi/pyversions/agent-aegis?cacheSeconds=3600" alt="Python"></a> <a href="https://github.com/Acacian/aegis/blob/main/LICENSE"><img src="https://img.shields.io/badge/License-MIT-blue.svg" alt="License"></a> <a href="https://acacian.github.io/aegis/"><img src="https://img.shields.io/badge/docs-acacian.github.io%2Faegis-blue" alt="Docs"></a> <a href="https://pypi.org/project/agent-aegis/"><img src="https://img.shields.io/pypi/dm/agent-aegis?label=downloads&color=brightgreen" alt="Downloads"></a> <a href="https://github.com/Acacian/aegis"><img src="https://img.shields.io/github/stars/Acacian/aegis?style=social" alt="GitHub stars"></a> <br/> <a href="https://github.com/Acacian/aegis/actions/workflows/ci.yml"><img src="https://img.shields.io/badge/tests-2238_passed-brightgreen" alt="Tests"></a> <a href="https://github.com/Acacian/aegis/actions/workflows/ci.yml"><img src="https://img.shields.io/badge/coverage-92%25-brightgreen" alt="Coverage"></a> <a href="https://acacian.github.io/aegis/playground/"><img src="https://img.shields.io/badge/playground-Try_it_Live-ff6b6b" alt="Playground"></a> </p>
<p align="center"> <a href="https://acacian.github.io/aegis/playground/"><strong>Try it Live in Your Browser</strong></a> • <a href="#quick-start">Quick Start</a> • <a href="#how-it-works">How It Works</a> • <a href="https://acacian.github.io/aegis/">Documentation</a> • <a href="#integrations">Integrations</a> • <a href="https://github.com/Acacian/aegis/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22">Contributing</a> </p>
<p align="center"> <b>English</b> • <a href="./README.ko.md">한국어</a> </p>
Without Aegis vs. With Aegis
Without Aegis — scattered if/else, no audit trail, breaks when you add new tools:
async def handle_agent_action(tool_name, args):
if tool_name == "delete_users":
raise Exception("Blocked") # fragile
if tool_name.startswith("bulk_") and args.get("count", 0) > 100:
approved = await ask_slack_approval(tool_name, args) # custom per-tool
if not approved:
raise Exception("Denied")
if tool_name == "deploy" and datetime.now().hour >= 18:
raise Exception("No deploys after hours") # hardcoded
result = await execute(tool_name, args)
# No audit. No consistency. Repeat for every agent, every framework.
return result
With Aegis — one YAML file governs everything, with audit trail and approval gates built in:
# policy.yaml
rules:
- name: block_deletes
match: { type: "delete*" }
approval: block
- name: bulk_approval
match: { type: "bulk_*" }
conditions: { param_gt: { count: 100 } }
approval: approve # asks human via Slack, CLI, Discord, etc.
- name: no_after_hours
match: { type: "deploy*" }
conditions: { time_after: "18:00" }
approval: block
from aegis import Policy, Runtime
runtime = Runtime(executor=my_executor, policy=Policy.from_yaml("policy.yaml"))
result = await runtime.run_one(action) # policy check + approval + audit — done.
One pip install. One YAML file. Works across LangChain, CrewAI, OpenAI, Anthropic, and MCP. Full audit trail, human approval gates, and regulatory compliance — without deploying a single server.
How It Works
Core Concepts
Aegis has 3 key components. You need to understand these to use it:
| Concept | What it is | Your responsibility |
|---|---|---|
| Policy | YAML rules that define what's allowed, what needs approval, and what's blocked. | Write the rules. |
| Executor | The adapter that actually does things (calls APIs, clicks buttons, runs queries). | Provide one, or use a built-in adapter. |
| Runtime | The engine that connects Policy + Executor. Evaluates rules, gates approval, executes, logs. | Create it. Call run_one() or plan() + execute(). |
The Pipeline
Every action goes through 5 stages. This happens automatically -- you just call runtime.run_one(action):
1. EVALUATE Your action is matched against policy rules (glob patterns).
→ PolicyDecision: risk level + approval requirement + matched rule
2. APPROVE Based on the decision:
- auto: proceed immediately (low-risk actions)
- approve: ask a human via CLI, Slack, Discord, Telegram, webhook, or email
- block: reject immediately (dangerous actions)
3. EXECUTE The Executor carries out the action.
Built-in: Playwright (browser), httpx (HTTP), LangChain, CrewAI, OpenAI, Anthropic, MCP
Custom: extend BaseExecutor (10 lines)
4. VERIFY Optional post-execution check (override executor.verify()).
5. AUDIT Every decision and result is logged to SQLite automatically.
Export: JSONL, webhook, or query via CLI/API.
Two Ways to Use
Option A: Python library (most common) -- no server needed.
Import Aegis into your agent code. Everything runs in the same process.
runtime = Runtime(executor=MyExecutor(), policy=Policy.from_yaml("policy.yaml"))
result = await runtime.run_one(Action("read", "crm"))
Option B: REST API server -- for non-Python agents (Go, TypeScript, etc.).
pip install 'agent-aegis[server]'
aegis serve policy.yaml --port 8000
curl -X POST localhost:8000/api/v1/evaluate \
-d '{"action_type": "delete", "target": "db"}'
# => {"risk_level": "CRITICAL", "approval": "block", "is_allowed": false}
Approval Handlers
When a policy rule requires approval: approve, Aegis asks a human. You choose how:
| Handler | How it works | Status |
|---|---|---|
| CLI (default) | Terminal Y/N prompt | Stable |
| Slack | Posts Block Kit message, polls thread replies | Stable |
| Discord | Sends rich embed, polls callback | Stable |
| Telegram | Inline keyboard buttons, polls getUpdates | Stable |
| Webhook | POSTs to any URL, reads response | Stable |
| Sends approval request via SMTP, polls mailbox | Beta | |
| Auto | Approves everything (for testing / server mode) | Stable |
| Custom | Extend ApprovalHandler with your own logic |
Stable |
Audit Trail
Every action is automatically logged to a local SQLite database. No setup required.
aegis audit # View all entries
aegis audit --risk-level HIGH # Filter by risk
aegis audit --tail # Live monitoring (1s poll)
aegis stats # Statistics per rule
aegis audit --format jsonl -o export.jsonl # Export
Quick Start
pip install agent-aegis
1. Generate a policy
aegis init # Creates policy.yaml with sensible defaults
# policy.yaml
version: "1"
defaults:
risk_level: medium
approval: approve
rules:
- name: read_safe
match: { type: "read*" }
risk_level: low
approval: auto
- name: bulk_ops_need_approval
match: { type: "bulk_*" }
conditions:
param_gt: { count: 100 } # Only when count > 100
risk_level: high
approval: approve
- name: no_deletes
match: { type: "delete*" }
risk_level: critical
approval: block
2. Add to your agent
import asyncio
from aegis import Action, Policy, Runtime
from aegis.adapters.base import BaseExecutor
from aegis.core.result import Result, ResultStatus
class MyExecutor(BaseExecutor):
async def execute(self, action):
print(f" Executing: {action.type} -> {action.target}")
return Result(action=action, status=ResultStatus.SUCCESS)
async def main():
async with Runtime(
executor=MyExecutor(),
policy=Policy.from_yaml("policy.yaml"),
) as runtime:
plan = runtime.plan([
Action("read", "crm", description="Fetch contacts"),
Action("bulk_update", "crm", params={"count": 150}),
Action("delete", "crm", description="Drop table"),
])
print(plan.summary())
results = await runtime.execute(plan)
asyncio.run(main())
3. See what happened
aegis audit
ID Session Action Target Risk Decision Result
1 a1b2c3d4... read crm LOW auto success
2 a1b2c3d4... bulk_update crm HIGH approved success
3 a1b2c3d4... delete crm CRITICAL block blocked
Features
Core — what you get out of the box:
| YAML policies | Glob matching, first-match-wins, smart conditions (time_after, param_gt, weekdays, regex, etc.) |
| 4-tier risk model | low / medium / high / critical with per-rule overrides |
| Approval gates | CLI, Slack, Discord, Telegram, email, webhook, or custom |
| Audit trail | Automatic SQLite logging. Export: JSONL, webhook, or query via CLI/API |
| 7 adapters | LangChain, CrewAI, OpenAI Agents, Anthropic, MCP, Playwright, httpx |
| REST API + Dashboard | aegis serve policy.yaml — web UI with KPIs, audit log, compliance reports |
Enterprise — production-grade governance:
| Cryptographic audit chain | SHA-256/SHA3-256 hash-linked tamper-evident trail (EU AI Act Art.12, SOC2 CC7.2) |
| Regulatory mapper | EU AI Act, NIST AI RMF, SOC2, ISO 42001, OWASP Agentic Top 10 — gap analysis + evidence |
| Behavioral anomaly detection | Per-agent profiling, auto-policy generation from observed behavior |
| RBAC | 12 permissions, 5 hierarchical roles, thread-safe AccessController |
| Multi-tenant isolation | TenantContext, quota enforcement, data separation |
| Policy versioning | Git-like commit, diff, rollback, tagging |
MCP Supply Chain Security — defense-in-depth for MCP tool calls:
| Tool poisoning detection | 10 regex patterns against Unicode-normalized text, schema recursion |
| Rug pull detection | SHA-256 hash pinning, definition change alerts |
| Argument sanitization | Path traversal, command injection, null byte detection |
| Trust scoring (L0-L4) | Automated trust levels from scan + pin + audit status |
| Vulnerability database | 8 built-in CVEs for popular MCP servers, version-range matching, auto-block |
| SBOM generation | CycloneDX-inspired bill of materials with vulnerability overlay |
| Session replay | Record/replay agent sessions with retroactive security scanning (20 patterns) |
Multi-Agent Governance:
| Cost circuit breaker | 17 model pricing entries, loop detection, hierarchical budgets, thread-safe |
| Cross-framework cost tracking | LangChain + OpenAI + Anthropic + Google → unified CostTracker |
| Multi-agent cost attribution | Delegation trees, subtree rollup, formatted attribution reports |
| A2A communication governance | Capability-gated messaging, PII/credential redaction, rate limiting, audit log |
| Policy-as-code Git integration | Diff formatting, impact analysis, drift detection, YAML export |
| OpenTelemetry export | Policy/cost/anomaly/MCP events → OTel spans, in-memory fallback |
Developer experience:
aegis scan |
AST-based detection of ungoverned AI calls in your codebase |
aegis probe |
Adversarial policy testing — glob bypass, missing coverage, escalation |
aegis autopolicy |
Natural language → YAML ("block deletes, allow reads") |
aegis score |
Governance coverage 0-100 with shields.io badge |
| Policy-as-Code SDK | Fluent PolicyBuilder API for programmatic construction |
| GitHub Action | CI/CD governance gates in your pipeline |
| 9 policy templates | Pre-built for CRM, finance, DevOps, healthcare, and more |
| Interactive playground | Try in browser — no install needed |
Real-World Use Cases
| Scenario | Policy | Outcome |
|---|---|---|
| Finance | Block bulk transfers > $10K without CFO approval | Agents can process invoices safely; large amounts trigger Slack approval |
| SaaS Ops | Auto-approve reads; require approval for account mutations | Support agents handle tickets without accidentally deleting accounts |
| DevOps | Allow deploys Mon-Fri 9-5; block after hours | CI/CD agents can't push to prod at 3am |
| Data Pipeline | Block DELETE on production tables; auto-approve staging | ETL agents can't drop prod data, even if the LLM hallucinates |
| Compliance | Log every external API call with full context | Auditors get a complete trail for SOC2 / GDPR evidence |
Policy Templates
Pre-built YAML policies for common industries. Copy one, customize it, deploy:
| Template | Use Case | Key Rules |
|---|---|---|
crm-agent.yaml |
Salesforce, HubSpot, CRM | Read=auto, Write=approve, Delete=block |
code-agent.yaml |
Cursor, Copilot, Aider | Read=auto, Shell=high, Deploy=block |
financial-agent.yaml |
Payments, invoicing | View=auto, Payments=approve, Transfers=critical |
browser-agent.yaml |
Playwright, Selenium | Navigate=auto, Click=approve, JS eval=block |
data-pipeline.yaml |
ETL, database ops | SELECT=auto, INSERT=approve, DROP=block |
devops-agent.yaml |
CI/CD, infrastructure | Monitor=auto, Deploy=approve, Destroy=block |
healthcare-agent.yaml |
Healthcare, HIPAA | Search=auto, PHI=approve, Delete=block |
ecommerce-agent.yaml |
Online stores | View=auto, Refund=approve, Delete=block |
support-agent.yaml |
Customer support | Read=auto, Respond=approve, Delete=block |
policy = Policy.from_yaml("policies/crm-agent.yaml")
Production Ready
| Aspect | Detail |
|---|---|
| 1,950+ tests, 92% coverage | Every adapter, handler, and edge case tested |
| Type-safe | mypy --strict with zero errors, py.typed marker |
| Performance | Policy evaluation < 1ms; auto-approved actions add < 5ms overhead |
| Fail-safe | Blocked actions never execute; can't be bypassed without policy change |
| Audit immutability | Results are frozen dataclasses; audit writes happen before returning |
| No magic | Pure Python, no monkey-patching, no global state |
Compliance & Audit
Aegis audit trails provide evidence for regulatory and internal compliance:
| Standard | What Aegis provides |
|---|---|
| SOC2 | Immutable audit log of every agent action, decision, and approval |
| GDPR | Data access documentation -- who/what accessed which system and when |
| HIPAA | PHI access trail with full action context and approval chain |
| Internal | Change management evidence, risk assessment per action |
Export as JSONL, query via CLI/API, or stream to external SIEM via webhook. For defense-in-depth with container isolation, see the Security Model guide.
Integrations
Works with the agent frameworks you already use:
pip install langchain-aegis # LangChain (standalone integration)
pip install 'agent-aegis[langchain]' # LangChain (adapter)
pip install 'agent-aegis[crewai]' # CrewAI
pip install 'agent-aegis[openai-agents]' # OpenAI Agents SDK
pip install 'agent-aegis[anthropic]' # Anthropic Claude
pip install 'agent-aegis[httpx]' # Webhook approval/audit
pip install 'agent-aegis[playwright]' # Browser automation
pip install 'agent-aegis[server]' # REST API server
pip install 'agent-aegis[all]' # Everything
<details> <summary><b>LangChain</b> -- govern any LangChain tool with one function call</summary>
Option A: langchain-aegis (recommended) — standalone integration package
pip install langchain-aegis
from langchain_aegis import govern_tools
# Add governance to existing tools — no other code changes
governed = govern_tools(tools, policy="policy.yaml")
agent = create_react_agent(model, governed)
Option B: AgentMiddleware — intercepts every tool call via LangChain's middleware protocol
from aegis.adapters.langchain import AegisMiddleware
middleware = AegisMiddleware(policy=Policy.from_yaml("policy.yaml"))
# Blocked calls return a ToolMessage explaining the policy violation
# Allowed calls proceed normally
Option C: Executor/Tool adapter
from aegis.adapters.langchain import LangChainExecutor, AegisTool
executor = LangChainExecutor(tools=[DuckDuckGoSearchRun()])
runtime = Runtime(executor=executor, policy=Policy.from_yaml("policy.yaml"))
</details>
<details> <summary><b>OpenAI Agents SDK</b> -- native guardrails + decorator governance</summary>
Option A: Native guardrails (recommended) — uses SDK's @tool_input_guardrail / @tool_output_guardrail
from agents import function_tool
from aegis import Policy
from aegis.adapters.openai_agents import (
create_aegis_input_guardrail,
create_aegis_output_guardrail,
)
policy = Policy.from_yaml("policy.yaml")
input_guard = create_aegis_input_guardrail(policy=policy, fail_closed=True)
output_guard = create_aegis_output_guardrail(policy=policy)
@function_tool(
tool_input_guardrails=[input_guard],
tool_output_guardrails=[output_guard],
)
def web_search(query: str) -> str:
"""Search the web -- Aegis evaluates before AND after execution."""
return do_search(query)
Option B: Decorator-based — wraps function with full governance pipeline
from aegis.adapters.openai_agents import governed_tool
@governed_tool(runtime=runtime, action_type="write", action_target="crm")
async def update_contact(name: str, email: str) -> str:
"""Update a CRM contact -- governed by Aegis policy."""
return await crm.update(name=name, email=email)
</details>
<details> <summary><b>CrewAI</b> -- global guardrail hook + per-tool governance</summary>
Option A: Global guardrail (recommended) — governs ALL tool calls across all Crews
from aegis.adapters.crewai import enable_aegis_guardrail
# One line — every tool call now goes through Aegis policy
provider = enable_aegis_guardrail(runtime=my_runtime)
Option B: Per-tool wrapper
from aegis.adapters.crewai import AegisCrewAITool
tool = AegisCrewAITool(runtime=runtime, name="governed_search",
description="Search with governance", action_type="search",
action_target="web", fn=lambda query: do_search(query))
</details>
<details> <summary><b>Anthropic Claude</b> -- govern tool_use calls</summary>
from aegis.adapters.anthropic import govern_tool_call
for block in response.content:
if block.type == "tool_use":
result = await govern_tool_call(
runtime=runtime, tool_name=block.name,
tool_input=block.input, target="my_system")
</details>
<details> <summary><b>httpx</b> -- governed REST API calls</summary>
from aegis.adapters.httpx_adapter import HttpxExecutor
executor = HttpxExecutor(base_url="https://api.example.com",
default_headers={"Authorization": "Bearer ..."})
runtime = Runtime(executor=executor, policy=Policy.from_yaml("policy.yaml"))
# Action types map to HTTP methods: get, post, put, patch, delete
plan = runtime.plan([Action("get", "/users"), Action("delete", "/users/1")])
</details>
<details> <summary><b>MCP (Model Context Protocol)</b> -- govern any MCP tool call</summary>
from aegis.adapters.mcp import govern_mcp_tool_call, AegisMCPToolFilter
# Option 1: Govern individual tool calls
result = await govern_mcp_tool_call(
runtime=runtime, tool_name="read_file",
arguments={"path": "/data.csv"}, server_name="filesystem")
# Option 2: Filter-based governance
tool_filter = AegisMCPToolFilter(runtime=runtime)
result = await tool_filter.check(server="filesystem", tool="delete_file")
if result.ok:
# Proceed with actual MCP call
pass
</details>
<details> <summary><b>REST API</b> -- govern from any language</summary>
pip install 'agent-aegis[server]'
aegis serve policy.yaml --port 8000
# Evaluate an action (dry-run)
curl -X POST http://localhost:8000/api/v1/evaluate \
-H "Content-Type: application/json" \
-d '{"action_type": "delete", "target": "db"}'
# => {"risk_level": "CRITICAL", "approval": "block", "is_allowed": false}
# Execute through full governance pipeline
curl -X POST http://localhost:8000/api/v1/execute \
-H "Content-Type: application/json" \
-d '{"action_type": "read", "target": "crm"}'
# Query audit log
curl http://localhost:8000/api/v1/audit?action_type=delete
# Hot-reload policy
curl -X PUT http://localhost:8000/api/v1/policy \
-H "Content-Type: application/json" \
-d '{"yaml": "rules:\n - name: block_all\n match: {type: \"*\"}\n approval: block"}'
</details>
<details> <summary><b>MCP Server</b> -- one-click governance for Claude, Cursor, VS Code, Windsurf</summary>
pip install 'agent-aegis[mcp]'
aegis-mcp-server --policy policy.yaml
Claude Desktop — add to ~/Library/Application Support/Claude/claude_desktop_config.json:
{ "mcpServers": { "aegis": { "command": "uvx", "args": ["--from", "agent-aegis[mcp]", "aegis-mcp-server"] }}}
Cursor — add to .cursor/mcp.json:
{ "mcpServers": { "aegis": { "command": "uvx", "args": ["--from", "agent-aegis[mcp]", "aegis-mcp-server"] }}}
VS Code Copilot — add to .vscode/mcp.json:
{ "servers": { "aegis": { "command": "uvx", "args": ["--from", "agent-aegis[mcp]", "aegis-mcp-server"] }}}
Windsurf — add to ~/.codeium/windsurf/mcp_config.json:
{ "mcpServers": { "aegis": { "command": "uvx", "args": ["--from", "agent-aegis[mcp]", "aegis-mcp-server"] }}}
</details>
<details> <summary><b>Custom adapters</b> -- 10 lines to integrate anything</summary>
from aegis.adapters.base import BaseExecutor
from aegis.core.action import Action
from aegis.core.result import Result, ResultStatus
class MyAPIExecutor(BaseExecutor):
async def execute(self, action: Action) -> Result:
response = await my_api.call(action.type, action.target, **action.params)
return Result(action=action, status=ResultStatus.SUCCESS, data=response)
async def verify(self, action: Action, result: Result) -> bool:
return result.data.get("status") == "ok"
</details>
Policy Conditions
Go beyond glob matching with smart conditions:
rules:
# Block writes after business hours
- name: after_hours_block
match: { type: "write*" }
conditions:
time_after: "18:00"
risk_level: critical
approval: block
# Escalate bulk operations over threshold
- name: large_bulk_ops
match: { type: "update*" }
conditions:
param_gt: { count: 100 }
risk_level: high
approval: approve
# Only allow deploys on weekdays
- name: weekday_deploys
match: { type: "deploy*" }
conditions:
weekdays: [1, 2, 3, 4, 5]
risk_level: medium
approval: approve
Available: time_after, time_before, weekdays, param_eq, param_gt, param_lt, param_gte, param_lte, param_contains, param_matches (regex).
Semantic Conditions
Go beyond keyword matching with the two-tier semantic conditions engine:
rules:
- name: block_harmful_content
match: { type: "generate*" }
conditions:
semantic: "contains harmful, violent, or illegal content"
risk_level: critical
approval: block
Tier 1 uses fast built-in keyword matching. Tier 2 plugs in any LLM evaluator via the SemanticEvaluator protocol -- bring your own model for nuanced content analysis.
Governance Dashboard
Built-in web dashboard for real-time agent governance monitoring. No separate frontend build needed.
# Quick start
pip install 'agent-aegis[server]'
aegis serve policy.yaml
# Open http://localhost:8000
7 dashboard pages:
| Page | What it shows |
|---|---|
| Overview | KPI cards, action volume chart, risk distribution, compliance grade |
| Audit Log | Filterable/paginated history of all agent actions and decisions |
| Policy | Current rules, governance score (0-100), score breakdown |
| Anomalies | Agent behavior profiles, block rates, anomaly alerts |
| Compliance | SOC2/GDPR/governance reports with findings and letter grades |
| Regulatory | EU AI Act, NIST, SOC2, ISO 42001 gap analysis |
| System | Health status, version, API endpoints |
11 REST API endpoints under /api/v1/dashboard/ -- use programmatically or through the UI.
# Programmatic access
from aegis.server.app import create_app
app = create_app(
policy_path="policy.yaml",
audit_db_path="audit.db",
enable_dashboard=True,
anomaly_detector=detector, # optional
)
Deep Features
Advanced capabilities for production-grade agent governance.
Behavioral Anomaly Detection
Aegis learns per-agent behavior profiles and automatically detects anomalies -- no manual threshold tuning required.
from aegis.core.anomaly import AnomalyDetector
detector = AnomalyDetector()
# Feed observed actions to build per-agent behavior profiles
detector.observe(agent_id="agent-1", action_type="read", target="crm")
detector.observe(agent_id="agent-1", action_type="read", target="crm")
detector.observe(agent_id="agent-1", action_type="read", target="crm")
# Detect anomalies: rate spikes, bursts, new actions, unusual targets, high block rates
alerts = detector.check(agent_id="agent-1", action_type="delete", target="prod_db")
# => [Anomaly(type=NEW_ACTION, detail="action 'delete' never seen for agent-1")]
# Auto-generate a policy from observed behavior
learned_policy = detector.generate_policy(agent_id="agent-1")
Detects: rate spikes | burst patterns | never-seen actions | unusual targets | high block rates
Compliance Report Generator
Generate audit-ready compliance reports from your existing audit logs. No additional tooling needed.
aegis compliance --type soc2 --output report.json
aegis compliance --type gdpr --output gdpr-report.json
aegis compliance --type governance --days 30
from aegis.core.compliance import ComplianceReporter
reporter = ComplianceReporter(audit_store=runtime.audit_store)
report = await reporter.generate(report_type="soc2", days=90)
print(report.score) # 87.5
print(report.findings) # List of findings with severity
print(report.evidence) # Linked audit log entries
Supported report types: SOC2 | GDPR | Governance -- each with scoring, findings, and evidence links.
Policy Diff & Impact Analysis
Compare two policy files and understand exactly what changed and what impact it will have.
# Show added/removed/modified rules between two policies
aegis diff policy-v1.yaml policy-v2.yaml
# Replay historical actions against the new policy to see impact
aegis diff policy-v1.yaml policy-v2.yaml --replay audit.db
Rules: 2 added, 1 removed, 3 modified
+ bulk_write_block CRITICAL/block (new)
+ pii_access_approve HIGH/approve (new)
- legacy_allow_all LOW/auto (removed)
~ read_safe LOW/auto → LOW/auto conditions changed
~ deploy_prod HIGH/approve → CRITICAL/block risk escalated
~ bulk_ops MEDIUM/approve param_gt.count: 100 → 50
Impact (replayed 1,247 actions):
23 actions would change from AUTO → BLOCK
7 actions would change from APPROVE → BLOCK
Agent Trust Chain
Hierarchical agent identity with delegation and capability-scoped trust.
from aegis.core.trust import TrustChain, AgentIdentity, Capability
# Create a root agent with full capabilities
root = AgentIdentity(
agent_id="orchestrator",
capabilities=[Capability("*")], # glob matching
)
# Delegate a subset of capabilities (intersection semantics)
worker = root.delegate(
agent_id="data-worker",
capabilities=[Capability("read:*"), Capability("write:staging_*")],
)
# Worker can only do what both root AND delegation allow
chain = TrustChain()
chain.register(root)
chain.register(worker, parent=root)
# Verify capability at runtime
chain.can(worker, "read:crm") # True
chain.can(worker, "delete:prod_db") # False -- not in delegation
# Cascade revocation: revoking parent revokes all children
chain.revoke(root)
chain.can(worker, "read:crm") # False
Rate Limiter
Per-agent and global sliding-window rate limiting with glob-pattern matching on agent IDs.
from aegis.core.rate_limiter import RateLimiter
limiter = RateLimiter()
limiter.add_rule("agent-*", max_requests=100, window_seconds=60)
limiter.add_rule("agent-untrusted", max_requests=10, window_seconds=60)
limiter.check("agent-untrusted", action_type="write") # True (allowed)
# After 10 calls in 60s:
limiter.check("agent-untrusted", action_type="write") # False (rate limited)
RBAC (Role-Based Access Control)
12 granular permissions across 5 hierarchical roles. Thread-safe AccessController for multi-agent environments.
from aegis.core.rbac import AccessController, Role
ac = AccessController()
ac.assign_role("alice", Role.ADMIN)
ac.assign_role("bot-1", Role.OPERATOR)
ac.check("alice", "policy:write") # True
ac.check("bot-1", "policy:write") # False -- operators can execute, not configure
ac.check("bot-1", "action:execute") # True
Roles: viewer < operator < admin < policy_admin < super_admin
Policy Versioning
Git-like policy version control with commit, diff, rollback, and tagging.
from aegis.core.versioning import PolicyVersionStore
store = PolicyVersionStore("./policy-versions.json")
store.commit(policy, message="Initial production policy")
store.tag("v1.0")
# Later...
store.commit(updated_policy, message="Relax read rules")
diff = store.diff("v1.0", "HEAD") # See what changed
store.rollback("v1.0") # Revert to tagged version
Multi-Tenant Isolation
Context-based tenant isolation with quota enforcement and data separation.
from aegis.core.tenant import TenantContext, TenantRegistry, TenantIsolation
registry = TenantRegistry()
registry.register("acme-corp", quota={"max_actions_per_hour": 1000})
with TenantContext("acme-corp"):
# All policy evaluations, audit writes, and rate limits
# are automatically scoped to this tenant
result = await runtime.run_one(action)
Cryptographic Audit Chain
Tamper-evident audit trail with hash-linked entries. Meets EU AI Act Article 12 and SOC2 CC7.2 requirements.
aegis audit --verify # Verify chain integrity
aegis audit --export-chain # Export full hash chain
aegis audit --evidence soc2 # Generate compliance evidence package
from aegis.core.crypto_audit import CryptoAuditLogger
logger = CryptoAuditLogger(algorithm="sha3-256")
# Each entry is hash-linked to the previous -- any tampering breaks the chain
logger.log(action, decision, result)
assert logger.verify_chain() # True if untampered
Regulatory Compliance Mapper
Maps your governance posture against EU AI Act, NIST AI RMF, SOC2, and ISO 42001. Identifies gaps and generates evidence.
aegis regulatory --framework eu-ai-act # Gap analysis
aegis regulatory --framework nist-ai-rmf # NIST mapping
aegis regulatory --all --output report.json
29 regulatory requirements mapped across 4 frameworks with automatic evidence collection from audit logs.
Natural Language Policy Generation
Generate YAML policies from plain English. Two tiers: built-in keyword parser (no dependencies) and pluggable LLM evaluator (bring your own API key).
aegis autopolicy "block all deletes on production, allow reads, require approval for writes over $10K"
# Generated output:
version: '1'
defaults:
risk_level: medium
approval: approve
rules:
- name: delete_block
match: { type: "delete*", target: "prod*" }
risk_level: critical
approval: block
- name: read_auto
match: { type: "read*" }
risk_level: low
approval: auto
Tier 2 (LLM-backed): implement the PolicyGenerator protocol with your preferred provider (OpenAI, Anthropic, etc.) -- same pattern as SemanticEvaluator.
Adversarial Policy Probe
Automated testing for governance gaps. Probes for glob bypasses, missing coverage, escalation patterns, and overly permissive defaults.
aegis probe policy.yaml
# Aegis Policy Probe — 205 probes
# ==================================================
# Robustness score: 72/100
# Findings: 8
#
# CRITICAL [missing_coverage]
# Destructive action 'drop' on 'production' is auto-approved
# -> Add a rule to block or require approval for 'drop' actions
#
# HIGH [glob_bypass]
# 'bulk_delete' bypasses block rule 'no_deletes' (pattern: 'delete')
# -> Broaden the glob pattern to 'delete*' or add a rule for 'bulk_delete'
Probe categories: missing coverage | glob bypass | default fallthrough | escalation patterns | target gaps | wildcard rules
aegis scan -- Static Analysis
AST-based scanner that detects ungoverned AI tool calls in your Python codebase.
aegis scan ./src/
# Output:
# src/agents/mailer.py:42 openai.ChatCompletion.create() -- ungoverned
# src/agents/writer.py:18 anthropic.messages.create() -- ungoverned
# src/tools/search.py:7 langchain tool "web_search" -- ungoverned
#
# 3 ungoverned calls found. Run `aegis score` for governance coverage.
aegis score -- Governance Score
Quantify your governance coverage with a 0-100 score and generate a shields.io badge.
aegis score ./src/ --policy policy.yaml
# Governance Score: 84/100
# Governed calls: 21/25 (84%)
# Policy coverage: 18 rules covering 6 action types
# Anomaly detection: enabled
# Audit trail: enabled
#
# Badge: https://img.shields.io/badge/aegis_score-84-brightgreen
Add the badge to your repo:

Architecture
aegis/
core/ Action, Policy engine, Conditions, Risk levels, Retry, JSON Schema
core/anomaly Behavioral anomaly detection -- per-agent profiling, auto-policy generation
core/compliance Compliance report generator -- SOC2, GDPR, governance scoring
core/trust Agent trust chain -- hierarchical identity, delegation, revocation
core/semantic Semantic conditions engine -- keyword matching + LLM evaluator protocol
core/diff Policy diff & impact analysis -- rule comparison, action replay
core/rate_limiter Per-agent/global sliding-window rate limiting
core/rbac Role-based access control -- 12 permissions, 5 roles, AccessController
core/versioning Policy version control -- commit, diff, rollback, tagging
core/tenant Multi-tenant isolation -- context, registry, quota enforcement
core/crypto_audit Cryptographic audit chain -- hash-linked tamper-evident logs
core/replay Action replay engine -- what-if policy analysis
core/regulatory EU AI Act / NIST / SOC2 / ISO 42001 compliance mapper
core/webhooks Webhook notifications -- Slack, PagerDuty, JSON
core/builder Policy-as-Code SDK -- fluent PolicyBuilder API
core/autopolicy Natural language -> YAML policy generation (keyword + LLM)
core/probe Adversarial policy testing -- gap detection, bypass attempts
core/tiers Enterprise tier system -- feature gating with soft nudge
core/mcp_security MCP supply chain security -- poisoning, rug pull, sanitization, trust scoring
core/mcp_vuln_db MCP vulnerability database -- CVE matching, version ranges, auto-block
core/mcp_sbom MCP server SBOM generation -- tool catalog, vulnerability overlay, JSON export
core/budget Cost circuit breaker -- 17 model pricing, loop detection, hierarchical budgets
core/cost_callbacks Cross-framework cost tracking -- LangChain, OpenAI, Anthropic, Google
core/cost_attribution Multi-agent cost attribution -- delegation trees, subtree rollup
core/a2a_governance Agent-to-agent communication governance -- capability gates, content filter
core/policy_git Policy-as-code Git integration -- diff, impact analysis, drift detection
core/otel_export OpenTelemetry export -- governance events → OTel spans
core/session_replay Session replay -- record/replay with retroactive security scanning
adapters/ BaseExecutor, Playwright, httpx, LangChain, CrewAI, OpenAI, Anthropic, MCP
runtime/ Runtime engine, ApprovalHandler, AuditLogger (SQLite/JSONL/webhook/logging)
server/ REST API (Starlette ASGI) -- evaluate, execute, audit, policy endpoints
cli/ aegis validate | audit | schema | init | simulate | serve | stats |
scan | score | diff | compliance | regulatory | monitor
Why Aegis?
| Writing your own | Platform guardrails | Enterprise platforms | Aegis | |
|---|---|---|---|---|
| Setup | Days of if/else | Vendor-specific config | Kubernetes + procurement | pip install + YAML |
| Cross-framework | Rewrite per framework | Their ecosystem only | Usually single-vendor | LangChain + CrewAI + OpenAI + Anthropic + MCP |
| Audit trail | printf debugging | Platform logs only | Cloud dashboard | SQLite + JSONL + webhooks — local, no infra |
| Compliance | Manual documentation | None | Enterprise sales cycle | EU AI Act, NIST, SOC2, ISO 42001 built-in |
| Cost | Engineering time | Free-to-$$$ per vendor | $$$$ + infra | Free (MIT). Forever. |
CLI
aegis init # Generate starter policy
aegis validate policy.yaml # Validate policy syntax
aegis schema # Print JSON Schema (for editor autocomplete)
aegis simulate policy.yaml read:crm delete:db # Test policies without executing
aegis audit # View audit log
aegis audit --session abc --format json # Filter + format
aegis audit --tail # Live monitoring
aegis audit --format jsonl -o export.jsonl # Export
aegis stats # Policy rule statistics
aegis serve policy.yaml --port 8000 # Start REST API + dashboard (http://localhost:8000)
aegis scan ./src/ # Detect ungoverned AI tool calls (AST-based)
aegis score ./src/ --policy policy.yaml # Governance score (0-100) + badge
aegis diff policy-v1.yaml policy-v2.yaml # Compare policies
aegis diff policy-v1.yaml policy-v2.yaml --replay # Impact analysis with action replay
aegis compliance --type soc2 --output report.json # Generate compliance report
aegis autopolicy "block deletes, allow reads" # Generate policy from English
aegis probe policy.yaml # Adversarial policy testing
Roadmap
| Version | Status | Features |
|---|---|---|
| 0.1 | Released | Policy engine, 7 adapters (incl. MCP), CLI, audit (SQLite + JSONL + webhook), conditions, JSON Schema |
| 0.1.3 | Released | REST API server, retry/rollback, dry-run, hot-reload, policy merge, Slack/Discord/Telegram/email approval, simulate CLI, runtime hooks, stats, live tail |
| 0.1.4 | Released | Multi-agent foundations (agent_id, PolicyHierarchy, conflict detection), performance optimizations (compiled globs, batch audit, eval cache), security hardening, MCP/LangChain/CrewAI/OpenAI cookbooks |
| 0.1.5 | Released | Behavioral anomaly detection, compliance report generator (SOC2/GDPR), policy diff & impact analysis, semantic conditions engine, agent trust chain, aegis scan (static analysis), aegis score (governance scoring + badge) |
| 0.1.7 | Released | Cryptographic audit chain, rate limiter, RBAC, policy versioning, multi-tenant isolation, regulatory mapper (EU AI Act/NIST/SOC2/ISO 42001), webhook notifications, action replay, PolicyBuilder SDK, policy testing framework, real-time monitor, GitHub Action |
| 0.1.9 | Released | Web governance dashboard (7 pages, 11 API endpoints), aegis serve with dashboard, natural language autopolicy, adversarial probe |
| 0.2 | Released | LangChain AgentMiddleware, CrewAI GuardrailProvider, OpenAI Agents native guardrails, OWASP Agentic Top 10, HTML compliance reports, interactive playground + challenge |
| 0.3 | Released | MCP supply chain security (poisoning/rug pull/SBOM/vuln DB), cost circuit breaker (17 models), cross-framework cost tracking (LangChain/OpenAI/Anthropic/Google), A2A communication governance, session replay with retroactive scanning, OpenTelemetry export, policy Git integration |
| 0.4 | Q3 2026 | Centralized policy server, cross-agent audit correlation |
| 1.0 | 2027 | Distributed governance, hosted SaaS, SSO/SCIM |
Contributing
We welcome contributions! Check out:
- Good First Issues -- great starting points
- Contributing Guide -- setup, code style, PR process
- Architecture -- how the codebase is structured
git clone https://github.com/Acacian/aegis.git && cd aegis
make dev # Install deps + hooks
make test # Run tests
make lint # Lint + format check
make coverage # Coverage report
Or jump straight into a cloud environment:
Badge
Using Aegis? Add a badge to your project:
[](https://github.com/Acacian/aegis)
License
MIT -- see LICENSE for details.
Copyright (c) 2026 구동하 (Dongha Koo, @Acacian). Created March 21, 2026.
<p align="center"> <sub>Built for the era of autonomous AI agents.</sub><br/> <sub>If Aegis helps you, consider giving it a star -- it helps others find it too.</sub> </p>
推荐服务器
Baidu Map
百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
Playwright MCP Server
一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。
Audiense Insights MCP Server
通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。
Magic Component Platform (MCP)
一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。
VeyraX
一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。
Kagi MCP Server
一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。
graphlit-mcp-server
模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。
Exa MCP Server
模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。
mcp-server-qdrant
这个仓库展示了如何为向量搜索引擎 Qdrant 创建一个 MCP (Managed Control Plane) 服务器的示例。
e2b-mcp-server
使用 MCP 通过 e2b 运行代码。