Classify the effect
Pattern-match the action name against 34 keywords across four risk tiers. delete_file → destructive. web_search → read. Unknown actions default to mutating — the safe default that routes through higher scrutiny.
AI agents execute many actions per minute, delegate tasks to each other, and require instant revocation when something goes wrong. AgentTrust ID™ issues opaque runtime tokens and evaluates actions routed through its SDK, MCP proxy, A2A adapter, and API session checks. Built for how agents actually work — across MCP, A2A, and direct API integrations.
AgentTrust ID authorizes actions that are routed through its SDKs, protocol adapters, or gateway checks. Here's the exact flow, step by step — from the actual code.
Pattern-match the action name against 34 keywords across four risk tiers. delete_file → destructive. web_search → read. Unknown actions default to mutating — the safe default that routes through higher scrutiny.
Verify action is within the session's immutable scope ceiling. Confirm the session mode (read-only blocks writes unless elevated). Validate agent ownership. Every session protected by HMAC-SHA256 integrity signatures.
After five calls, the checker examines denial rate and write ratio. If the session is probing unauthorized actions (30%+ denials) or making an unusual burst of writes (80%+), the action is escalated to a higher risk tier automatically.
Read actions → Fast Guard only. Mutating → Fast + Spot. Destructive and admin → Fast + Spot + Deep (Claude-backed review). Destructive actions fail closed if any Guardian tier is unreachable. There is no bypass path.
Return allow or deny with reasoning, confidence score, and guard tier. Extend the SHA-256-chained decision log. Update session metrics atomically via Redis Lua scripts. External services can verify the live session record and chain head through the accountability API.
Protocol convergence, behavioral analytics, and Deep Guard review — all in one platform. Every capability below is implemented with tests.
The MCP proxy, A2A adapter, and API session paths translate actions into the same unified check. One policy model can apply across supported integrations.
Routed actions are classified at runtime by matching against 34 keywords across four tiers. Operators can override classification per-tool when needed.
Sessions start in read-only mode. Elevation requires human approval, is time-boxed to five minutes, and scoped to specific actions. Admin actions can never be elevated — a hard safety boundary.
Monitors denial rates and write ratios per session. Anomaly signals can raise scrutiny by escalating effect classification when behavior moves into riskier patterns.
A2A delegations verified for chain continuity, scope narrowing (privileges only shrink), maximum depth of five hops, self-delegation prohibition, and expiration at every link.
When MCP servers request LLM inference, AgentTrust ID intercepts: rate-limits to ten requests per five-minute window, scans prompts for dangerous patterns, and routes through the full Guardian pipeline.
AgentTrust ID records authorization decisions and audit events with chain-continuity checks. The accountability API verifies live session integrity for active sessions; it is not presented as a full historical replay proof.
Agents from different organizations interact via opaque federation tokens. Server-side introspection verifies active tokens and revocation state before bridging them into local AgentTrust ID sessions. No PKI infrastructure required.
Supported SDK and protocol-adapter checks route through Fast Guard first. Higher-risk actions can escalate to Spot and Deep tiers, and production Guardian JWT verification fails closed when key material is missing.
Rule-based evaluation for routed checks. Checks capabilities, blocklists, and scope with a low-latency prefilter.
Policy engine. Risk scoring, cumulative analysis, credential age, action history. Evaluates mutating and higher.
AI reasoning via Claude. Context-aware evaluation with confidence scores. Destructive and admin actions only.
| Effect | Keywords | Pipeline | If Guardian Down |
|---|---|---|---|
| read | get, list, read, describe, search, view, fetch, query, head | Fast Guard only | N/A (Fast Guard only) |
| mutating | write, update, create, execute, invoke, modify, send, put, post, commit, push, deploy | Fast + Spot | Falls back to Fast Guard |
| destructive | delete, drop, destroy, purge, terminate, remove, truncate | Fast + Spot + Deep | Fail-closed (denied) |
| admin | admin, transfer_ownership, revoke, escalate, grant, impersonate | Fast + Spot + Deep | Fail-closed (denied) |
Core services are Go binaries. Redis backs session, revocation, cache, and rate-limit state; Postgres stores identity, policy, and audit data. Internal auth is implemented, and TLS/mTLS support is configurable per deployment path.
Check before execute. Report after. Elevation handled automatically. Every SDK includes Sessions, Approvals, Guard, and Actions APIs.
# Python SDK — pip install agenttrustid, available June 1, 2026 from agenttrustid import AgentTrustClient, AgentTrustGuard client = AgentTrustClient( base_url="https://your-agenttrust-endpoint.example", # configure your endpoint api_key="sk_live_...", ) # Guard pattern: check → execute → report guard = AgentTrustGuard(client, agent_id=agent_id) if guard.check("web_search", input_summary="..."): result = perform_search(query) guard.report("web_search", success=True, duration_ms=42) # Or inspect the decision directly — .allowed / .elevation_required / .approval_id decision = client.actions.check(agent_id=agent_id, tool_name="web_search", session_id=session_id) if decision.elevation_required: request_human_approval(decision.approval_id)
Open source under the Apache License 2.0. Source on GitHub from June 1, 2026.
We do one thing and do it well. These are complementary layers handled by other tools in your stack.
Tell us what you are building and we will be in touch.
We read every request personally. You will hear back at once we have reviewed.
Prefer email? contact@agenttrust.id