Architecture Overview

ShieldAgent enforces security at the boundary between AI agents and the tools they use. Where exactly that boundary sits in your stack is flexible — deploy the proxy inline between agent and upstream, or use the SDK's Verdict API to scan requests from within your own server code before executing them.

Where to Position ShieldAgent

ShieldAgent can be inserted at several points in the agent-to-tool call chain. The right position depends on what you own in your stack and the level of visibility you need. All positions use the same multi-stage security pipeline; only the integration method differs. Positions 2 and 3 use the SDK's Verdict API — your code sends request payloads to ShieldAgent for scanning and enforces the returned verdict.

Recommended

Position 1 — Proxy between Agent and MCP

The agent points its MCP client at the ShieldAgent proxy instead of the upstream server directly. No code changes to the agent or the MCP server. The proxy intercepts every tool call, runs the full security pipeline, and forwards approved requests. This is the primary deployment model and the one that gives you complete visibility into all tool calls.

Best when: you do not own the agent or MCP server, or you want zero-touch deployment. Network segmentation should enforce that agents cannot bypass the proxy.

Verdict API

Position 2 — Verdict API in your own API

If your agents call a REST API that you own and operate, use the ShieldAgent SDK to scan each incoming request before executing it. Your server calls client.scan() with the request payload. ShieldAgent runs the full multi-stage security pipeline and returns a verdict (allow / block / human_review). Your code enforces the verdict — execute if allowed, reject if blocked. The security pipeline runs on the ShieldAgent proxy, not in-process.

Best when: you own the API layer your agents call and want to add security scanning without changing your traffic flow. Your code stays in control — ShieldAgent scans and returns a verdict, you enforce it.

TypeScript

your-api-handler.ts

import { ShieldAgentClient } from '@shieldagent/sdk';

const client = new ShieldAgentClient({
  baseUrl: 'https://proxy.shieldagent.io',
  apiKey: '{your-api-key}',
});

async function handleAgentRequest(agentId: string, method: string, path: string, body: unknown) {
  const verdict = await client.scan({
    tenantId: '{your-tenant-id}',
    agentId,
    toolName: `api:${method}:${path}`,
    params: { method, path, body },
    context: { transport: 'rest' },
  });

  if (verdict.action === 'block') {
    throw new Error(`Request blocked by ShieldAgent: ${verdict.reason}`);
  }

  if (verdict.action === 'human_review') {
    return { status: 'pending_review', reviewId: verdict.reviewId };
  }

  return executeRequest(method, path, body);
}

Python

your_api_handler.py

from shieldagent import ShieldAgentClient

client = ShieldAgentClient(
    base_url="https://proxy.shieldagent.io",
    api_key="{your-api-key}",
)

async def handle_agent_request(agent_id: str, method: str, path: str, body: dict):
    verdict = await client.scan(
        tenant_id="{your-tenant-id}",
        agent_id=agent_id,
        tool_name=f"api:{method}:{path}",
        params={"method": method, "path": path, "body": body},
        context={"transport": "rest"},
    )

    if verdict.action == "block":
        raise HTTPException(status_code=403, detail=verdict.reason)

    if verdict.action == "human_review":
        return {"status": "pending_review", "review_id": verdict.review_id}

    return execute_request(method, path, body)

Verdict API

Position 3 — Verdict API in your own MCP server

If you operate an MCP server that agents connect to, add a client.scan() call before each tool handler. When an agent requests a tool, your server sends the tool name and arguments to ShieldAgent for scanning. ShieldAgent returns a verdict — your code executes the tool only if allowed. This gives you deep visibility and policy enforcement at the point where tools are defined, while keeping ShieldAgent centralized.

Best when: you build and maintain the MCP server and want security enforcement at the tool definition layer. Your server stays in control — ShieldAgent scans, you enforce. Richer context for policy evaluation than external proxy positioning.

TypeScript

your-mcp-server.ts

import { ShieldAgentClient } from '@shieldagent/sdk';

const client = new ShieldAgentClient({
  baseUrl: 'https://proxy.shieldagent.io',
  apiKey: '{your-api-key}',
});

async function handleToolCall(toolName: string, params: unknown, clientName?: string) {
  const verdict = await client.scan({
    tenantId: '{your-tenant-id}',
    clientHint: clientName,
    toolName,
    params,
    context: { transport: 'mcp' },
  });

  if (verdict.action === 'block') {
    throw new Error(`Tool blocked by ShieldAgent: ${verdict.reason}`);
  }

  if (verdict.action === 'human_review') {
    return { status: 'pending_review', reviewId: verdict.reviewId };
  }

  return executeToolHandler(toolName, params);
}

Python

your_mcp_server.py

from shieldagent import ShieldAgentClient

client = ShieldAgentClient(
    base_url="https://proxy.shieldagent.io",
    api_key="{your-api-key}",
)

async def handle_tool_call(tool_name: str, params: dict, client_name: str | None = None):
    verdict = await client.scan(
        tenant_id="{your-tenant-id}",
        client_hint=client_name,
        tool_name=tool_name,
        params=params,
        context={"transport": "mcp"},
    )

    if verdict.action == "block":
        raise MCPError(f"Tool blocked by ShieldAgent: {verdict.reason}")

    if verdict.action == "human_review":
        return {"status": "pending_review", "review_id": verdict.review_id}

    return execute_tool(tool_name, params)

Limited

Position 4 — Proxy in front of the agent

The proxy sits at the ingress of the agent itself — between the caller and the agent process. This is useful for inspecting and filtering the inputs that drive the agent (user prompts, orchestration messages, injected context). However, it cannot observe the outbound MCP calls the agent makes to upstream servers, so tool-level enforcement is not possible in this position alone. Use it as a complement to Positions 1–3, not a replacement.

Best when: combined with Position 1 or 2 to cover both inbound prompt inspection and outbound tool call enforcement. Not recommended as a standalone deployment — the outbound tool call path remains unprotected.

Kubernetes

Position 5 — Sidecar Proxy

ShieldAgent runs as a sidecar container alongside each agent (or alongside your MCP/API server). The agent sends requests to the sidecar instead of the upstream URL directly. No extra network hop — traffic stays on localhost. Same full pipeline as Position 1, but co-located for lower latency and per-agent isolation.

Best when: Kubernetes environments, per-agent isolation requirements, or microservice architectures. Also works as a sidecar of your server — co-locate ShieldAgent alongside your MCP/API server instead of the agent for infrastructure-enforced blocking without a network hop.

Enterprise

Position 6 — Post-Gateway

ShieldAgent sits behind your existing API gateway (Kong, Envoy, AWS API Gateway). The gateway handles authentication and routing, then forwards agent traffic to ShieldAgent for security scanning before it reaches the upstream. No agent code changes — the gateway routes transparently.

Best when: enterprises with existing API gateways that cannot change agent routing. Limitation: traffic bypassing the gateway is invisible to ShieldAgent.

Audit Only

Position 7 — Observer (Audit-Only)

Traffic flows directly from the agent to upstream without interception. A copy of each request/response is sent to ShieldAgent asynchronously for audit and detection. ShieldAgent cannot block threats in this mode — it provides detection, risk scoring, and compliance evidence without enforcement.

Best when: initial evaluation, compliance auditing, or brownfield environments where inline insertion is not yet possible. Zero latency impact.

Which position should I use?

Position	What you need to own	Agent code changes	Sees outbound tool calls	Latency impact	Enforcement
1 — Inline Proxy	Nothing	Point at proxy URL	Yes	+5-15ms	Infrastructure
2 — Verdict API (REST)	Your API server	Add SDK scan()	Yes	+5-15ms	Customer code
3 — Verdict API (MCP)	Your MCP server	Add SDK scan()	Yes	+5-15ms	Customer code
4 — Proxy in front of agent	Nothing	None	No (inputs only)	+5-15ms	Infrastructure (inputs only)
5 — Sidecar Proxy	Agent compute (pod/VM)	Point at localhost	Yes	+2-5ms	Infrastructure
6 — Post-Gateway	Gateway config	None	Yes (via gateway)	+5-15ms	Infrastructure
7 — Observer (Audit)	Mirror mechanism	None	Yes (async)	0ms	None (detection only)

Positions can be combined. For example: Position 4 (proxy in front of agent) + Position 1 (proxy between agent and MCP) gives you full coverage of both the input path and the outbound tool call path.

Passthrough vs. Verdict API

Two SDK integration patterns exist for customers who own their server. Choose based on whether ShieldAgent needs to forward traffic to a separate upstream.

Scenario	Pattern	How it works
Your server is a gateway to a different upstream	Passthrough (apiProxy / mcpProxy)	SDK sends request to ShieldAgent proxy. Proxy scans, then forwards to the real upstream and returns the response.
Your server IS the tool provider (no separate upstream)	Verdict API (scan)	SDK sends the payload to ShieldAgent. ShieldAgent returns a verdict (allow/block/human_review). Your server executes or rejects locally.

Full Coverage: Composing Positions

Deployment positions are not mutually exclusive. For defense in depth, deploy multiple ShieldAgent instances covering different traffic directions.

Same proxy binary, different configuration. Both instances share the same tenant, policies, and audit trail. Position 4 catches malicious inputs before the agent processes them; Position 1 or 5 catches dangerous outbound tool calls.

Enforcement Model

All deployment positions run the same multi-stage security pipeline — detection, policy evaluation, risk scoring, and audit are always handled by ShieldAgent. What differs is who enforces the verdict. Position 1 provides infrastructure-enforced blocking: ShieldAgent sits inline and stops the request before it ever reaches upstream. Positions 2 and 3 use the Verdict API: ShieldAgent detects threats and returns a verdict; your server code enforces it. This shared responsibility model is standard in the security industry — analogous to AWS WAF Count vs. Block mode or CrowdStrike Detect vs. Prevent mode.

Enforcement Comparison

Position	Enforced by	Block mechanism	Guarantee
1 — Inline Proxy	ShieldAgent	Proxy drops / rejects the request	Infrastructure — cannot be bypassed
2 — Verdict API (REST)	Your server code	Your handler throws or returns error	Code-level — depends on implementation
3 — Verdict API (MCP)	Your server code	Your tool handler rejects	Code-level — depends on implementation
4 — Proxy in front of agent	ShieldAgent	Proxy drops inbound prompt	Infrastructure (inputs only)
5 — Sidecar Proxy	ShieldAgent	Sidecar drops / rejects the request	Infrastructure — per-agent isolation
6 — Post-Gateway	ShieldAgent	Proxy drops / rejects after gateway	Infrastructure — for gateway-routed traffic
7 — Observer	None	No blocking — detection only	No enforcement guarantee

Responsibility Matrix

Responsibility	Positions 1 / 5 / 6 (Proxy)	Positions 2 & 3 (Verdict API)	Position 7 (Observer)
Threat detection	ShieldAgent	ShieldAgent	ShieldAgent
Policy evaluation	ShieldAgent	ShieldAgent	ShieldAgent
Risk scoring	ShieldAgent	ShieldAgent	ShieldAgent
Audit trail	ShieldAgent	ShieldAgent	ShieldAgent
Compliance documentation (Annex IV)	ShieldAgent	ShieldAgent	ShieldAgent
Blocking threats	ShieldAgent — infrastructure-guaranteed	Customer — code-level, verified via confirmExecution()	Not available — detection only
Enforcement verification	Automatic (proxy blocks before forwarding)	Optional — call confirmExecution() to close the audit loop	N/A

Compliance Framework Impact

Framework	Positions 1 / 5 / 6 (Proxy)	Positions 2 & 3 (Verdict API)	Position 7 (Observer)
SOC 2 CC6.1 (Logical access controls)	ShieldAgent is the complete logical access control.	ShieldAgent provides detection + audit. Your enforcement code is part of the combined control evaluated by auditors.	Detection and audit only. Does not satisfy logical access control requirements alone.
ISO 42001 A.6.2.6 (AI system monitoring)	Fully compliant.	Fully compliant — monitoring is provided regardless of enforcement topology.	Fully compliant — monitoring and audit are provided.
EU AI Act Art. 9 (Risk management)	ShieldAgent identifies, analyzes, evaluates, and mitigates threats.	ShieldAgent identifies, analyzes, and evaluates. You mitigate by enforcing verdicts. Both parties must be documented in Annex IV.	Identifies and analyzes only. Does not mitigate — does not satisfy blocking requirements for high-risk systems.
EU AI Act Art. 14 (Human oversight)	Human review via ShieldAgent dashboard.	Human review via dashboard + your explicit enforcement decision after receiving a human_review verdict.	Human review via dashboard (advisory only — no enforcement mechanism).
EU AI Act Annex IV (Technical documentation)	Auto-generated with infrastructure-enforced blocking proof.	Auto-generated — must include your enforcement mechanism and confirmExecution() rates as supporting evidence.	Auto-generated — must document that enforcement is not provided and describe compensating controls.

Security Pipeline

Every intercepted request travels through the same ordered pipeline regardless of which deployment position you use. Stages marked always run unconditionally; the rest are bypassed in shadow mode or when not configured.

Authalways

JWT verification, tenant and agent resolution, shadow-mode flag propagation.

Risk Hint

Cached risk score lookup — sets a per-agent rate-limit multiplier for the next stage.

Rate Limit

Sliding-window enforcement per tenant + global ceiling. Redis-backed across replicas.

Tool Access

Blocks tool discovery requests if the agent is not in the approved_tools allowlist (OWASP MCP #6).

Policy

Compiled rule evaluation (allow / deny / human_review). Shadow mode logs but never blocks.

Securityalways

Prompt injection (regex + ML), DLP (20+ finding types), tool drift, cross-origin abuse, excessive agency, server spoofing.

Risk Enforcement

Tier-based blocking when the agent risk score exceeds configured thresholds.

Auditalways

Tamper-evident hash-chain event written to PostgreSQL. Fire-and-forget, never blocks the request path.

Stages run in sequence. Any stage can short-circuit the pipeline by returning a deny decision. The audit stage always runs, even for denied requests.

Infrastructure Topologies

Independent of which position you use, you also choose how the ShieldAgent infrastructure itself is deployed. These topologies apply to the proxy and management API — choose based on compliance requirements and operational budget.

SaaS

Shared Multi-Tenant

All tenants share one proxy deployment. Isolation is logical — enforced by JWT claims, row-level security in PostgreSQL, and per-tenant rate limits. Lowest infrastructure cost.

Enterprise plans include dedicated isolation options. Contact your account team for details.

Topology	Isolation	Cost	Best For
Shared (SaaS)	Logical	Lowest	SaaS deployments

Component Overview

ShieldAgent is a monorepo with separate apps and shared packages. You only deploy the components you need — at minimum, the proxy, API, and a PostgreSQL database.

Component	Port	Role
Proxy	—	The security chokepoint. Intercepts every MCP message and REST API call. Runs the multi-stage security pipeline before forwarding to upstream.
Management API	—	Fastify REST API. CRUD for tenants, agents, policies, audit, risk, incidents. Real-time events via WebSocket.
Dashboard	—	Next.js security dashboard. Real-time monitoring, policy management, audit log exploration, agent registry.
MCP Scanner	—	CLI tool that inspects MCP servers for supply-chain risks, tool manifest drift, and dependency vulnerabilities.
ML Classifier	—	Python/FastAPI sidecar. Runs an ML model for prompt injection detection. Falls back gracefully when unavailable.
Audit Trail	—	Shared package. Batch writer with tamper-proof hash chain. Supports webhook and syslog export adapters.
Risk Engine	—	Shared package. Multi-factor risk scorer, behavioral anomaly detection, trend analysis, and alerting.
Policy Engine	—	Shared package. Compiles YAML policy rules into an optimized lookup index. Evaluated in-process by the proxy.

Supported Transports

The proxy supports both network and process-based MCP transports. Choose based on how your agent framework communicates with MCP servers.

Transport	Protocol	Use case
HTTP/SSE	HTTP streaming	Network agents, multi-replica deployments
stdio	Standard I/O	Agent spawns proxy as child process (local dev, desktop apps)
REST API proxy	HTTP methods	Pass-through for non-MCP REST APIs with policy enforcement

Scaling Considerations

Horizontal proxy scaling

The proxy is stateless on the request path. Add replicas behind a load balancer. Redis is shared across replicas for rate limiting, session tracking (SSE affinity), and the upstream routing cache. A Kubernetes HPA targeting CPU is the recommended scaling strategy.

SSE connection affinity

Server-Sent Event connections are long-lived. The proxy uses Redis-backed session tracking to route reconnects to the same replica. Configure your load balancer with sticky sessions (sessionAffinity: ClientIP in Kubernetes) as a fallback.

Database

PostgreSQL is the source of truth for all configuration and audit data. Use a managed database (RDS, Cloud SQL, Neon) with a read replica for audit log queries. The audit batch writer is designed to degrade gracefully under DB pressure — it buffers in memory with a dead-letter queue.

ML classifier

The ML classifier is optional. When unavailable, the proxy falls back to regex-only injection detection automatically. Scale the classifier independently of the proxy — it is CPU/GPU-bound and benefits from dedicated nodes.

Minimum Required Services

Not all components are required for every deployment. Start with the essentials and add services as needed.

Required

Proxy · Management API · PostgreSQL

Recommended

Dashboard · Redis (rate limiting + session affinity)

Optional

ML Classifier (enhanced injection detection) · MCP Scanner (supply-chain auditing) · Grafana + Prometheus (observability)