Impact-Site-Verification: ea5f7f08-74ae-4cca-af2a-01826f2587ea
Add real-time protection against prompt injection, data leaks, and other agent attacks
Drop-in protection for AI agents and AI coding extensions and Copilot. Enable or disable individual command, web, and LLM checks from the dashboard — or start from preset allow lists — to block prompt injection, jailbreaks, PII leaks, and data exfiltration
Drops into the agents you already run
Defense in depth for every prompt
Layered checks, run in parallel. Each request passes through multiple checks at once and short-circuits the moment a real threat is found — fast processing, low-latency replies.
You're in full control. Enable, disable, or customize checks easily from your dashboard. AgentGuards is fully hosted on secure AWS infrastructure.
Your data stays yours. We don't store your prompt content — only pass/fail results, used to improve detection.
Prompt injection & jailbreak
Heuristic patterns, an optional LLM judge, and Meta's PromptGuard ML model catch instruction-override and jailbreak attempts before they reach your model.
PII & secret redaction
Detect and redact emails, SSNs, credit cards, API keys, and tokens in prompts and tool inputs.
Data exfiltration & restricted topics
Block attempts to siphon data or steer the agent into off-limits territory, with policy-as-code you control.
Web-content defense
Purpose-built checks for fetched web pages — hidden text, HTML-comment injection, and XSS-as-injection vectors.
Per-tenant config & audit logs
Every customer gets isolated checks, their own keys, and a queryable audit trail of every decision.
Bring-your-own model key
Run the LLM-judge check on your own OpenAI key — your traffic, your spend, full control.
Watch it work
Real attacks, caught at the door.
Prompt-injection probes, instruction overrides and destructive commands get scored and stopped before your agent ever acts on them.
Frequently asked questions
Where is the service hosted?+
Our frontend and backend are deployed on AWS (eu-north-1 region). All traffic is served over HTTPS via an Application Load Balancer with ACM certificates.
How do I integrate AgentGuards?+
The quickest path is our MCP server — one command wires it into Claude Code, VS Code Copilot, or Codex. You can also call the REST API directly. See the integration guide →
What checks does AgentGuards run?+
Every request goes through up to 10 layered checks: prompt injection, jailbreak detection, PII detection, secret detection, data exfiltration, toxicity, restricted topics, web-content injection, an LLM semantic check, and a PromptGuard ML classifier. Checks short-circuit on the first confirmed threat.
Does AgentGuards store my prompts?+
No. We scan prompt content in memory and discard it immediately. Only metadata — request counts, token counts, blocked event types — is persisted for your usage dashboard.
Which LLMs and frameworks are supported?+
AgentGuards is model-agnostic. It sits in front of any LLM call as an HTTP guardrail. Native integrations exist for Claude Code, Gemini CLI, OpenAI Codex, and VS Code GitHub Copilot.
What happens when a threat is detected?+
The request is blocked before it reaches the model. You receive a JSON response with decision: "block" and a per-check breakdown showing which check triggered and why. Your LLM is never called.
Can I configure which checks run?+
Yes. Every check can be toggled on or off per tenant from your dashboard. Individual and higher plans can also edit custom detection patterns.
What counts as a request?+
Each guardrail evaluation — an input check, output validation, action authorization, policy evaluation, or gateway completion — is one metered request.
What happens when I hit my monthly limit?+
Requests beyond your plan's included quota are paused until the next cycle or an upgrade. You'll see usage and remaining quota in your dashboard, and we warn you before you run out.
Can I bring my own model key?+
Yes. The optional LLM-judge check runs on your own OpenAI key, so that spend stays on your account.
Do you offer on-prem?+
Enterprise can deploy in your own VPC or on-prem, with SLA, SSO, and a security review.
Ship your agent with guardrails on.
Start free in two clicks. No credit card, no sales call.