Vouch

Don't trust us.
Try us.

Zero inference overhead. TOON-native. Ed25519-signed verdicts. Pure deterministic evaluation in ~2ms — your costs don't move when theirs do.

Start Free Trial -> See how it works

◆ Live Evaluation — Production API ~2ms • 0 tokens • Live API

Pick a level or write your own plan

Plans tested: 0 ● 0 ● 0 ● 0

Every evaluation hits the production API. No signup required. Same endpoint your agents will use.

Token Cost

1,200+

Industry avg per safety call

GPU Hours

24/7

Always-on inference infrastructure

Latency

~2,000ms

Typical LLM safety evaluation

Tested in the Open.

Most governance tools publish accuracy on curated demos. We publish what happens when 342,000 plans hit the production API — benign DevOps, hostile exfiltration, credential harvesting, privilege escalation, prompt injection, and everything in between.

<3%

False positive rate

Benign plans blocked or held across mixed corpora

90.2%

Hostile plans caught

25k adversarial plans → stopped or flagged

342k

Plans tested through production API

Same endpoint you'll use

10/10

OWASP Agentic Top 10 covered

54 enforced patterns. Every one fires through the production API.

<3ms

Evaluation latency

Deterministic. Zero tokens. No LLM in the eval path.

122k

Novel Semantics

Calibrated on labeled plans. Reads intent, not just keywords.

Head to Head

Same 100,000 plans. Same labels. Same test.

Microsoft AGT

Adversarial catch 37.9%

Benign false positive 7.23%

Unique catches 64

Vouch

Adversarial catch 90.2%

Benign false positive 6.31%

Unique catches 13,137

Every number on this page was produced by sending plans through POST /api/v1/vouch. No tailored benchmarks. No proprietary datasets. Not opt-in components — enforced patterns that produce verdicts on every plan.

What we can't catch, we route.

Soft-language insider threats and restructured intent go to a cheap model review. Not a $0.03 frontier call — a targeted check on only the plans we flagged. The architecture catches what it can deterministically. What it can't, it flags.

Deterministic Governance.

One API call. Three verdicts. Zero tokens consumed. Built for the post-subsidy world.

Route

One middleware call. Works with LangChain, CrewAI, or plain HTTP. No SDK. No sidecar. No GPU instance running 24/7.

Evaluate

Deterministic pattern evaluation in ~2ms. No inference, no model drift, no token consumption. Same input, same verdict, every time.

Act

ACCEPTED plans flow. Dangerous plans queue. No retries, no rewrites, no false-positive cascades burning your quota on replanning loops.

Drop-in Integration.

Guard Claude Managed Agent sessions with one function call. Or add Vouch to any MCP-compatible agent. Both call the same API. Engine stays on our server.

vouch-guard

Claude Managed Agents

Every tool call evaluated before the sandbox executes it. ACCEPTED flows. RESTRICTED steers. BLOCKED stops. The agent loop handles retries naturally.

npm install vouch-guard

import { guardSession } from "vouch-guard";

// One call guards the entire session
const audit = await guardSession(client, sessionId, {
    apiKey: process.env.VOUCH_API_KEY,
});

// audit.accepted, audit.blocked, audit.restricted

vouch-mcp

Any MCP Agent

MCP server that gives any agent a vouch_evaluate tool. Works with Claude Code, Cursor, Windsurf, or any MCP-compatible host. Auto-provisions API keys.

npx vouch-mcp

// In your MCP config (claude_desktop_config.json, etc.)
{
  "mcpServers": {
    "vouch": {
      "command": "npx",
      "args": ["vouch-mcp"]
    }
  }
}

Both packages are open source. The evaluation engine is hosted — your plans are evaluated in ~2ms, verdicts signed with Ed25519, and nothing is stored. github.com/atlas-with-iris

Clear Economics.

No black boxes. No per-token metering. Flat, predictable, post-subsidy pricing.

Starter

$25 / mo

10,000 evaluations per month. Zero commitment.

10k evaluations / month
Full pipeline access
TOON-native responses
Ed25519 signed verdicts
PII scrubbing on all plans
Community support

Get Started

Pro

$100 / mo

50,000 evaluations per month. Built for production.

50k evaluations / month
Full pipeline access
Review severity scoring
Priority support
Email support + founder access

Go Pro

Enterprise

Custom

Dedicated instance. Custom lane work. Your traffic, your rules.

Unlimited evaluations
Dedicated managed instance
Custom governance lanes
Efficiency analytics
Direct founder access

Start free. 10,000 calls.

No credit card. No commitment. Just an email.

Check your inbox — your API key is on its way.

10,000 evaluations/month free. Upgrade anytime.

Don't trust us.Try us.