Skip to main content
30-Day Free Trial ยท No Credit Card Required

Govern Every AI Call. LLM and Tool.

The only AI gateway that covers both LLM calls and MCP tool calls โ€” two governed data planes, one control plane. Policy-as-Code, immutable audit, PII redaction, and cost control built in.

dvara-gatewayzsh
$docker run -p 8080:8080 -e OPENAI_API_KEY=sk-... \
> dvara/gateway:latest
DVARA Gateway v1.2.0 started on :8080
LLM Proxy ready MCP Proxy ready
$ curl -X POST localhost:8080/v1/chat/completions \
> -H "Authorization: Bearer $TOKEN" \
> -d '{"model":"gpt-4o","messages":[...]}'
policy: PASS pii-scan: CLEAN audit: SIGNED cost: $0.0032
{"status": 200, "model": "gpt-4o", "latency": "1.2s"}
10K+
Requests/sec on 4 cores
<10ms
Added latency P95
6
Provider adapters
60s
Zero to first request

Everything You Need to Govern AI at Scale

From quickstart to enterprise compliance. Full-featured gateway with governance layers that grow with your requirements.

๐Ÿ”„
Multi-Provider Unified API

Drop-in OpenAI-compatible endpoint. Route to OpenAI, Anthropic, Gemini, Azure, Bedrock, or Ollama with zero code changes. Streaming, structured outputs, and vision built-in.

๐Ÿ›ก
Policy-as-Code Engine

YAML DSL for model allowlists, token limits, MCP tool restrictions, budget-based model downgrades, and time-of-day rules. Dry-run against historical traffic before activation.

๐Ÿ”
Immutable Audit Trail

HMAC-SHA256 signed, append-only audit records. PostgreSQL RLS prevents tampering. SOC2, HIPAA, and GDPR evidence packages generated on demand.

๐Ÿค–
MCP Tool Call Governance

Separately deployed MCP Proxy governs every tool call. PII scanning on arguments and responses. Human approval gates for high-risk operations. Credential centralisation.

๐Ÿ’ฐ
FinOps & Cost Control

Real-time per-request cost calculation. Budget caps per tenant, team, or API key. Automatic model downgrade at soft limit. Chargeback reports and cost forecasting.

๐Ÿšจ
PII Detection & Redaction

Detect and redact emails, phone numbers, SSNs, credit cards, and custom patterns before they reach any provider. Configurable per-tenant: block, redact, or flag.

โšก
Intelligent Routing

Round-robin, weighted, capability-aware, latency-aware, and cost-aware routing. Circuit breakers, failover, and retry with exponential backoff. Canary and A/B testing.

๐Ÿ”
Full Observability

Prometheus metrics, OpenTelemetry distributed tracing, structured JSON logging. Pre-built Grafana dashboards. Unified traces spanning LLM turns and MCP tool calls.

๐ŸŒ€
Agentic AI Governance

Agent loop detection with auto-kill. Session tracking across LLM and MCP calls. Human-in-the-loop approval gates enforced at execution. Full session timeline and analytics.

Two Governed Data Planes. One Control Plane.

The LLM Gateway sits at the edge for model traffic. The MCP Proxy sits inside your perimeter for tool calls. Both share the same policies, audit trail, and API keys.

LLM Data Plane
</>
Your App
Agent / API Client
{โ€‰}
LLM Gateway
:8080
AI
Providers
OpenAI ยท Anthropic ยท Gemini
Control Plane
PoliciesAuthAuditBudgetsPII
MCP Data Plane
</>
Your App
Agent / IDE
{โ€‰}
MCP Proxy
:8070
DB
Tools
DB ยท FS ยท Slack ยท APIs

Built for Production Scale

Governance with zero performance tax. Every layer is optimised for throughput โ€” lightweight concurrency, non-blocking audit, multi-layer caching, and zero-copy streaming.

10,000+RPS
per node on 4 cores
Lightweight concurrency model handles thousands of simultaneous requests with minimal memory overhead.
<10ms
added gateway latency
Policy checks in under 3 ms. Audit writes in under 1 ยตs. Governance adds near-zero overhead to your AI calls.
<1ยตs
audit write latency
Append-only, crash-safe audit writes that never block the request path โ€” even at peak throughput.
99.99%
uptime SLA
Multi-region active-active with automatic failover in under 30 seconds. Data residency preserved during failover.
Lightweight Concurrency
Handles 10,000+ concurrent connections per node with minimal memory per request. No thread pool tuning required.
Zero-Copy Streaming
SSE tokens flushed immediately with no buffering or rewriting. First token arrives the instant the provider emits it.
Non-Blocking Audit
Append-only, crash-safe audit writes that happen off the critical path. Sub-microsecond latency with no performance impact.
Multi-Layer Caching
95%+ hit rate on policy, auth, and config lookups. In-process cache eliminates network round-trips on every request.
Automatic Failover
Per-provider health monitoring with instant failover on degradation. Requests reroute seamlessly โ€” your users never notice.
Fast Cold Start
Sub-100 ms startup with a minimal memory footprint. Ideal for auto-scaling, serverless, and edge deployments.

Governance Is the Architecture, Not a Feature

In every competitor, governance is bolted onto a proxy. In DVARA, governance is the design constraint that shaped every layer.

D1
Policy-as-Code with Dry-Run
Competitors: config flags or basic allowlists.

Full YAML DSL with version control, conflict detection at load time, and dry-run mode that tests against real historical traffic. Your auditors get proof that policies were tested before deployment.

D2
Immutable, HMAC-Signed Audit
Competitors: log to stdout or a mutable database.

Every record HMAC-SHA256 signed at write. PostgreSQL RLS blocks UPDATE/DELETE. Chronicle Queue writes < 1 microsecond with zero GC pressure. Tamper-evident by design.

D3
MCP Proxy โ€” No Competitor Has This
Competitors: governance stops at the LLM boundary.

Separately deployed, centrally governed MCP Proxy. Every tool call audited, policy-checked, and PII-scanned. Credential centralisation. Human approval gates enforced at execution.

D4
Agentic AI Governance
Competitors: nothing meaningful for agent loops.

Full OpenTelemetry trace spanning LLM turns + MCP calls. Agent loop detection with auto-kill. Human-in-the-loop approval for high-risk tool calls. Session-level cost and compliance summary.

Audit-Ready from Day One

Generate compliance evidence packages on demand. Every request is immutably logged, every policy decision recorded, every PII event tracked.

Compliance OverviewLive
SOC2
SOC 2 Type II
Ready
HIPAA
HIPAA
Ready
GDPR
GDPR
Ready
EU AI
EU AI Act
Ready
99.97%
Audit Coverage
<1ยตs
Write Latency
0
Tampered
24/7
Reports
Recent Audit Events
12:04:32LLM Requestpolicy: ALLOWpii: CLEANHMAC signed
12:04:31MCP Tool Callpolicy: ALLOWpii: REDACTEDHMAC signed
12:04:29LLM Requestpolicy: DENYpii: CLEANHMAC signed
12:04:28MCP Tool Callpolicy: ALLOWpii: CLEANHMAC signed
12:04:26LLM Requestpolicy: ALLOWpii: CLEANHMAC signed
12:04:24LLM Requestpolicy: ALLOWpii: REDACTEDHMAC signed
12:04:22MCP Tool Callpolicy: DENYpii: CLEANHMAC signed
12:04:20LLM Requestpolicy: ALLOWpii: CLEANHMAC signed
12:04:32LLM Requestpolicy: ALLOWpii: CLEANHMAC signed
12:04:31MCP Tool Callpolicy: ALLOWpii: REDACTEDHMAC signed
12:04:29LLM Requestpolicy: DENYpii: CLEANHMAC signed
12:04:28MCP Tool Callpolicy: ALLOWpii: CLEANHMAC signed
12:04:26LLM Requestpolicy: ALLOWpii: CLEANHMAC signed
12:04:24LLM Requestpolicy: ALLOWpii: REDACTEDHMAC signed
12:04:22MCP Tool Callpolicy: DENYpii: CLEANHMAC signed
12:04:20LLM Requestpolicy: ALLOWpii: CLEANHMAC signed
Immutable Audit Trail
HMAC-SHA256 signed, append-only records. PostgreSQL RLS prevents tampering.
Scheduled Reports
Weekly or monthly compliance report delivery in PDF and JSON.
PII Detection Log
Every redaction event logged with action taken, pattern matched, and tenant context.
Policy Decision Records
Every request logs ALLOW/DENY with rule ID and reason. Full dry-run history.
Right to Erasure
GDPR tenant data purge pipeline with confirmation and audit trail.
Data Residency
Per-tenant region pinning. EU tenant traffic never leaves EU โ€” enforced during failover.

How DVARA Compares

The structural gaps in every competitor are not feature gaps โ€” they require architectural redesigns to close.

DVARALiteLLMPortkeyKong AICloudflareHeliconeAWS BedrockBifrost
Core Gateway
OpenAI-compatible unified APIโœ“โœ“โœ“โˆผโœ“โ€”โ€”โœ“
SSE streaming (zero-copy pass-through)โœ“โœ“โœ“โœ“โœ“โ€”โœ“โœ“
Structured outputs on Anthropic + Geminiโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Multi-provider (OpenAI, Anthropic, Gemini, Bedrock, Azure, Ollama)โœ“โœ“โœ“โˆผโˆผโœ“โˆผโœ“
Virtual thread concurrency (10K+ RPS)โœ“โˆผโˆผโœ“โœ“โ€”โœ“โˆผ
Routing & Resilience
Weighted routing + failoverโœ“โœ“โœ“โœ“โˆผโ€”โ€”โœ“
Latency-aware routing (EWMA P95)โœ“โˆผโˆผโ€”โ€”โ€”โ€”โˆผ
Cost-aware routingโœ“โ€”โˆผโ€”โ€”โ€”โ€”โ€”
Canary + A/B routingโœ“โ€”โˆผโ€”โ€”โ€”โ€”โ€”
Per-provider circuit breakerโœ“โˆผโˆผโœ“โ€”โ€”โ€”โˆผ
Capability-aware route filteringโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Policy & Governance
Policy-as-Code engine (YAML DSL)โœ“โ€”โ€”โˆผโ€”โ€”โ€”โ€”
Policy dry-run before activationโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Policy versioning + rollbackโœ“โ€”โ€”โˆผโ€”โ€”โ€”โ€”
RBAC + ABAC access controlโœ“โ€”โˆผโœ“โ€”โ€”โœ“โ€”
SSO (OIDC / SAML)โœ“โ€”โœ“โœ“โ€”โœ“โœ“โ€”
PII & Data Protection
PII detection + redaction (LLM + MCP)โœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
DLP custom patterns per tenantโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Reversible tokenisationโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
EU data residency guaranteedโœ“โ€”โ€”โˆผโ€”โ€”โˆผโ€”
Right to erasure (GDPR)โœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
MCP Proxy & Tool Governance
MCP tool calls proxied & governedโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
MCP server registry + credential storeโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
MCP argument-level policy rulesโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
MCP PII scanning (args + response)โœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
MCP rate limiting + circuit breakerโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Agentic AI Governance
Human approval gate (enforced at exec)โœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Agent loop detection + kill switchโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Multi-agent session trackingโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Full OTel trace: LLM turns + MCP callsโœ“โ€”โˆผโ€”โ€”โˆผโ€”โ€”
Audit & Compliance
Immutable HMAC-signed audit logโœ“โ€”โ€”โ€”โ€”โ€”โˆผโ€”
SOC2 / HIPAA / GDPR evidence packagesโœ“โ€”โ€”โ€”โ€”โ€”โˆผโ€”
EU AI Act compliance engineโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
SIEM export (Splunk, CloudWatch, Elastic)โœ“โ€”โˆผโˆผโ€”โˆผโœ“โ€”
Scheduled compliance reports (PDF + JSON)โœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
FinOps & Cost Control
Real-time cost per request (USD)โœ“โœ“โœ“โ€”โ€”โœ“โˆผโ€”
Budget caps (soft + hard) per tenantโœ“โˆผโˆผโ€”โ€”โˆผโ€”โ€”
Auto model downgrade on budget thresholdโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Chargeback reports per tenant/teamโœ“โ€”โˆผโ€”โ€”โˆผโ€”โ€”
Guardrails & Safety
Prompt firewall (pre + post filter pipeline)โœ“โ€”โˆผโˆผโˆผโ€”โœ“โ€”
Jailbreak detectionโœ“โ€”โˆผโ€”โ€”โ€”โœ“โ€”
Output sanitization (XSS, SQLi, SSRF)โœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
System prompt leakage detectionโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Content policy filters per tenantโœ“โ€”โˆผโ€”โˆผโ€”โœ“โ€”
Deployment & Infrastructure
Air-gapped / on-prem deploymentโœ“โˆผโ€”โœ“โ€”โ€”โ€”โ€”
Zero-dependency standalone modeโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”
Multi-region active-activeโœ“โ€”โˆผโœ“โœ“โ€”โœ“โ€”
Kubernetes Helm chart + HPAโœ“โœ“โˆผโœ“โ€”โ€”โ€”โ€”
Hot config propagation (< 5s)โœ“โ€”โ€”โˆผโ€”โ€”โ€”โ€”
Observability
Prometheus metrics + Grafana dashboardsโœ“โœ“โˆผโœ“โ€”โˆผโˆผโ€”
OpenTelemetry distributed tracingโœ“โˆผโˆผโˆผโ€”โ€”โˆผโ€”
Per-tenant analyticsโœ“โˆผโœ“โ€”โ€”โœ“โˆผโ€”
Anomaly detection for traffic patternsโœ“โ€”โ€”โ€”โ€”โ€”โ€”โ€”

Developers Love It. Compliance Requires It.

Every stakeholder gets exactly what they need from the same platform.

Developer
"I'm calling five LLM providers with five different SDKs and no fallback."

One endpoint. Drop-in compatibility. Route to any provider with automatic failover. Add streaming, rate limiting, and observability in minutes.

Platform Engineer
"12 teams calling LLMs directly. Agents hitting databases with no governance."

Two governed data planes, one control plane. AI gateway at the edge, tool proxy inside the perimeter. Adopt incrementally โ€” add tool governance when agents go to production.

Compliance Officer
"Auditors want proof of what AI systems were used and that no PII was leaked."

Immutable audit trail with tamper-evident signatures. PII detected and redacted before reaching providers. Compliance evidence packages generated on demand.

CISO
"Developers are calling GPT-4 from their laptops. We have no idea what data is leaving."

Hard policy enforcement at the gateway. PII blocked before it leaves your network. Role-based access to authorised models only. Every request logged and auditable.

CFO / FinOps
"AI spend is $40K/month and we can't tell which team is spending what."

Real-time cost tracking per request. Budget caps that automatically enforce. Monthly chargeback reports. Smart caching reduces repeated token spend by up to 40%.

CTO
"We need the governance layer we'd build in 18 months โ€” but we need it now."

Two governed data planes โ€” AI at the edge, tools inside the perimeter โ€” managed as one. Production-grade performance. Enterprise-ready from day one.

Start Your Free 30-Day Trial

Full access to all features. No credit card required. Set up in under 60 seconds.

Request Trial Access
Get your trial API key and deployment credentials within minutes.
30-day trial ยท No credit card ยท Full feature access ยท Cancel anytime

Simple, Transparent Pricing

Every plan includes the full platform. Pick the tier that matches your scale. No per-pod charges. No surprise invoices.

Included in every plan
โœ“Multi-provider unified API
โœ“Streaming + structured outputs
โœ“Intelligent routing + failover
โœ“Policy-as-Code engine + dry-run
โœ“Immutable audit trail
โœ“PII detection + redaction
โœ“SSO / RBAC / ABAC
โœ“MCP tool governance
โœ“Agent loop detection + kill switch
โœ“Cost engine + budget enforcement
โœ“Advanced routing (latency, cost, canary)
โœ“Human approval gates
โœ“Multi-region active-active
โœ“Air-gapped / on-prem deployment
โœ“Observability + metrics
โœ“Response caching
Free TrialStarterMost PopularGrowthScaleEnterprise+
TenantsUnlimited1 โ€“ 34 โ€“ 2021 โ€“ 100100+
Token CeilingUnlimited500M / month2B / month10B / monthNegotiated
DeploymentSelf-hostedSelf-hostedSelf-hostedSelf-hostedSelf-hosted
SupportCommunityStandard SLAStandard SLAPriority SLADedicated SLA
Cloud ManagedComing SoonComing SoonComing SoonComing SoonComing Soon
Volume Discountsโ€”โ€”โ€”โœ“โœ“
SaaS Reseller Licensingโ€”โ€”โ€”โœ“โœ“
Dedicated Success Managerโ€”โ€”โ€”โœ“โœ“
Custom Integrationsโ€”โ€”โ€”โœ“โœ“
On-Site Onboardingโ€”โ€”โ€”โ€”โœ“
Best ForEvaluate all features for 30 daysFirst production deployment. Single team.Platform team governing multiple teams.Large enterprise or internal AI platform.Global enterprise. Financial services. Government.
Start Free TrialTalk to SalesTalk to SalesTalk to SalesTalk to Sales

Add-On Modules

Compliance Pack
SOC2, HIPAA, GDPR evidence packages. EU AI Act engine. Scheduled report delivery. SIEM connectors. Right-to-erasure workflows.
Talk to Sales
Semantic Cache
Vector similarity caching. Configurable threshold per route. Cache hit rate and cost-avoided dashboards. Reduce repeated token spend by up to 40%.
Talk to Sales

See It in Action.
Start Your Free Trial Today.

Full access for 30 days. No credit card. Deploy in 60 seconds and see governance working on your first request.