Built on Bifrost — 20μs added latency

Enterprise AI Gateway
your team can trust

Route, protect, and audit every LLM request. PII redaction, data residency enforcement, tamper-evident logs, and cost analytics — all in a single open-source gateway.

docker compose up
# Start the full InferexGuard stack
$ make dev
Starting services...
✓ gateway ready http://localhost:8080 (pii-dlp + audit-log + data-residency)
✓ analytics ready http://localhost:8081 (cost dashboard + forecasting)
✓ compliance ready http://localhost:8082 (SOC2 / HIPAA / GDPR)
✓ prompts ready http://localhost:8083 (versioning + A/B testing)
# Drop-in OpenAI replacement:
$ curl http://localhost:8080/v1/chat/completions \
-H "X-Data-Residency: EU" \
-d '{"model":"gpt-4o","messages":[...]}'
Apache 2.0
SOC2 Ready
HIPAA Ready
GDPR Ready
23+ Providers
20μs
Added latency
p99 at 5,000 RPS
23+
LLM providers
OpenAI, Anthropic, Bedrock…
37
Tests passing
across all plugins & services
3
Compliance frameworks
SOC2, HIPAA, GDPR

Everything enterprise AI needs

Six production-ready enterprise capabilities built as Bifrost plugins and standalone services. Deploy together or pick what you need.

PII / DLP Protection

Automatically detect and redact SSNs, credit cards, emails, API keys, and more before they reach the LLM — or block the request entirely. Three modes: detect, redact, block.

HIPAAGDPR

Tamper-Evident Audit Log

SHA-256 chained audit records for every LLM request. Prompt hashes, response hashes, token counts, latency, and user identity — stored in PostgreSQL with compliance mode tagging.

SOC2HIPAA

Data Residency Routing

Tag requests with X-Data-Residency: EU and InferexGuard restricts routing to EU-based providers only. Fail-closed mode blocks non-compliant requests outright.

GDPREnterprise

Cost Analytics & Forecasting

Real-time cost dashboards grouped by team, model, or provider. Linear regression forecasting with 90% confidence bounds. Chargeback reports formatted for finance teams.

FinOpsDashboards

Compliance Reports

Generate SOC2, HIPAA, and GDPR audit evidence on demand. Async report generation with download as JSON. Supports YYYY, YYYY-MM, and YYYY-Qq period formats.

SOC2HIPAAGDPR

Prompt Registry

Version your prompts like code. Semver versioning, environment deployments (dev → staging → production), {{variable}} template rendering, and A/B traffic splitting.

PromptOpsA/B Testing

How it works

Three steps from zero to enterprise-grade AI infrastructure.

01

Drop-in replacement

Point your existing OpenAI SDK at InferexGuard gateway. No code changes required — same API, same request format.

openai = OpenAI(base_url="https://gateway.yourdomain.com/openai")
02

Request flows through enterprise pipeline

Every request passes through: PII/DLP scan → data residency check → provider routing → response scan → audit record.

X-Data-Residency: EU   → routes to EU providers only
X-Virtual-Key: team-a  → applies team budget & policies
03

Insights flow to your dashboards

Audit logs power the analytics service. Query cost by team, model, or provider. Generate compliance reports on demand.

GET /api/v1/cost/summary?period=30d&group_by=team_id
POST /api/v1/reports/generate  {"framework":"soc2","period":"2026-Q1"}

Enterprise Plugin Pipeline

Requestpii-dlpdata-residencyaudit-logProvider

Post-hooks run in reverse order — audit-log captures final state after all transformations

Ready to deploy?

Open source, self-hosted, and production-tested. Get the full enterprise AI gateway running in under 10 minutes.

✓ Apache 2.0 — free forever✓ Self-hosted — your data stays yours✓ 37 passing tests