New

Announcing AISIX: The AI-Native AI Gateway for LLMs and AI AgentsLearn More

🦀AISIX Native AI Gateway · Built with Rust

The AISIX
Native AI Gateway
for LLMs & AI Agents

Built on Rust for unparalleled stability and sub-millisecond overhead. Unified governance, cost control, and rich observability, all in one open-source solution.

Read the Docs View on GitHub

LLMs Supported

All major providers

Uptime Target

Enterprise reliability

< 0ms

Proxy Overhead

Sub-millisecond latency

🌐

Web App

📱

Mobile

🤖

AI Agent

🔌

API Client

AISIX Gateway

Authentication

Rate Limiting

Load Balancing

Observability

OpenAI

Claude

Gemini

DeepSeek

Mistral AI

— WHY AISIX

Everything You Need to Manage AI Traffic

A production-grade, Rust-powered AISIX gateway designed for performance, governance, and observability.

High Performance

Blazing Fast, Built with Rust

Native Rust data plane delivers sub-millisecond proxy overhead with minimal memory footprint. Handle millions of requests per second without breaking a sweat.

Unified Governance

One API, All Your LLMs

Manage all your LLM providers through a single, OpenAI-compatible API. Centralized configuration, authentication, and policy enforcement across your entire AI stack.

Intelligent Load Balancing

Multi-LLM Load Balancing

Dynamically distribute traffic across multiple LLM providers based on latency, cost, and availability. Weighted round-robin, least-connections, and custom strategies.

Precise Rate Limiting

Token & Request Rate Limiting

Fine-grained rate limiting by tokens, requests, or custom dimensions. Per-consumer, per-route, and cluster-wide policies to control costs and prevent abuse.

Security Guardrails

Enterprise-Grade Security

Protect your AI pipeline with prompt injection detection, content moderation, PII redaction, and comprehensive audit logging for regulatory compliance.

Rich Observability

Full-Stack Observability

Track every token, monitor latency distributions, and analyze traffic patterns in real-time. Native integration with Prometheus, Grafana, and ClickHouse.

— HOW IT WORKS

Production-Grade,
Cloud-Native Architecture

Control Plane and Data Plane separation ensures high scalability, zero-downtime upgrades, and enterprise-level reliability.

Admin User

Browser / Dashboard

Control Plane (CP)

AISIX Control Plane · Rust · Stateless

etcdConfig Center

RedisRate Limit

PrometheusMetrics

ClickHouseLog Server

Data Plane (DP)

AISIX Data Plane · Rust · Stateless · Horizontally Scalable

OpenAI

DeepSeek

Claude

Gemini

More...

Stateless Data Plane

Horizontally scalable with zero state — add or remove nodes instantly without data migration.

etcd-Based Config

Centralized configuration management with real-time propagation and hot-reload capabilities.

Separation of Concerns

Control Plane handles management; Data Plane handles traffic. Independent scaling and upgrades.

— DEVELOPER EXPERIENCE

Deploy in Minutes, Not Weeks

OpenAI-compatible API. Production-ready from day one.

Enterprise-grade security built into every request

TPM / RPM rate limiting — cap tokens-per-minute and requests-per-minute per key, per model
Guardrails — block prompt injection, PII leakage, and toxic content before it reaches the model
Per-key access control — restrict which models each API key can call
Concurrency limits — prevent single consumers from monopolising capacity

— INTEGRATIONS

100+ LLM Providers, One Gateway

Connect to any major LLM provider through a unified, OpenAI-compatible interface. No vendor lock-in, ever.

OpenAI

DeepSeek

Claude

Gemini

Mistral AI

Qwen

Amazon Bedrock

Azure OpenAI

100+ More

— PRICING

The Best Pricing Plans

Start free with our open-source core. Scale to enterprise when you're ready.

Open Source

$0/ forever

Everything you need to get started with AISIX

Connect 100+ LLM providers
Load balancing + RPM/TPM limits
Virtual key management
OpenTelemetry, Prometheus, Jaeger
Model guardrails

Get Started

Recommended

Enterprise

Custom

Cloud or Self-Hosted. For teams managing AI traffic at scale.

Includes all open-source features
Priority support + custom SLAs
Budgets, teams, and RBAC
Audit logs and compliance
Complete enterprise features

The AISIXNative AI Gatewayfor LLMs & AI Agents

Everything You Need to Manage AI Traffic

High Performance

Unified Governance

Intelligent Load Balancing

Precise Rate Limiting

Security Guardrails

Rich Observability

Production-Grade,Cloud-Native Architecture

Stateless Data Plane

etcd-Based Config

Separation of Concerns

Deploy in Minutes, Not Weeks

100+ LLM Providers, One Gateway

The Best Pricing Plans

Open Source

Enterprise

The AISIX
Native AI Gateway
for LLMs & AI Agents

Production-Grade,
Cloud-Native Architecture