200+ Models · OpenAI Compatible · Intelligent Routing

Intelligent
LLM Router

One OpenAI-compatible API that intelligently routes across 200+ models. Reduce your LLM costs by up to 80%.

quickstart.py
import openai

# Drop-in replacement — just change base_url
client = openai.OpenAI(
    api_key="ar-your-key",
    base_url="https://api.auraon.ai/v1"
)

response = client.chat.completions.create(
    model="auto"  # intelligent routing
)
0+
模型数量
覆盖所有主流服务商
0%
服务可用率
企业级稳定性保障
0%
成本节省
相比单一模型的平均降幅
0ms
P50 延迟
中位响应时间

Models

200+ models from every major provider

OpenAI, Anthropic, Google, Meta, Mistral, DeepSeek, and more — all through one unified API.

OpenAIfast

GPT-4o

Flagship multimodal model

vision
$0.005/1K
Anthropicmedium

Claude Opus 4

Most capable Claude model

reasoning
$0.015/1K
Anthropicfast

Claude Sonnet 4.6

Best coding model

coding
$0.003/1K
Googlemedium

Gemini 2.5 Pro

Advanced reasoning with 2M context

reasoning
$0.001/1K
DeepSeekslow

DeepSeek R1

Open-source reasoning champion

reasoning
$0.55/M
Metafast

Llama 3.3 70B

Best open-source general model

general
$0.23/M
Mistralfast

Mistral Large

European frontier model

coding
$0.002/1K
Alibabafast

Qwen 2.5 72B

Strong multilingual model

coding
$0.40/M
OpenAIfast

GPT-4o Mini

Fast and affordable

general
$0.15/M
Anthropicfast

Claude Haiku 4.5

Fastest Claude model

general
$0.80/M
OpenAIfast

GPT-4o

Flagship multimodal model

vision
$0.005/1K
Anthropicmedium

Claude Opus 4

Most capable Claude model

reasoning
$0.015/1K
Anthropicfast

Claude Sonnet 4.6

Best coding model

coding
$0.003/1K
Googlemedium

Gemini 2.5 Pro

Advanced reasoning with 2M context

reasoning
$0.001/1K
DeepSeekslow

DeepSeek R1

Open-source reasoning champion

reasoning
$0.55/M
Metafast

Llama 3.3 70B

Best open-source general model

general
$0.23/M
Mistralfast

Mistral Large

European frontier model

coding
$0.002/1K
Alibabafast

Qwen 2.5 72B

Strong multilingual model

coding
$0.40/M
OpenAIfast

GPT-4o Mini

Fast and affordable

general
$0.15/M
Anthropicfast

Claude Haiku 4.5

Fastest Claude model

general
$0.80/M
Metafast

Llama 3.3 70B

Best open-source general model

general
$0.23/M
Mistralfast

Mistral Large

European frontier model

coding
$0.002/1K
Alibabafast

Qwen 2.5 72B

Strong multilingual model

coding
$0.40/M
OpenAIfast

GPT-4o Mini

Fast and affordable

general
$0.15/M
Anthropicfast

Claude Haiku 4.5

Fastest Claude model

general
$0.80/M
OpenAIfast

GPT-4o

Flagship multimodal model

vision
$0.005/1K
Anthropicmedium

Claude Opus 4

Most capable Claude model

reasoning
$0.015/1K
Anthropicfast

Claude Sonnet 4.6

Best coding model

coding
$0.003/1K
Googlemedium

Gemini 2.5 Pro

Advanced reasoning with 2M context

reasoning
$0.001/1K
DeepSeekslow

DeepSeek R1

Open-source reasoning champion

reasoning
$0.55/M
Metafast

Llama 3.3 70B

Best open-source general model

general
$0.23/M
Mistralfast

Mistral Large

European frontier model

coding
$0.002/1K
Alibabafast

Qwen 2.5 72B

Strong multilingual model

coding
$0.40/M
OpenAIfast

GPT-4o Mini

Fast and affordable

general
$0.15/M
Anthropicfast

Claude Haiku 4.5

Fastest Claude model

general
$0.80/M
OpenAIfast

GPT-4o

Flagship multimodal model

vision
$0.005/1K
Anthropicmedium

Claude Opus 4

Most capable Claude model

reasoning
$0.015/1K
Anthropicfast

Claude Sonnet 4.6

Best coding model

coding
$0.003/1K
Googlemedium

Gemini 2.5 Pro

Advanced reasoning with 2M context

reasoning
$0.001/1K
DeepSeekslow

DeepSeek R1

Open-source reasoning champion

reasoning
$0.55/M

Features

Everything you need to build with AI

Auraon handles the infrastructure so you can focus on your product.

核心

智能路由

根据任务复杂度、成本和延迟要求,自动为每个请求选择最合适的模型。

省 80%

成本优化

将简单任务路由到低成本模型,复杂任务交给高性能模型,LLM 成本最高可降低 80%。

统一 API

一个 OpenAI 兼容端点接入 200+ 模型,无需修改任何代码即可切换模型。

全球节点

多区域部署,智能就近接入,确保全球用户获得最低延迟的访问体验。

实时分析

在统一仪表盘中监控所有应用的用量、成本、延迟和模型性能。

企业安全

SOC2 就绪的基础设施,支持请求日志、速率限制和精细化 API 密钥权限管理。

Integration

Drop-in OpenAI replacement

Already using OpenAI? Switch to Auraon in 30 seconds. Just change base_url and your API key — everything else stays the same.

  • 100% OpenAI API compatible
  • Auto model routing with `model: auto`
  • Streaming supported
  • Function calling & tool use
import openai

client = openai.OpenAI(
    api_key="br-your-api-key",
    base_url="https://api.auraon.ai/v1"
)

# Auraon auto-routes to the best model
response = client.chat.completions.create(
    model="auto",  # or specify: "claude-opus-4", "gpt-4o"
    messages=[
        {"role": "user", "content": "Explain quantum entanglement"}
    ]
)

print(response.choices[0].message.content)

Pricing

Simple, transparent pricing

Pay as you go. No subscriptions required.

入门版

$0.50/ 1k requests
  • 每月 10,000 次请求
  • 50+ 模型
  • 自动路由
  • 社区支持
  • 按量付费
Get started
Most popular

专业版

$0.35/ 1k requests
  • 每月 500,000 次请求
  • 200+ 模型
  • 智能路由 + 自动降级
  • 优先支持
  • 数据分析仪表盘
  • 信用卡支付
  • 自定义路由规则
Get started

企业版

$0.20/ 1k requests
  • 无限请求次数
  • 全部 200+ 模型
  • 专属基础设施
  • 7×24 SLA 支持
  • 高级数据分析
  • SSO + RBAC
  • 私有化部署
  • 定制 SLA
Contact us

Model costs are passed through at actual cost. Pricing above covers routing and infrastructure only.

Ready to reduce your LLM costs?

Join thousands of developers powering their AI apps with Auraon. Start free, no credit card required.