Vercel AI Gateway
Updated 3 weeks ago
Vercel AI Gateway is an observability and routing layer for AI applications. It provides analytics, cost tracking, and caching for requests to major AI providers, and gives unified access to models from OpenAI, Anthropic, Google, Meta, Mistral, and other providers through a single API endpoint. It also supports rate limiting, request caching, and fallback mechanisms to improve reliability and reduce costs for AI-powered applications.
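The fallback idea mentioned above can be illustrated with a minimal client-side sketch. This is not the gateway's own API: `complete_with_fallback` and the `call_model` callback are hypothetical names introduced here, and the stub `fake_call` only simulates a provider outage. The model IDs are taken from the listing.

```python
from typing import Callable, Sequence, Tuple


def complete_with_fallback(
    prompt: str,
    models: Sequence[str],
    call_model: Callable[[str, str], str],
) -> Tuple[str, str]:
    """Try each model in order; return (model_id, reply) from the first success.

    `call_model(model_id, prompt)` is a hypothetical callback standing in for
    whatever client actually sends the request.
    """
    errors = []
    for model in models:
        try:
            return model, call_model(model, prompt)
        except Exception as exc:  # a real client would catch narrower error types
            errors.append(f"{model}: {exc}")
    raise RuntimeError("all models failed: " + "; ".join(errors))


def fake_call(model: str, prompt: str) -> str:
    """Stub used only for this demonstration: the first model always fails."""
    if model == "gpt-5.2":
        raise TimeoutError("simulated outage")
    return f"[{model}] echo: {prompt}"


model, reply = complete_with_fallback(
    "hello", ["gpt-5.2", "claude-sonnet-4.6"], fake_call
)
print(model)  # claude-sonnet-4.6
```

Ordering the list from preferred to cheapest (or most available) model keeps the policy in one place; a gateway that offers fallbacks would apply a similar ordering on the server side.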
Browse the 165 models available from Vercel AI Gateway and compare their prices and features.
Models (165)
| Organization | Model Name | Original Model | Input | Output | Free |
|---|---|---|---|---|---|
| | Gemini 3.1 Pro | gemini-3.1-pro-preview | $2.00 | $12.00 | |
| OpenAI | GPT-5.2 Pro | gpt-5.2-pro | $21.00 | $168.00 | |
| OpenAI | GPT-5.2 | gpt-5.2 | $1.75 | $14.00 | |
| | Gemini 3 Pro | gemini-3-pro-preview | $2.00 | $12.00 | |
| Anthropic | Claude Opus 4.6 | claude-opus-4.6 | $5.00 | $25.00 | |
| | Gemini 3 Flash | gemini-3-flash | $0.50 | $3.00 | |
| Anthropic | Claude Sonnet 4.6 | claude-sonnet-4.6 | $3.00 | $15.00 | |
| OpenAI | GPT-5.1 Thinking | gpt-5.1-thinking | $1.25 | $10.00 | |
| OpenAI | GPT-5.1 Instant | gpt-5.1-instant | $1.25 | $10.00 | |
| Moonshot AI | Kimi K2.5 | kimi-k2.5 | $0.50 | $2.80 | |
| xAI | Grok-4 | grok-4 | $3.00 | $15.00 | |
| Anthropic | Claude Opus 4.5 | claude-opus-4.5 | $5.00 | $25.00 | |
| | Gemini 2.5 Pro Preview 06-05 | gemini-2.5-pro | $1.25 | $10.00 | |
| Z.ai | GLM-4.7 | glm-4.7 | $0.43 | $1.75 | |
| OpenAI | GPT-5 | gpt-5 | $1.25 | $10.00 | |
| Anthropic | Claude 3.7 Sonnet | claude-3.7-sonnet | $3.00 | $15.00 | |
| xAI | Grok-3 | grok-3 | $3.00 | $15.00 | |
| xAI | Grok-3 Mini | grok-3-mini | $0.30 | $0.50 | |
| Xiaomi | MiMo-V2-Flash | mimo-v2-flash | $0.10 | $0.30 | |
| Anthropic | Claude Sonnet 4.5 | claude-sonnet-4.5 | $3.00 | $15.00 | |
| OpenAI | o3 | o3 | $2.00 | $8.00 | |
| | Gemini 2.5 Flash | gemini-2.5-flash | $0.30 | $2.50 | |
| | Gemini 2.5 Flash | gemini-2.5-flash-preview-09-2025 | $0.30 | $2.50 | |
| OpenAI | GPT-5 mini | gpt-5-mini | $0.25 | $2.00 | |
| Meituan | LongCat-Flash-Thinking | longcat-flash-thinking | $0.15 | $1.50 | |
| OpenAI | o4-mini | o4-mini | $1.10 | $4.40 | |
| Minimax | MiniMax M2.1 | minimax-m2.1 | $0.30 | $1.20 | |
| Z.ai | GLM-4.6 | glm-4.6 | $0.45 | $1.80 | |
| Anthropic | Claude Opus 4.1 | claude-opus-4.1 | $15.00 | $75.00 | |
| Meituan | LongCat-Flash-Thinking-2601 | longcat-flash-thinking-2601 | - | - | |
| OpenAI | GPT OSS 120B | gpt-oss-120b | $0.25 | $0.69 | |
| Anthropic | Claude Opus 4 | claude-opus-4 | $15.00 | $75.00 | |
| Z.ai | GLM-4.5 | glm-4.5 | $0.60 | $2.20 | |
| Minimax | MiniMax M2 | minimax-m2 | $0.30 | $1.20 | |
| OpenAI | o1 | o1 | $15.00 | $60.00 | |
| OpenAI | o3-mini | o3-mini | $1.10 | $4.40 | |
| qwen | Qwen3-Next-80B-A3B-Thinking | qwen3-next-80b-a3b-thinking | $0.15 | $1.50 | |
| Moonshot AI | Kimi K2 0905 | kimi-k2-0905 | $1.00 | $3.00 | |
| Anthropic | Claude Sonnet 4 | claude-sonnet-4 | $3.00 | $15.00 | |
| Z.ai | GLM-4.7-Flash | glm-4.7-flash | - | - | |
| Z.ai | GLM-4.5-Air | glm-4.5-air | $0.20 | $1.10 | |
| Nvidia | Nemotron 3 Nano (30B A3B) | nemotron-3-nano-30b-a3b | $0.06 | $0.24 | |
| DeepSeek | DeepSeek-V3.1 | deepseek-v3.1 | $0.50 | $1.50 | |
| | Gemini 2.0 Flash Thinking | gemini-2.0-flash | $0.15 | $0.60 | |
| Meituan | LongCat-Flash-Chat | longcat-flash-chat | - | - | |
| Anthropic | Claude Haiku 4.5 | claude-haiku-4.5 | $1.00 | $5.00 | |
| qwen | Qwen3-Next-80B-A3B-Instruct | qwen3-next-80b-a3b-instruct | $0.09 | $1.10 | |
| OpenAI | GPT OSS 20B | gpt-oss-20b | $0.07 | $0.30 | |
| OpenAI | GPT-5 nano | gpt-5-nano | $0.05 | $0.40 | |
| Mistral | Magistral Medium | magistral-medium | $2.00 | $5.00 | |
| OpenAI | GPT-4o | gpt-4o | $2.50 | $10.00 | |
| Meta | Llama 4 Maverick | llama-4-maverick | $0.24 | $0.97 | |
| Anthropic | Claude 3.5 Sonnet | claude-3.5-sonnet | $3.00 | $15.00 | |
| Anthropic | Claude 3.5 Sonnet | claude-3.5-sonnet-20240620 | $3.00 | $15.00 | |
| OpenAI | GPT-4.1 | gpt-4.1 | $2.00 | $8.00 | |
| OpenAI | GPT-4.1 mini | gpt-4.1-mini | $0.40 | $1.60 | |
| | Gemini 2.5 Flash-Lite | gemini-2.5-flash-lite | $0.10 | $0.40 | |
| | Gemini 2.5 Flash-Lite | gemini-2.5-flash-lite-preview-09-2025 | $0.10 | $0.40 | |
| qwen | Qwen3 Max | qwen3-max | $1.20 | $6.00 | |
| qwen | Qwen3 Max | qwen3-max-thinking | $1.20 | $6.00 | |
| DeepSeek | DeepSeek-V3 | deepseek-v3 | $0.77 | $0.77 | |
| Meta | Llama 4 Scout | llama-4-scout | $0.17 | $0.66 | |
| | Gemini 2.0 Flash-Lite | gemini-2.0-flash-lite | $0.08 | $0.30 | |
| Anthropic | Claude 3 Opus | claude-3-opus | $15.00 | $75.00 | |
| OpenAI | GPT-4.1 nano | gpt-4.1-nano | $0.10 | $0.40 | |
| OpenAI | GPT-4 Turbo | gpt-4-turbo | $10.00 | $30.00 | |
| qwen | Qwen3 235B A22B | qwen3-235b-a22b-thinking | $0.30 | $2.90 | |
| Amazon | Nova Pro | nova-pro | $0.80 | $3.20 | |
| Amazon | Nova Lite | nova-lite | $0.06 | $0.24 | |
| Anthropic | Claude 3.5 Haiku | claude-3.5-haiku | $0.80 | $4.00 | |
| OpenAI | GPT-4o mini | gpt-4o-mini | $0.15 | $0.60 | |
| Amazon | Nova Micro | nova-micro | $0.04 | $0.14 | |
| Anthropic | Claude 3 Haiku | claude-3-haiku | $0.25 | $1.25 | |
| xAI | Grok Code Fast 1 | grok-code-fast-1 | $0.20 | $1.50 | |
| DeepSeek | deepseek-v3.2 | deepseek-v3.2 | $0.26 | $0.38 | |
| xAI | Grok-4.1 Fast Non-Reasoning | grok-4.1-fast-non-reasoning | $0.20 | $0.50 | |
| OpenAI | gpt-5-chat | gpt-5-chat | $1.25 | $10.00 | |
| OpenAI | GPT-5.2 Codex | gpt-5.2-codex | $1.75 | $14.00 | |
| xAI | Grok-4.1 Fast Reasoning | grok-4.1-fast-reasoning | $0.20 | $0.50 | |
| xAI | Grok-4 Fast Non-Reasoning | grok-4-fast-non-reasoning | $0.20 | $0.50 | |
| OpenAI | gpt-oss-safeguard-20b | gpt-oss-safeguard-20b | $0.08 | $0.30 | |
| Azure | text-embedding-3-small | text-embedding-3-small | $0.02 | - | |
| Mistral | Mistral Small | mistral-small | $0.10 | $0.30 | |
| | Gemini 3 Pro Image | gemini-3-pro-image | $2.00 | $120.00 | |
| | Gemini 2.5 Flash Image (Nano Banana) | gemini-2.5-flash-image | $0.30 | $2.50 | |
| Azure | Ministral 3B | ministral-3b | $0.04 | $0.04 | |
| DeepSeek | deepseek-v3.2-thinking | deepseek-v3.2-thinking | $0.28 | $0.42 | |
| Mistral | Mistral Embed | mistral-embed | $0.10 | - | |
| Azure | GPT-5.2 Chat | gpt-5.2-chat | $1.75 | $14.00 | |
| xAI | Grok-4 Fast Reasoning | grok-4-fast-reasoning | $0.20 | $0.50 | |
| OpenAI | GPT-5.1 Codex Mini | gpt-5.1-codex-mini | $0.25 | $2.00 | |
| OpenAI | GPT-5 Codex | gpt-5-codex | $1.25 | $10.00 | |
| OpenAI | GPT-5.1 Codex | gpt-5.1-codex | $1.25 | $10.00 | |
| Azure | text-embedding-3-large | text-embedding-3-large | $0.13 | - | |
| DeepSeek | DeepSeek-R1 | deepseek-r1 | $1.35 | $5.40 | |
| | Gemini Embedding 001 | gemini-embedding-001 | $0.15 | - | |
| Moonshot AI | Kimi K2 0711 (free) | kimi-k2 | $0.50 | $2.00 | |
| Mistral | mistral-large-3 | mistral-large-3 | $0.50 | $1.50 | |
| Perplexity | Sonar | sonar | $1.00 | $1.00 | |
| Perplexity | Sonar Reasoning Pro | sonar-reasoning-pro | $2.00 | $8.00 | |
| Azure | GPT-5.1 Codex Max | gpt-5.1-codex-max | $1.25 | $10.00 | |
| Morph | Morph v3 Fast | morph-v3-fast | $0.80 | $1.20 | |
| DeepSeek | deepseek-v3.1-terminus | deepseek-v3.1-terminus | $0.27 | $1.00 | |
| qwen | Qwen3-Coder | qwen3-coder | $0.40 | $1.60 | |
| qwen | Qwen3 32B | qwen-3-32b | $0.10 | $0.30 | |
| Moonshot AI | Kimi K2 Thinking | kimi-k2-thinking | $0.60 | $2.50 | |
| Perplexity | Sonar Pro | sonar-pro | $3.00 | $15.00 | |
| Mistral | Pixtral 12B | pixtral-12b | $0.15 | $0.15 | |
| xAI | Grok 2 Vision | grok-2-vision | $2.00 | $10.00 | |
| Alibaba | Qwen3 Coder Plus | qwen3-coder-plus | $1.00 | $5.00 | |
| Nvidia | Nemotron Nano 12B 2 VL (free) | nemotron-nano-12b-v2-vl | $0.20 | $0.60 | |
| Moonshot AI | kimi-k2-thinking-turbo | kimi-k2-thinking-turbo | $1.15 | $8.00 | |
| qwen | Qwen3 Embedding 8B | qwen3-embedding-8b | $0.05 | - | |
| Mistral | Ministral 8B | ministral-8b | $0.10 | $0.10 | |
| Z.ai | GLM-4.6V-Flash | glm-4.6v-flash | - | - | |
| Azure | text-embedding-ada-002 | text-embedding-ada-002 | $0.10 | - | |
| Z.ai | GLM-4.7-FlashX | glm-4.7-flashx | $0.06 | $0.40 | |
| OpenAI | GPT-3.5-turbo | gpt-3.5-turbo | $0.50 | $1.50 | |
| qwen | Qwen3 Embedding 0.6B | qwen3-embedding-0.6b | $0.01 | - | |
| Amazon | nova-2-lite | nova-2-lite | $0.30 | $2.50 | |
| Mistral | mistral-medium | mistral-medium | $0.40 | $2.00 | |
| Azure | GPT-5 Pro | gpt-5-pro | $15.00 | $120.00 | |
| xAI | Grok 3 Fast | grok-3-fast | $5.00 | $25.00 | |
| Morph | Morph v3 Large | morph-v3-large | $0.90 | $1.90 | |
| Nvidia | Nemotron Nano 9B V2 (free) | nemotron-nano-9b-v2 | $0.06 | $0.23 | |
| Z.ai | glm-4.6v | glm-4.6v | $0.30 | $0.90 | |
| ByteDance Seed | Seed 1.6 | seed-1.6 | $0.25 | $2.00 | |
| Alibaba | Qwen3 14B | qwen-3-14b | $0.06 | $0.24 | |
| xAI | Grok 3 Mini Fast | grok-3-mini-fast | $0.60 | $4.00 | |
| Azure | Codex Mini | codex-mini | $1.50 | $6.00 | |
| Cohere | command-a-03-2025 | command-a | $2.50 | $10.00 | |
| Z.ai | GLM-4.5V | glm-4.5v | $0.60 | $1.80 | |
| OpenAI | o3-pro | o3-pro | $20.00 | $80.00 | |
| Arcee AI | Trinity Mini (free) | trinity-mini | $0.05 | $0.15 | |
| qwen | Qwen3 Embedding 4B | qwen3-embedding-4b | $0.02 | - | |
| Mistral | Mistral Nemo | mistral-nemo | $0.15 | $0.15 | |
| Mistral | Pixtral Large | pixtral-large | $2.00 | $6.00 | |
| Mistral | Mixtral 8x22B Instruct | mixtral-8x22b-instruct | $1.20 | $1.20 | |
| Mistral | Magistral Small | magistral-small | $0.50 | $1.50 | |
| Alibaba | qwen3-max-preview | qwen3-max-preview | $1.20 | $6.00 | |
| Mistral | Devstral Small 1.1 | devstral-small | $0.10 | $0.30 | |
| OpenAI | o3-deep-research | o3-deep-research | $10.00 | $40.00 | |
| | Imagen 4 Ultra | imagen-4.0-ultra-generate-001 | - | - | |
| Black Forest Labs | Flux Kontext Max | flux-kontext-max | - | - | |
| Black Forest Labs | flux-2-klein-9b | flux-2-klein-9b | - | - | |
| Black Forest Labs | Flux 2 Max | flux-2-max | - | - | |
| Perplexity | Sonar Reasoning | sonar-reasoning | $1.00 | $5.00 | |
| Black Forest Labs | Flux Kontext Pro | flux-kontext-pro | - | - | |
| | Imagen 4 Fast | imagen-4.0-fast-generate-001 | - | - | |
| | Imagen 4 Standard | imagen-4.0-generate-001 | - | - | |
| Azure | GPT-3.5 Turbo Instruct | gpt-3.5-turbo-instruct | $1.50 | $2.00 | |
| Recraft AI | Recraft V3 | recraft-v3 | - | - | |
| Black Forest Labs | Flux 2 Flex | flux-2-flex | - | - | |
| Black Forest Labs | Flux 2 Pro | flux-2-pro | - | - | |
| Black Forest Labs | flux-2-klein-4b | flux-2-klein-4b | - | - | |
| OpenAI | GPT-4o-mini Search Preview | gpt-4o-mini-search-preview | $0.15 | $0.60 | |
| Arcee AI | Trinity Large Preview (free) | trinity-large-preview | $0.25 | $1.00 | |
| Mistral | devstral-2 | devstral-2 | - | - | |
| qwen | Qwen3 Coder Next | qwen3-coder-next | $0.50 | $1.20 | |
| Z.ai | GLM-5 | glm-5 | $1.00 | $3.20 | |
| Cerebras | Llama 3.1 8B | llama-3.1-8b | $0.10 | $0.10 | |
| Minimax | MiniMax M2.5 | minimax-m2.5 | $0.30 | $1.20 | |
| xAI | grok-imagine-image | grok-imagine-image | - | - | |
| xAI | Grok Imagine Image Pro | grok-imagine-image-pro | - | - | |
| qwen | Qwen3.5 Plus 2026-02-15 | qwen3.5-plus | $0.40 | $2.40 | |
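
To compare prices from the table for a concrete workload, a per-request cost can be estimated from token counts. The sketch below assumes the Input and Output columns are USD per one million tokens, which the listing itself does not state. The `PRICES` dictionary and `estimate_cost` function are illustrative names, populated with three rows copied from the table.

```python
from decimal import Decimal

# Assumption: listed prices are USD per 1,000,000 tokens (not stated in the listing).
TOKENS_PER_PRICE_UNIT = 1_000_000

# (input price, output price) copied from three rows of the table above.
PRICES = {
    "gpt-5.2": (Decimal("1.75"), Decimal("14.00")),
    "claude-sonnet-4.6": (Decimal("3.00"), Decimal("15.00")),
    "gemini-3-flash": (Decimal("0.50"), Decimal("3.00")),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request for the given model."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


# Compare the same workload (10,000 input tokens, 2,000 output tokens) across models.
for name in PRICES:
    print(name, estimate_cost(name, 10_000, 2_000))
```

Under that unit assumption, the example workload costs $0.0455 on gpt-5.2, $0.06 on claude-sonnet-4.6, and $0.011 on gemini-3-flash. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.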