Vercel AI Gateway
vercel
Updated 2 months ago
Vercel AI Gateway is a powerful observability and routing layer for AI applications that provides analytics, cost tracking, and caching for requests to major AI providers. The platform supports unified access to models from OpenAI, Anthropic, Google, Meta, Mistral, and other providers through a single API endpoint. Vercel's AI Gateway enables rate limiting, request caching, and fallback mechanisms to improve reliability and reduce costs for AI-powered applications.
Browse 165 LLM models available from Vercel AI Gateway. Compare prices and features.
Models (165)
| Organization | Model Name | Original Model | Input | Output | Free | |||
|---|---|---|---|---|---|---|---|---|
|
|
Gemini 3.1 Pro |
gemini-3.1-pro-preview
|
$2.00 | $12.00 |
|
|||
|
|
Anthropic | Claude Sonnet 4.6 |
claude-sonnet-4.6
|
$3.00 | $15.00 |
|
||
|
|
Minimax | MiniMax M2.5 |
minimax-m2.5
|
$0.30 | $1.20 |
|
||
|
|
Anthropic | Claude Opus 4.6 |
claude-opus-4.6
|
$5.00 | $25.00 |
|
||
|
|
Z.ai | GLM-5 |
glm-5
|
$1.00 | $3.20 |
|
||
|
|
Moonshot AI | Kimi K2.5 |
kimi-k2.5
|
$0.50 | $2.80 | |||
|
|
Arcee AI | Trinity Large Preview (free) |
trinity-large-preview
|
$0.25 | $1.00 | |||
|
|
qwen | Qwen3 Coder Next |
qwen3-coder-next
|
$0.50 | $1.20 | |||
|
|
Gemini 3 Flash |
gemini-3-flash
|
$0.50 | $3.00 |
|
|||
|
|
Meituan | LongCat-Flash-Thinking-2601 |
longcat-flash-thinking-2601
|
- | - | |||
|
|
Z.ai | GLM-4.7 |
glm-4.7
|
$0.43 | $1.75 |
|
||
|
|
OpenAI | GPT-5.2 Pro |
gpt-5.2-pro
|
$21.00 | $168.00 | |||
|
|
Minimax | MiniMax M2.1 |
minimax-m2.1
|
$0.30 | $1.20 | |||
|
|
Xiaomi | MiMo-V2-Flash |
mimo-v2-flash
|
$0.10 | $0.30 | |||
|
|
OpenAI | GPT-5.2 |
gpt-5.2
|
$1.75 | $14.00 |
|
||
|
|
Z.ai | GLM-4.7-Flash |
glm-4.7-flash
|
- | - | |||
|
|
DeepSeek | DeepSeek-V3.2 |
deepseek-v3.2-thinking
|
$0.28 | $0.42 |
|
||
|
|
Gemini 3 Pro |
gemini-3-pro-preview
|
$2.00 | $12.00 | ||||
|
|
Nvidia | Nemotron 3 Nano (30B A3B) |
nemotron-3-nano-30b-a3b
|
$0.06 | $0.24 | |||
|
|
Anthropic | Claude Opus 4.5 |
claude-opus-4.5
|
$5.00 | $25.00 |
|
||
|
|
qwen | Qwen3 Max Thinking |
qwen3-max
|
$1.20 | $6.00 | |||
|
|
qwen | Qwen3 Max Thinking |
qwen3-max-thinking
|
$1.20 | $6.00 | |||
|
|
xAI | Grok-4.1 Fast Non-Reasoning |
grok-4.1-fast-non-reasoning
|
$0.20 | $0.50 |
|
||
|
|
OpenAI | GPT-5.1 Instant |
gpt-5.1-instant
|
$1.25 | $10.00 | |||
|
|
OpenAI | GPT-5.1 Thinking |
gpt-5.1-thinking
|
$1.25 | $10.00 | |||
|
|
OpenAI | GPT-5.1 Codex |
gpt-5.1-codex
|
$1.25 | $10.00 | |||
|
|
Minimax | MiniMax M2 |
minimax-m2
|
$0.30 | $1.20 | |||
|
|
Anthropic | Claude 4.5 Sonnet |
claude-sonnet-4.5
|
$3.00 | $15.00 |
|
||
|
|
Anthropic | Claude 4.5 Haiku |
claude-haiku-4.5
|
$1.00 | $5.00 |
|
||
|
|
Z.ai | GLM-4.6 |
glm-4.6
|
$0.45 | $1.80 | |||
|
|
OpenAI | GPT-5 Codex |
gpt-5-codex
|
$1.25 | $10.00 | |||
|
|
OpenAI | GPT-5.1 Codex Mini |
gpt-5.1-codex-mini
|
$0.25 | $2.00 | |||
|
|
OpenAI | GPT-5 |
gpt-5
|
$1.25 | $10.00 | |||
|
|
Meituan | LongCat-Flash-Thinking |
longcat-flash-thinking
|
$0.15 | $1.50 | |||
|
|
xAI | Grok Code Fast 1 |
grok-code-fast-1
|
$0.20 | $1.50 | |||
|
|
qwen | Qwen3-Next-80B-A3B-Thinking |
qwen3-next-80b-a3b-thinking
|
$0.15 | $1.50 | |||
|
|
OpenAI | GPT-5 mini |
gpt-5-mini
|
$0.25 | $2.00 |
|
||
|
|
Anthropic | Claude 4.1 Opus |
claude-opus-4.1
|
$15.00 | $75.00 | |||
|
|
Meituan | LongCat-Flash-Chat |
longcat-flash-chat
|
- | - | |||
|
|
xAI | Grok-4 |
grok-4
|
$3.00 | $15.00 | |||
|
|
OpenAI | GPT OSS 120B |
gpt-oss-120b
|
$0.25 | $0.69 |
|
||
|
|
Moonshot AI | Kimi K2 0905 |
kimi-k2-0905
|
$1.00 | $3.00 | |||
|
|
qwen | Qwen3-Next-80B-A3B-Instruct |
qwen3-next-80b-a3b-instruct
|
$0.09 | $1.10 | |||
|
|
OpenAI | GPT-5 nano |
gpt-5-nano
|
$0.05 | $0.40 | |||
|
|
Z.ai | GLM-4.5 |
glm-4.5
|
$0.60 | $2.20 | |||
|
|
Z.ai | GLM-4.5-Air |
glm-4.5-air
|
$0.20 | $1.10 | |||
|
|
OpenAI | GPT OSS 20B |
gpt-oss-20b
|
$0.07 | $0.30 | |||
|
|
Gemini 2.5 Pro Preview 06-05 |
gemini-2.5-pro
|
$1.25 | $10.00 | ||||
|
|
Anthropic | Claude 4 Sonnet |
claude-sonnet-4
|
$3.00 | $15.00 | |||
|
|
Anthropic | Claude 4 Opus |
claude-opus-4
|
$15.00 | $75.00 | |||
|
|
Gemini 2.5 Flash |
gemini-2.5-flash
|
$0.30 | $2.50 |
|
|||
|
|
Gemini 2.5 Flash |
gemini-2.5-flash-preview-09-2025
|
$0.30 | $2.50 |
|
|||
|
|
Gemini 2.5 Flash-Lite |
gemini-2.5-flash-lite
|
$0.10 | $0.40 |
|
|||
|
|
Gemini 2.5 Flash-Lite |
gemini-2.5-flash-lite-preview-09-2025
|
$0.10 | $0.40 |
|
|||
|
|
OpenAI | o4-mini |
o4-mini
|
$1.10 | $4.40 | |||
|
|
OpenAI | o3 |
o3
|
$2.00 | $8.00 | |||
|
|
Mistral | Magistral Medium |
magistral-medium
|
$2.00 | $5.00 | |||
|
|
qwen | Qwen3 32B |
qwen-3-32b
|
$0.10 | $0.30 | |||
|
|
xAI | Grok-3 |
grok-3
|
$3.00 | $15.00 | |||
|
|
xAI | Grok-3 Mini |
grok-3-mini
|
$0.30 | $0.50 | |||
|
|
Anthropic | Claude 3.7 Sonnet |
claude-3.7-sonnet
|
$3.00 | $15.00 | |||
|
|
Meta | Llama 4 Maverick |
llama-4-maverick
|
$0.24 | $0.97 | |||
|
|
qwen | Qwen3 235B A22B |
qwen3-235b-a22b-thinking
|
$0.30 | $2.90 | |||
|
|
OpenAI | GPT-4.1 |
gpt-4.1
|
$2.00 | $8.00 | |||
|
|
Meta | Llama 4 Scout |
llama-4-scout
|
$0.17 | $0.66 | |||
|
|
OpenAI | GPT-4.1 mini |
gpt-4.1-mini
|
$0.40 | $1.60 | |||
|
|
DeepSeek | DeepSeek-V3.1 |
deepseek-v3.1
|
$0.50 | $1.50 | |||
|
|
OpenAI | o3-mini |
o3-mini
|
$1.10 | $4.40 | |||
|
|
Gemini 2.0 Flash Thinking |
gemini-2.0-flash
|
$0.15 | $0.60 | ||||
|
|
OpenAI | GPT-4.1 nano |
gpt-4.1-nano
|
$0.10 | $0.40 | |||
|
|
OpenAI | o1 |
o1
|
$15.00 | $60.00 | |||
|
|
OpenAI | GPT-4o |
gpt-4o
|
$2.50 | $10.00 | |||
|
|
DeepSeek | DeepSeek-V3 |
deepseek-v3
|
$0.77 | $0.77 | |||
|
|
Gemini 2.0 Flash-Lite |
gemini-2.0-flash-lite
|
$0.08 | $0.30 | ||||
|
|
OpenAI | GPT-4o mini |
gpt-4o-mini
|
$0.15 | $0.60 |
|
||
|
|
Anthropic | Claude 3.5 Sonnet |
claude-3.5-sonnet
|
$3.00 | $15.00 | |||
|
|
Anthropic | Claude 3.5 Sonnet |
claude-3.5-sonnet-20240620
|
$3.00 | $15.00 | |||
|
|
Anthropic | Claude 3.5 Haiku |
claude-3.5-haiku
|
$0.80 | $4.00 | |||
|
|
Amazon | Nova Pro |
nova-pro
|
$0.80 | $3.20 | |||
|
|
Amazon | Nova Lite |
nova-lite
|
$0.06 | $0.24 | |||
|
|
Amazon | Nova Micro |
nova-micro
|
$0.04 | $0.14 | |||
|
|
OpenAI | GPT-4 Turbo |
gpt-4-turbo
|
$10.00 | $30.00 | |||
|
|
Anthropic | Claude 3 Opus |
claude-3-opus
|
$15.00 | $75.00 | |||
|
|
Anthropic | Claude 3 Haiku |
claude-3-haiku
|
$0.25 | $1.25 | |||
|
|
qwen | Qwen3.5 Plus 2026-02-15 |
qwen3.5-plus
|
$0.40 | $2.40 | |||
|
|
xAI | Grok Imagine Image Pro |
grok-imagine-image-pro
|
- | - | |||
|
|
xAI | grok-imagine-image |
grok-imagine-image
|
- | - | |||
|
|
Black Forest Labs | flux-2-klein-4b |
flux-2-klein-4b
|
- | - | |||
|
|
OpenAI | GPT-5.2 Codex |
gpt-5.2-codex
|
$1.75 | $14.00 | |||
|
|
ByteDance Seed | Seed 1.6 |
seed-1.6
|
$0.25 | $2.00 | |||
|
|
Black Forest Labs | Flux 2 Max |
flux-2-max
|
- | - | |||
|
|
Azure | GPT-5.2 Chat |
gpt-5.2-chat
|
$1.75 | $14.00 | |||
|
|
Z.ai | GLM-4.6V |
glm-4.6v
|
$0.30 | $0.90 | |||
|
|
Azure | GPT-5.1 Codex Max |
gpt-5.1-codex-max
|
$1.25 | $10.00 | |||
|
|
Arcee AI | Trinity Mini (free) |
trinity-mini
|
$0.05 | $0.15 | |||
|
|
Black Forest Labs | Flux 2 Flex |
flux-2-flex
|
- | - | |||
|
|
Black Forest Labs | Flux 2 Pro |
flux-2-pro
|
- | - | |||
|
|
Gemini 3 Pro Image |
gemini-3-pro-image
|
$2.00 | $120.00 | ||||
|
|
xAI | Grok-4.1 Fast Reasoning |
grok-4.1-fast-reasoning
|
$0.20 | $0.50 | |||
|
|
Moonshot AI | Kimi K2 Thinking |
kimi-k2-thinking
|
$0.60 | $2.50 | |||
|
|
Gemini Embedding 001 |
gemini-embedding-001
|
$0.15 | - | ||||
|
|
Azure | text-embedding-3-large |
text-embedding-3-large
|
$0.13 | - | |||
|
|
Azure | text-embedding-3-small |
text-embedding-3-small
|
$0.02 | - | |||
|
|
Azure | text-embedding-ada-002 |
text-embedding-ada-002
|
$0.10 | - | |||
|
|
OpenAI | gpt-oss-safeguard-20b |
gpt-oss-safeguard-20b
|
$0.08 | $0.30 | |||
|
|
Nvidia | Nemotron Nano 12B 2 VL (free) |
nemotron-nano-12b-v2-vl
|
$0.20 | $0.60 | |||
|
|
qwen | Qwen3 Embedding 4B |
qwen3-embedding-4b
|
$0.02 | - | |||
|
|
qwen | Qwen3 Embedding 8B |
qwen3-embedding-8b
|
$0.05 | - | |||
|
|
OpenAI | o3-deep-research |
o3-deep-research
|
$10.00 | $40.00 | |||
|
|
Gemini 2.5 Flash Image (Nano Banana) |
gemini-2.5-flash-image
|
$0.30 | $2.50 | ||||
|
|
Azure | GPT-5 Pro |
gpt-5-pro
|
$15.00 | $120.00 | |||
|
|
Alibaba | Qwen3 Coder Plus |
qwen3-coder-plus
|
$1.00 | $5.00 | |||
|
|
DeepSeek | deepseek-v3.1-terminus |
deepseek-v3.1-terminus
|
$0.27 | $1.00 | |||
|
|
Nvidia | Nemotron Nano 9B V2 (free) |
nemotron-nano-9b-v2
|
$0.06 | $0.23 | |||
|
|
xAI | Grok-4 Fast Non-Reasoning |
grok-4-fast-non-reasoning
|
$0.20 | $0.50 | |||
|
|
xAI | Grok-4 Fast Reasoning |
grok-4-fast-reasoning
|
$0.20 | $0.50 | |||
|
|
Z.ai | GLM-4.5V |
glm-4.5v
|
$0.60 | $1.80 | |||
|
|
OpenAI | gpt-5-chat |
gpt-5-chat
|
$1.25 | $10.00 | |||
|
|
Mistral | Devstral Small |
devstral-small
|
$0.10 | $0.30 | |||
|
|
Morph | Morph v3 Fast |
morph-v3-fast
|
$0.80 | $1.20 | |||
|
|
Morph | Morph v3 Large |
morph-v3-large
|
$0.90 | $1.90 | |||
|
|
OpenAI | o3-pro |
o3-pro
|
$20.00 | $80.00 | |||
|
|
Alibaba | Qwen3 14B |
qwen-3-14b
|
$0.06 | $0.24 | |||
|
|
Cohere | Command A |
command-a
|
$2.50 | $10.00 | |||
|
|
OpenAI | GPT-4o-mini Search Preview |
gpt-4o-mini-search-preview
|
$0.15 | $0.60 | |||
|
|
Perplexity | Sonar Pro |
sonar-pro
|
$3.00 | $15.00 | |||
|
|
Perplexity | Sonar Reasoning Pro |
sonar-reasoning-pro
|
$2.00 | $8.00 | |||
|
|
Perplexity | Sonar |
sonar
|
$1.00 | $1.00 | |||
|
|
DeepSeek | DeepSeek R1 |
deepseek-r1
|
$1.35 | $5.40 | |||
|
|
qwen | Qwen3-Coder |
qwen3-coder
|
$0.40 | $1.60 | |||
|
|
Mistral | Pixtral Large |
pixtral-large
|
$2.00 | $6.00 | |||
|
|
Mistral | Pixtral 12B |
pixtral-12b
|
$0.15 | $0.15 | |||
|
|
Mistral | Mistral Nemo |
mistral-nemo
|
$0.15 | $0.15 | |||
|
|
Mistral | Mixtral 8x22B Instruct |
mixtral-8x22b-instruct
|
$1.20 | $1.20 | |||
|
|
Azure | GPT-3.5 Turbo Instruct |
gpt-3.5-turbo-instruct
|
$1.50 | $2.00 | |||
|
|
OpenAI | GPT-3.5 Turbo |
gpt-3.5-turbo
|
$0.50 | $1.50 | |||
|
|
Azure | Codex Mini |
codex-mini
|
$1.50 | $6.00 | |||
|
|
DeepSeek | deepseek-v3.2 |
deepseek-v3.2
|
$0.26 | $0.38 | |||
|
|
Mistral | Devstral 2 |
devstral-2
|
- | - | |||
|
|
Black Forest Labs | Flux Kontext Max |
flux-kontext-max
|
- | - | |||
|
|
Black Forest Labs | Flux Kontext Pro |
flux-kontext-pro
|
- | - | |||
|
|
Black Forest Labs | flux-2-klein-9b |
flux-2-klein-9b
|
- | - | |||
|
|
Z.ai | GLM-4.6V-Flash |
glm-4.6v-flash
|
- | - | |||
|
|
Z.ai | GLM-4.7-FlashX |
glm-4.7-flashx
|
$0.06 | $0.40 | |||
|
|
xAI | Grok 2 Vision |
grok-2-vision
|
$2.00 | $10.00 | |||
|
|
xAI | Grok 3 Fast |
grok-3-fast
|
$5.00 | $25.00 | |||
|
|
xAI | Grok 3 Mini Fast |
grok-3-mini-fast
|
$0.60 | $4.00 | |||
|
|
Imagen 4 Fast |
imagen-4.0-fast-generate-001
|
- | - | ||||
|
|
Imagen 4 Standard |
imagen-4.0-generate-001
|
- | - | ||||
|
|
Imagen 4 Ultra |
imagen-4.0-ultra-generate-001
|
- | - | ||||
|
|
Moonshot AI | Kimi K2 0711 (free) |
kimi-k2
|
$0.50 | $2.00 | |||
|
|
Moonshot AI | kimi-k2-thinking-turbo |
kimi-k2-thinking-turbo
|
$1.15 | $8.00 | |||
|
|
Cerebras | Llama 3.1 8B |
llama-3.1-8b
|
$0.10 | $0.10 | |||
|
|
Mistral | Magistral Small 1 |
magistral-small
|
$0.50 | $1.50 | |||
|
|
Azure | Ministral 3B |
ministral-3b
|
$0.04 | $0.04 | |||
|
|
Mistral | Ministral 8B |
ministral-8b
|
$0.10 | $0.10 | |||
|
|
Mistral | Mistral Embed |
mistral-embed
|
$0.10 | - | |||
|
|
Mistral | Mistral Large 3 |
mistral-large-3
|
$0.50 | $1.50 | |||
|
|
Mistral | Mistral Medium |
mistral-medium
|
$0.40 | $2.00 | |||
|
|
Mistral | Mistral Small |
mistral-small
|
$0.10 | $0.30 | |||
|
|
Amazon | nova-2-lite |
nova-2-lite
|
$0.30 | $2.50 | |||
|
|
qwen | Qwen3 Embedding 0.6B |
qwen3-embedding-0.6b
|
$0.01 | - | |||
|
|
Alibaba | qwen3-max-preview |
qwen3-max-preview
|
$1.20 | $6.00 | |||
|
|
Recraft AI | Recraft V3 |
recraft-v3
|
- | - | |||
|
|
Perplexity | Sonar Reasoning |
sonar-reasoning
|
$1.00 | $5.00 |