Fireworks AI icon

Fireworks AI

fireworks

Updated 1 hour ago

Fireworks AI is a high-performance AI inference platform that provides fast, affordable access to over 200 open-source and proprietary AI models. The platform specializes in production-grade inference with ultra-low latency, offering models including Llama, Qwen, DeepSeek, Mistral, Google Gemma, FLUX image models, and more. Fireworks features serverless deployment, custom fine-tuning capabilities, and competitive pricing per 1M tokens. The platform is known for its speed and reliability, with models available through OpenAI-compatible APIs and dedicated instances for enterprise workloads.

Browse 141 LLM models available from Fireworks AI. Compare prices and features.

Models (141)

Organization Model Name Original Model Input Output Free
DeepSeek
DeepSeek DeepSeek-V4-Pro-Max deepseek-v4-pro $1.74 $3.48
Moonshot AI
Moonshot AI Kimi K2.6 kimi-k2p6 $0.95 $4.00
Z.ai
Z.ai GLM-5.1 glm-5p1 $1.40 $4.40
Minimax
Minimax MiniMax M2.7 minimax-m2p7 $0.30 $1.20
qwen
qwen Qwen3.6 Plus qwen3p6-plus - -
google
google Gemma 4 31B gemma-4-31b-it $0.90 $0.90
google
google Gemma 4 26B-A4B gemma-4-26b-a4b-it $0.50 $0.50
qwen
qwen Qwen3.5-27B qwen3p5-27b $0.90 $0.90
qwen
qwen Qwen3.5-122B-A10B qwen3p5-122b-a10b $1.20 $1.20
Minimax
Minimax MiniMax M2.5 minimax-m2p5 $0.30 $1.20
qwen
qwen Qwen3.5-35B-A3B qwen3p5-35b-a3b $0.50 $0.50
qwen
qwen Qwen3.5-397B-A17B qwen3p5-397b-a17b $0.60 $3.60
qwen
qwen Qwen3.5-9B qwen3p5-9b $0.20 $0.20
Z.ai
Z.ai GLM-5 glm-5 $1.00 $3.20
Moonshot AI
Moonshot AI Kimi K2.5 kimi-k2p5 $0.60 $3.00
Z.ai
Z.ai GLM-4.7 glm-4p7 $0.60 $2.20
Minimax
Minimax MiniMax M2.1 minimax-m2p1 $0.30 $1.20
Z.ai
Z.ai GLM-4.7-Flash glm-4p7-flash $0.50 $0.50
Z.ai
Z.ai GLM-4.6 glm-4p6 $0.55 $2.19
Minimax
Minimax MiniMax M2 minimax-m2 $0.30 $1.20
qwen
qwen Qwen3 VL 235B A22B qwen3-vl-235b-a22b-thinking $0.22 $0.88
qwen
qwen Qwen3 VL 235B A22B Instruct qwen3-vl-235b-a22b-instruct $0.22 $0.88
qwen
qwen Qwen3 VL 30B A3B qwen3-vl-30b-a3b-thinking $0.15 $0.60
qwen
qwen Qwen3-Next-80B-A3B-Thinking qwen3-next-80b-a3b-thinking $0.90 $0.90
qwen
qwen Qwen3 VL 8B Instruct qwen3-vl-8b-instruct $0.20 $0.20
qwen
qwen Qwen3 VL 30B A3B Instruct qwen3-vl-30b-a3b-instruct $0.15 $0.60
qwen
qwen Qwen3 VL 32B Instruct qwen3-vl-32b-instruct $0.90 $0.90
OpenAI
OpenAI GPT OSS 120B gpt-oss-120b $0.15 $0.60
Moonshot AI
Moonshot AI Kimi K2-Instruct-0905 kimi-k2-instruct-0905 $0.60 $2.50
qwen
qwen Qwen3-Next-80B-A3B-Instruct qwen3-next-80b-a3b-instruct $0.90 $0.90
qwen
qwen Qwen3-235B-A22B-Thinking-2507 qwen3-235b-a22b-thinking-2507 $0.22 $0.88
Z.ai
Z.ai GLM-4.5 glm-4p5 $0.55 $2.19
OpenAI
OpenAI GPT OSS 20B gpt-oss-20b $0.07 $0.30
Z.ai
Z.ai GLM-4.5-Air glm-4p5-air $0.22 $0.88
Nvidia
Nvidia NVIDIA Nemotron Nano 9B V2 nvidia-nemotron-nano-9b-v2 $0.20 $0.20
qwen
qwen Qwen3-235B-A22B-Instruct-2507 qwen3-235b-a22b-instruct-2507 $0.22 $0.88
DeepSeek
DeepSeek DeepSeek-R1-0528 deepseek-r1-0528 $1.35 $5.40
Moonshot AI
Moonshot AI Kimi K2 Instruct kimi-k2-instruct $0.60 $2.50
qwen
qwen Qwen3 32B qwen3-32b $0.90 $0.90
Minimax
Minimax MiniMax M1 80K minimax-m1-80k $0.10 $0.10
qwen
qwen Qwen3 30B A3B qwen3-30b-a3b $0.15 $0.60
qwen
qwen Qwen3 235B A22B qwen3-235b-a22b $0.22 $0.88
qwen
qwen Qwen3-Coder 480B A35B Instruct qwen3-coder-480b-a35b-instruct $0.45 $1.80
DeepSeek
DeepSeek DeepSeek-V3 0324 deepseek-v3-0324 $0.90 $0.90
DeepSeek
DeepSeek DeepSeek-V3.1 deepseek-v3p1 $0.56 $1.68
qwen
qwen QwQ-32B qwq-32b $0.90 $0.90
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b $0.90 $0.90
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 32B deepseek-r1-distill-qwen-32b $0.90 $0.90
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 14B deepseek-r1-distill-qwen-14b $0.20 $0.20
google
google Gemma 3 27B gemma-3-27b-it $0.90 $0.90
DeepSeek
DeepSeek DeepSeek-V3 deepseek-v3 $0.90 $0.90
google
google Gemma 3 12B gemma-3-12b-it $0.20 $0.20
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 7B deepseek-r1-distill-qwen-7b $0.20 $0.20
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 8B deepseek-r1-distill-llama-8b $0.20 $0.20
google
google Gemma 3 4B gemma-3-4b-it $0.20 $0.20
google
google Gemma 3 1B gemma-3-1b-it $0.10 $0.10
Mistral
Mistral Mistral Small 3 24B Instruct mistral-small-24b-instruct-2501 $0.90 $0.90
Meta
Meta Llama 3.3 70B Instruct llama-v3p3-70b-instruct $0.90 $0.90
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 1.5B deepseek-r1-distill-qwen-1p5b $0.10 $0.10
qwen
qwen Qwen2.5 Instruct 32B qwen2p5-32b-instruct $0.90 $0.90
qwen
qwen Qwen2.5 72B Instruct qwen2p5-72b-instruct $0.90 $0.90
qwen
qwen Qwen2.5 72B Instruct qwen2p5-72b $0.90 $0.90
Meta
Meta Llama 3.1 70B Instruct llama-v3p1-70b-instruct $0.90 $0.90
DeepSeek
DeepSeek DeepSeek-V2.5 deepseek-v2p5 $1.20 $1.20
qwen
qwen Qwen2.5 14B Instruct qwen2p5-14b-instruct $0.20 $0.20
qwen
qwen Qwen2.5 14B Instruct qwen-v2p5-14b-instruct $0.20 $0.20
Meta
Meta Llama 3.2 3B Instruct llama-v3p2-3b-instruct $0.10 $0.10
Meta
Meta Llama 3.1 405B Instruct llama-v3p1-405b-instruct $3.00 $3.00
qwen
qwen Qwen2 72B Instruct qwen2-72b-instruct $0.90 $0.90
Meta
Meta Llama 3.1 8B Instruct llama-v3p1-8b-instruct $0.20 $0.20
qwen
qwen Qwen2 7B Instruct qwen2-7b-instruct $0.20 $0.20
Allen Institute for AI
Allen Institute for AI Molmo2 8B (free) molmo2-8b $0.20 $0.20
Mistral
Mistral MiniStral 3 (14B Instruct 2512) ministral-3-14b-instruct-2512 $0.20 $0.20
Mistral
Mistral Ministral 3 (3B Instruct 2512) ministral-3-3b-instruct-2512 $0.10 $0.10
Mistral
Mistral Ministral 3 (8B Instruct 2512) ministral-3-8b-instruct-2512 $0.20 $0.20
Moonshot AI
Moonshot AI Kimi K2 Thinking kimi-k2-thinking $0.60 $2.50
OpenAI
OpenAI gpt-oss-safeguard-20b gpt-oss-safeguard-20b $0.50 $0.50
qwen
qwen Qwen3 Embedding 4B qwen3-embedding-4b - -
qwen
qwen Qwen3 Embedding 8B qwen3-embedding-8b - -
DeepSeek
DeepSeek deepseek-v3.1-terminus deepseek-v3p1-terminus $0.56 $1.68
SiliconFlow
SiliconFlow Qwen/Qwen3-30B-A3B-Thinking-2507 qwen3-30b-a3b-thinking-2507 $0.90 $0.90
Z.ai
Z.ai GLM-4.5V glm-4p5v $1.20 $1.20
Alibaba
Alibaba Qwen3 Coder 30B A3B qwen3-coder-30b-a3b-instruct $0.15 $0.60
Alibaba
Alibaba qwen3-30b-a3b-instruct-2507 qwen3-30b-a3b-instruct-2507 $0.50 $0.50
Mistral
Mistral Devstral Small devstral-small-2505 $0.90 $0.90
qwen
qwen Qwen3 4B qwen3-4b $0.20 $0.20
Alibaba
Alibaba Qwen3 14B qwen3-14b $0.20 $0.20
Alibaba
Alibaba Qwen3 8B qwen3-8b $0.20 $0.20
Alibaba
Alibaba qwen2.5-vl-32b-instruct qwen2p5-vl-32b-instruct $0.90 $0.90
Groq
Groq Llama Guard 3 8B llama-guard-3-8b $0.20 $0.20
Alibaba
Alibaba qwen2.5-vl-72b-instruct qwen2p5-vl-72b-instruct $0.90 $0.90
DeepSeek
DeepSeek DeepSeek R1 deepseek-r1 $1.35 $5.40
Nvidia
Nvidia Llama 3.2 11b Vision Instruct llama-v3p2-11b-vision-instruct $0.20 $0.20
Meta
Meta llama-3.2-1b-instruct llama-v3p2-1b-instruct $0.10 $0.10
qwen
qwen Qwen2.5 Coder 7B qwen2p5-coder-7b-instruct $0.20 $0.20
qwen
qwen Qwen2.5-Coder 32B Instruct qwen2p5-coder-32b-instruct $0.90 $0.90
Alibaba
Alibaba Qwen2.5-VL 7B Instruct qwen2p5-vl-7b-instruct $0.20 $0.20
Microsoft Phi-3.5-vision-instruct phi-3-vision-128k-instruct $0.20 $0.20
Mistral
Mistral Mistral NeMo Instruct mistral-nemo-instruct-2407 $0.20 $0.20
google
google Gemma 2 9B gemma2-9b-it $0.20 $0.20
Mistral
Mistral Mistral 7B mistral-7b $0.20 $0.20
Mistral
Mistral Mistral 7B Instruct v0.3 mistral-7b-instruct-v3 $0.20 $0.20
Meta
Meta LlamaGuard 2 8B llama-guard-2-8b $0.20 $0.20
Meta
Meta llama-3-70b-instruct llama-v3-70b-instruct $0.90 $0.90
Meta
Meta llama-3-8b-instruct llama-v3-8b-instruct $0.20 $0.20
Mistral
Mistral Mixtral 8x22B Instruct mixtral-8x22b-instruct $1.20 $1.20
Mistral
Mistral mistral-7b-instruct-v0.2 mistral-7b-instruct-v0p2 $0.20 $0.20
Mistral
Mistral Mixtral 8x7B mixtral-8x7b-instruct $0.50 $0.50
Mistral
Mistral Mixtral 8x7B mixtral-8x7b $0.50 $0.50
Nvidia
Nvidia Codegemma 7b codegemma-7b $0.20 $0.20
Nvidia
Nvidia Codellama 70b code-llama-70b $0.90 $0.90
Meta
Meta codellama-34b-instruct code-llama-34b-instruct $0.90 $0.90
Meta
Meta codellama-70b-instruct code-llama-70b-instruct $0.90 $0.90
DeepSeek
DeepSeek DeepSeek Prover V2 deepseek-prover-v2 $1.20 $1.20
DeepSeek
DeepSeek deepseek-v3.2 deepseek-v3p2 $0.56 $1.68
Black Forest Labs
Black Forest Labs FLUX.1-schnell flux-1-schnell - -
google
google gemma-2b-it gemma-2b-it $0.10 $0.10
google
google gemma-7b-it gemma-7b-it $0.20 $0.20
Meta
Meta Llama 3 8B llama-v3-8b $0.20 $0.20
Nvidia
Nvidia Llama 3.1 Nemotron 70B llama-v3p1-nemotron-70b-instruct $0.90 $0.90
Meta
Meta Llama 3.2 1B llama-v3p2-1b $0.10 $0.10
Meta
Meta Llama 3.2 3B llama-v3p2-3b $0.10 $0.10
Meta
Meta llama-2-13b-chat llama-v2-13b-chat $0.20 $0.20
Meta
Meta llama-2-7b-chat llama-v2-7b-chat $0.20 $0.20
Azure
Azure Llama-3.2-90B-Vision-Instruct llama-v3p2-90b-vision-instruct $0.90 $0.90
Nvidia
Nvidia Mistral Large 3 675B Instruct 2512 mistral-large-3-fp8 $1.20 $1.20
Mistral
Mistral Mixtral 8x22B mixtral-8x22b $1.20 $1.20
NousResearch
NousResearch nous-hermes-2-mixtral-8x7b-dpo nous-hermes-2-mixtral-8x7b-dpo $0.50 $0.50
WandB
WandB NVIDIA Nemotron 3 Super 120B nvidia-nemotron-3-super-120b-a12b-fp8 $0.90 $0.90
NousResearch
NousResearch openhermes-2.5-mistral-7b openhermes-2p5-mistral-7b $0.20 $0.20
Microsoft phi-3-mini-128k-instruct phi-3-mini-128k-instruct $0.10 $0.10
Alibaba
Alibaba qwen1.5-72b-chat qwen1p5-72b-chat $0.90 $0.90
qwen
qwen Qwen2.5 VL 3B Instruct qwen2p5-vl-3b-instruct $0.20 $0.20
qwen
qwen Qwen3 0.6B qwen3-0p6b $0.10 $0.10
qwen
qwen Qwen3 1.7B qwen3-1p7b $0.10 $0.10
qwen
qwen Qwen3 Embedding 0.6B qwen3-embedding-0p6b - -
SiliconFlow
SiliconFlow Qwen3 Omni 30B A3B qwen3-omni-30b-a3b-instruct $0.50 $0.50
qwen
qwen Qwen3-Reranker-0.6B qwen3-reranker-0p6b - -
qwen
qwen Qwen3-Reranker-4B qwen3-reranker-4b - -
qwen
qwen Qwen3-Reranker-8B qwen3-reranker-8b - -
SiliconFlow
SiliconFlow Seed-OSS-36B-Instruct seed-oss-36b-instruct $0.90 $0.90