Fireworks AI
Fireworks AI is a high-performance AI inference platform that provides fast, affordable access to over 200 open-source and proprietary AI models. The platform specializes in production-grade inference with ultra-low latency, offering models including Llama, Qwen, DeepSeek, Mistral, Google Gemma, FLUX image models, and more. Fireworks features serverless deployment, custom fine-tuning capabilities, and competitive pricing per 1M tokens. The platform is known for its speed and reliability, with models available through OpenAI-compatible APIs and dedicated instances for enterprise workloads.
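Because the models are served through an OpenAI-compatible API, the standard `openai` Python client can be pointed at the Fireworks endpoint. A minimal sketch, assuming the `openai` package is installed, a `FIREWORKS_API_KEY` environment variable is set, and the `accounts/fireworks/models/<slug>` identifier convention (the slugs appear in the table below):

```python
import os


def model_id(slug: str) -> str:
    """Expand a table slug (e.g. "llama-v3p1-8b-instruct") into the full
    Fireworks model identifier used in API requests."""
    return f"accounts/fireworks/models/{slug}"


def ask(prompt: str, slug: str = "llama-v3p1-8b-instruct") -> str:
    """Send one chat completion request to the OpenAI-compatible endpoint."""
    from openai import OpenAI  # imported lazily; requires `pip install openai`

    client = OpenAI(
        base_url="https://api.fireworks.ai/inference/v1",
        api_key=os.environ["FIREWORKS_API_KEY"],
    )
    resp = client.chat.completions.create(
        model=model_id(slug),
        messages=[{"role": "user", "content": prompt}],
        max_tokens=128,
    )
    return resp.choices[0].message.content


if __name__ == "__main__" and "FIREWORKS_API_KEY" in os.environ:
    print(ask("In one sentence, what is a mixture-of-experts model?"))
```

Swapping models is just a matter of changing the slug; the request shape stays identical across all of them.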
Browse 122 LLM models available from Fireworks AI. Compare prices and features.
Models (122)
Prices are USD per 1M tokens. A dash means no per-token price is listed (image, embedding, and reranker models); free-tier variants are marked "(free)" in the model name.

| Organization | Model Name | Model ID | Input ($/1M) | Output ($/1M) |
|---|---|---|---|---|
| Moonshot AI | Kimi K2.5 | kimi-k2p5 | $0.60 | $3.00 |
| Z.ai | GLM-4.7 | glm-4p7 | $0.60 | $2.20 |
| qwen | Qwen3-235B-A22B-Thinking-2507 | qwen3-235b-a22b-thinking-2507 | $0.22 | $0.88 |
| Minimax | MiniMax M2.1 | minimax-m2p1 | $0.30 | $1.20 |
| Z.ai | GLM-4.6 | glm-4p6 | $0.55 | $2.19 |
| DeepSeek | DeepSeek-R1-0528 | deepseek-r1-0528 | $1.35 | $5.40 |
| OpenAI | GPT OSS 120B | gpt-oss-120b | $0.15 | $0.60 |
| Z.ai | GLM-4.5 | glm-4p5 | $0.55 | $2.19 |
| Minimax | MiniMax M2 | minimax-m2 | $0.30 | $1.20 |
| qwen | Qwen3-235B-A22B-Instruct-2507 | qwen3-235b-a22b-instruct-2507 | $0.22 | $0.88 |
| qwen | Qwen3-Next-80B-A3B-Thinking | qwen3-next-80b-a3b-thinking | $0.90 | $0.90 |
| Z.ai | GLM-4.7-Flash | glm-4p7-flash | $0.50 | $0.50 |
| Moonshot AI | Kimi K2-Instruct-0905 | kimi-k2-instruct-0905 | $0.60 | $2.50 |
| Moonshot AI | Kimi K2 Instruct | kimi-k2-instruct | $0.60 | $2.50 |
| Z.ai | GLM-4.5-Air | glm-4p5-air | $0.22 | $0.88 |
| DeepSeek | DeepSeek-V3.1 | deepseek-v3p1 | $0.56 | $1.68 |
| qwen | Qwen3 VL 30B A3B Thinking | qwen3-vl-30b-a3b-thinking | $0.15 | $0.60 |
| qwen | Qwen3-Next-80B-A3B-Instruct | qwen3-next-80b-a3b-instruct | $0.90 | $0.90 |
| OpenAI | GPT OSS 20B | gpt-oss-20b | $0.07 | $0.30 |
| qwen | Qwen3 VL 30B A3B Instruct | qwen3-vl-30b-a3b-instruct | $0.15 | $0.60 |
| Minimax | MiniMax M1 80K | minimax-m1-80k | $0.10 | $0.10 |
| qwen | Qwen3 VL 32B Instruct | qwen3-vl-32b-instruct | $0.90 | $0.90 |
| DeepSeek | DeepSeek-V3 0324 | deepseek-v3-0324 | $0.90 | $0.90 |
| qwen | Qwen3 30B A3B | qwen3-30b-a3b | $0.15 | $0.60 |
| qwen | QwQ-32B | qwq-32b | $0.90 | $0.90 |
| DeepSeek | DeepSeek R1 Distill Llama 70B | deepseek-r1-distill-llama-70b | $0.90 | $0.90 |
| Nvidia | Nemotron Nano 9B v2 | nvidia-nemotron-nano-9b-v2 | $0.20 | $0.20 |
| DeepSeek | DeepSeek R1 Distill Qwen 32B | deepseek-r1-distill-qwen-32b | $0.90 | $0.90 |
| DeepSeek | DeepSeek R1 Distill Qwen 14B | deepseek-r1-distill-qwen-14b | $0.20 | $0.20 |
| DeepSeek | DeepSeek-V3 | deepseek-v3 | $0.90 | $0.90 |
| Meta | Llama 3.1 405B Instruct | llama-v3p1-405b-instruct | $3.00 | $3.00 |
| qwen | Qwen2.5 32B Instruct | qwen2p5-32b-instruct | $0.90 | $0.90 |
| DeepSeek | DeepSeek R1 Distill Qwen 7B | deepseek-r1-distill-qwen-7b | $0.20 | $0.20 |
| DeepSeek | DeepSeek R1 Distill Llama 8B | deepseek-r1-distill-llama-8b | $0.20 | $0.20 |
| qwen | Qwen2.5 72B Instruct | qwen2p5-72b-instruct | $0.90 | $0.90 |
| qwen | Qwen3 235B A22B | qwen3-235b-a22b | $0.22 | $0.88 |
| qwen | Qwen2.5 14B Instruct | qwen2p5-14b-instruct | $0.20 | $0.20 |
| qwen | Qwen2.5 14B Instruct | qwen-v2p5-14b-instruct | $0.20 | $0.20 |
| Mistral | Mistral Small 3 24B Instruct | mistral-small-24b-instruct-2501 | $0.90 | $0.90 |
| Google | Gemma 3 27B | gemma-3-27b-it | $0.90 | $0.90 |
| qwen | Qwen2 72B Instruct | qwen2-72b-instruct | $0.90 | $0.90 |
| Meta | Llama 3.1 70B Instruct | llama-v3p1-70b-instruct | $0.90 | $0.90 |
| Google | Gemma 3 12B | gemma-3-12b-it | $0.20 | $0.20 |
| DeepSeek | DeepSeek R1 Distill Qwen 1.5B | deepseek-r1-distill-qwen-1p5b | $0.10 | $0.10 |
| Meta | Llama 3.2 3B Instruct | llama-v3p2-3b-instruct | $0.10 | $0.10 |
| Google | Gemma 3 4B | gemma-3-4b-it | $0.20 | $0.20 |
| Meta | Llama 3.1 8B Instruct | llama-v3p1-8b-instruct | $0.20 | $0.20 |
| qwen | Qwen2 7B Instruct | qwen2-7b-instruct | $0.20 | $0.20 |
| Black Forest Labs | Flux Kontext Pro | flux-kontext-pro | - | - |
| Black Forest Labs | Flux Kontext Max | flux-kontext-max | - | - |
| DeepSeek | DeepSeek-V3.2 | deepseek-v3p2 | $0.56 | $1.68 |
| Black Forest Labs | FLUX.1-dev (FP8) | flux-1-dev-fp8 | - | - |
| Moonshot AI | Kimi K2 Thinking | kimi-k2-thinking | $0.60 | $2.50 |
| Minimax | MiniMax M2.5 | minimax-m2p5 | $0.30 | $1.20 |
| Alibaba | Qwen3 8B | qwen3-8b | $0.20 | $0.20 |
| qwen | Qwen3 Embedding 8B | qwen3-embedding-8b | - | - |
| qwen | Qwen3-Reranker-8B | qwen3-reranker-8b | - | - |
| Z.ai | GLM-5 | glm-5 | $1.00 | $3.20 |
| qwen | Qwen3 VL 235B A22B Instruct | qwen3-vl-235b-a22b-instruct | $0.22 | $0.88 |
| qwen | Qwen3-Coder 480B A35B Instruct | qwen3-coder-480b-a35b-instruct | $0.45 | $1.80 |
| DeepSeek | DeepSeek-R1 | deepseek-r1 | $1.35 | $5.40 |
| Allen Institute for AI | Molmo2 8B (free) | molmo2-8b | $0.20 | $0.20 |
| SiliconFlow | Qwen/Qwen3-Omni-30B-A3B-Instruct | qwen3-omni-30b-a3b-instruct | $0.50 | $0.50 |
| SiliconFlow | ByteDance-Seed/Seed-OSS-36B-Instruct | seed-oss-36b-instruct | $0.90 | $0.90 |
| qwen | Qwen3 VL 8B Instruct | qwen3-vl-8b-instruct | $0.20 | $0.20 |
| Nvidia | Mistral Large 3 675B Instruct 2512 | mistral-large-3-fp8 | $1.20 | $1.20 |
| Mistral | Ministral 3 (14B Instruct 2512) | ministral-3-14b-instruct-2512 | $0.20 | $0.20 |
| Mistral | Ministral 3 (8B Instruct 2512) | ministral-3-8b-instruct-2512 | $0.20 | $0.20 |
| Mistral | Ministral 3 (3B Instruct 2512) | ministral-3-3b-instruct-2512 | $0.10 | $0.10 |
| OpenAI | GPT OSS Safeguard 20B | gpt-oss-safeguard-20b | $0.50 | $0.50 |
| qwen | Qwen3 Embedding 0.6B | qwen3-embedding-0p6b | - | - |
| qwen | Qwen3 Embedding 4B | qwen3-embedding-4b | - | - |
| qwen | Qwen3-Reranker-4B | qwen3-reranker-4b | - | - |
| qwen | Qwen3-Reranker-0.6B | qwen3-reranker-0p6b | - | - |
| DeepSeek | DeepSeek-V3.1-Terminus | deepseek-v3p1-terminus | $0.56 | $1.68 |
| Z.ai | GLM-4.5V | glm-4p5v | $1.20 | $1.20 |
| SiliconFlow | Qwen/Qwen3-30B-A3B-Thinking-2507 | qwen3-30b-a3b-thinking-2507 | $0.90 | $0.90 |
| Alibaba | Qwen3-30B-A3B-Instruct-2507 | qwen3-30b-a3b-instruct-2507 | $0.50 | $0.50 |
| Mistral | Devstral Small 2505 | devstral-small-2505 | $0.90 | $0.90 |
| DeepSeek | DeepSeek Prover V2 | deepseek-prover-v2 | $1.20 | $1.20 |
| qwen | Qwen3 1.7B | qwen3-1p7b | $0.10 | $0.10 |
| qwen | Qwen3 4B (free) | qwen3-4b | $0.20 | $0.20 |
| qwen | Qwen3 32B | qwen3-32b | $0.90 | $0.90 |
| qwen | Qwen3 0.6B | qwen3-0p6b | $0.10 | $0.10 |
| Alibaba | Qwen3 14B | qwen3-14b | $0.20 | $0.20 |
| Alibaba | Qwen2.5-VL 72B Instruct | qwen2p5-vl-72b-instruct | $0.90 | $0.90 |
| Alibaba | Qwen2.5-VL 32B Instruct | qwen2p5-vl-32b-instruct | $0.90 | $0.90 |
| Alibaba | Qwen2.5-VL 7B Instruct | qwen2p5-vl-7b-instruct | $0.20 | $0.20 |
| qwen | Qwen2.5 VL 3B Instruct | qwen2p5-vl-3b-instruct | $0.20 | $0.20 |
| Meta | Llama 3 8B (Base) | llama-v3-8b | $0.20 | $0.20 |
| qwen | Qwen2.5-Coder 32B Instruct | qwen2p5-coder-32b-instruct | $0.90 | $0.90 |
| Groq | Llama Guard 3 8B | llama-guard-3-8b | $0.20 | $0.20 |
| Black Forest Labs | FLUX.1-schnell | flux-1-schnell | - | - |
| qwen | Qwen2.5-Coder 7B Instruct | qwen2p5-coder-7b-instruct | $0.20 | $0.20 |
| Nvidia | Llama 3.2 11B Vision Instruct | llama-v3p2-11b-vision-instruct | $0.20 | $0.20 |
| Azure | Llama 3.2 90B Vision Instruct | llama-v3p2-90b-vision-instruct | $0.90 | $0.90 |
| Meta | Llama 3.2 1B Instruct | llama-v3p2-1b-instruct | $0.10 | $0.10 |
| DeepSeek | DeepSeek-V2.5 | deepseek-v2p5 | $1.20 | $1.20 |
| Mistral | Mistral NeMo Instruct | mistral-nemo-instruct-2407 | $0.20 | $0.20 |
| Google | Gemma 2 9B | gemma2-9b-it | $0.20 | $0.20 |
| Nvidia | CodeGemma 7B | codegemma-7b | $0.20 | $0.20 |
| Mistral | Mistral 7B Instruct v0.3 | mistral-7b-instruct-v3 | $0.20 | $0.20 |
| Microsoft | Phi-3.5 Vision Instruct | phi-3-vision-128k-instruct | $0.20 | $0.20 |
| Microsoft | Phi-3 Mini 128K Instruct | phi-3-mini-128k-instruct | $0.10 | $0.10 |
| Google | Gemma 2B | gemma-2b-it | $0.10 | $0.10 |
| Meta | Llama Guard 2 8B | llama-guard-2-8b | $0.20 | $0.20 |
| Meta | Llama 3 8B Instruct | llama-v3-8b-instruct | $0.20 | $0.20 |
| Meta | Llama 3 70B Instruct | llama-v3-70b-instruct | $0.90 | $0.90 |
| Mistral | Mixtral 8x22B Instruct | mixtral-8x22b-instruct | $1.20 | $1.20 |
| Mistral | Mixtral 8x22B (base) | mixtral-8x22b | $1.20 | $1.20 |
| NousResearch | Nous Hermes 2 Mixtral 8x7B DPO | nous-hermes-2-mixtral-8x7b-dpo | $0.50 | $0.50 |
| Meta | CodeLlama 70B Instruct | code-llama-70b-instruct | $0.90 | $0.90 |
| Meta | CodeLlama 34B Instruct | code-llama-34b-instruct | $0.90 | $0.90 |
| Nvidia | CodeLlama 70B | code-llama-70b | $0.90 | $0.90 |
| NousResearch | OpenHermes 2.5 Mistral 7B | openhermes-2p5-mistral-7b | $0.20 | $0.20 |
| Mistral | Mistral 7B Instruct v0.2 | mistral-7b-instruct-v0p2 | $0.20 | $0.20 |
| Alibaba | Qwen1.5 72B Chat | qwen1p5-72b-chat | $0.90 | $0.90 |
| Google | Gemma 7B | gemma-7b-it | $0.20 | $0.20 |
| Mistral | Mixtral 8x7B Instruct | mixtral-8x7b-instruct | $0.50 | $0.50 |
| Meta | Llama 2 7B Chat | llama-v2-7b-chat | $0.20 | $0.20 |
| Meta | Llama 2 13B Chat | llama-v2-13b-chat | $0.20 | $0.20 |
| WandB | NVIDIA Nemotron 3 Super 120B | nvidia-nemotron-3-super-120b-a12b-fp8 | $0.90 | $0.90 |
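Since all prices above are quoted per 1M tokens, comparing models for a given workload is a one-line calculation: tokens / 1,000,000 times the per-1M rate, summed over input and output. A quick sketch, with a few prices hard-coded from rows of the table above:

```python
# Prices in USD per 1M tokens, copied from the table above.
PRICES = {
    "gpt-oss-120b": (0.15, 0.60),
    "glm-4p6": (0.55, 2.19),
    "deepseek-r1-0528": (1.35, 5.40),
}


def cost_usd(slug: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of a workload: tokens / 1M * price per 1M."""
    input_price, output_price = PRICES[slug]
    return input_tokens / 1e6 * input_price + output_tokens / 1e6 * output_price


# Example: 200k input + 50k output tokens on each model.
for slug in PRICES:
    print(f"{slug}: ${cost_usd(slug, 200_000, 50_000):.4f}")
```

For instance, 200k input plus 50k output tokens on GPT OSS 120B comes to $0.03 + $0.03 = $0.06; note that models with asymmetric pricing (e.g. DeepSeek-R1-0528 at $1.35/$5.40) can be far more sensitive to output length than the input rate alone suggests.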