# Together AI

Together AI is an AI Acceleration Cloud platform that provides API access to over 200 open-source large language models including Meta's Llama family, Google's Gemma, Mistral, Qwen, and many more. The platform eliminates the need for infrastructure management while offering fine-tuning capabilities to customize models with your own data. Together AI delivers blazing fast inference at low cost, making professional-grade AI accessible to developers and enterprises who need scalable, cost-effective AI model deployment without the complexity of managing their own infrastructure.

## Provider Information

- **Website**: <https://www.together.ai/>
- **Available Models**: 119

## Models

| Name | Original Name | $ Input Price (per 1M) | $ Output Price (per 1M) | Free | Link |
|------|---------------|---------------------|----------------------|------|------|
| Kimi K2 Instruct | moonshotai/Kimi-K2-Instruct | 1.00 | 3.00 |  |  |
| Kimi K2 Thinking | moonshotai/Kimi-K2-Thinking | 1.20 | 4.00 |  |  |
| OpenAI GPT-OSS 120B | openai/gpt-oss-120b | 0.15 | 0.60 |  |  |
| Glm 4.6 Fp8 | zai-org/GLM-4.6 | 0.60 | 2.20 |  |  |
| DeepSeek R1-0528 | deepseek-ai/DeepSeek-R1 | 3.00 | 7.00 |  |  |
| DeepSeek V3 | deepseek-ai/DeepSeek-V3 | 1.25 | 1.25 |  |  |
| DeepSeek V3.1 | deepseek-ai/DeepSeek-V3-1 | 0.60 | 1.70 |  |  |
| FLUX.2 [flex] | black-forest-labs/FLUX.2-flex | 0.00 | 0.00 |  |  |
| Qwen Image | Qwen/Qwen-Image | 0.00 | 0.00 |  |  |
| MiniMax Hailuo 02 | minimax/hailuo-02 | 0.00 | 0.00 |  |  |
| Llama 4 Maverick Instruct (17Bx128E) | meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | 0.27 | 0.85 |  |  |
| Llama Guard 4 12B | meta-llama/Llama-Guard-4-12B | 0.20 | 0.20 |  |  |
| Meta Llama Guard 2 8B | meta-llama/LlamaGuard-2-8b | 0.20 | 0.20 |  |  |
| Llama 4 Scout Instruct (17Bx16E) | meta-llama/Llama-4-Scout-17B-16E-Instruct | 0.18 | 0.59 |  |  |
| Trinity Mini | arcee-ai/trinity-mini | 0.05 | 0.15 |  |  |
| FLUX.1 Kontext [max] | black-forest-labs/FLUX.1-kontext-max | 0.00 | 0.00 |  |  |
| Mistral (7B) Instruct v0.3 | mistralai/Mistral-7B-Instruct-v0.3 | 0.20 | 0.20 |  |  |
| Ministral 3 14B Instruct 2512 | mistralai/Ministral-3-14B-Instruct-2512 | 0.00 | 0.00 |  |  |
| Qwen3 Next 80B A3b Thinking | Qwen/Qwen3-Next-80B-A3B-Thinking | 0.15 | 1.50 |  |  |
| Gemma 3N E4B Instruct | google/gemma-3n-E4B-it | 0.02 | 0.04 |  |  |
| FLUX1.1 [pro] | black-forest-labs/FLUX.1.1-pro | 0.00 | 0.00 |  |  |
| GLM 4.7 Fp8 | zai-org/GLM-4.7 | 0.45 | 2.00 |  |  |
| FLUX.1 Kontext [pro] | black-forest-labs/FLUX.1-kontext-pro | 0.00 | 0.00 |  |  |
| Sora 2 | openai/sora-2 | 0.00 | 0.00 |  |  |
| Sora 2 Pro | openai/sora-2-pro | 0.00 | 0.00 |  |  |
| Mistral Small (24B) Instruct 25.01 | mistralai/Mistral-Small-24B-Instruct-2501 | 0.10 | 0.30 |  |  |
| Gemini 3 (Nano Banana 2 Pro) | google/gemini-3-pro-image | 0.00 | 0.00 |  |  |
| FLUX.2 [pro] | black-forest-labs/FLUX.2-pro | 0.00 | 0.00 |  |  |
| FLUX.2 [dev] | black-forest-labs/FLUX.2-dev | 0.00 | 0.00 |  |  |
| Qwen3 Next 80B A3b Instruct | Qwen/Qwen3-Next-80B-A3B-Instruct | 0.15 | 1.50 |  |  |
| Qwen3-VL-32B-Instruct | Qwen/Qwen3-VL-32B-Instruct | 0.50 | 1.50 |  |  |
| Nvidia Nemotron Nano 9B V2 | nvidia/NVIDIA-Nemotron-Nano-9B-v2 | 0.06 | 0.25 |  |  |
| Kimi K2-Instruct 0905 | moonshotai/Kimi-K2-Instruct-0905 | 1.00 | 3.00 |  |  |
| Mixtral-8x7B Instruct v0.1 | mistralai/Mixtral-8x7B-Instruct-v0.1 | 0.60 | 0.60 |  |  |
| Whisper large-v3 | openai/whisper-large-v3 | 0.27 | 0.85 |  |  |
| Qwen3-VL-8B-Instruct | Qwen/Qwen3-VL-8B-Instruct | 0.18 | 0.68 |  |  |
| Meta Llama 3.1 405B Instruct | meta-llama/Llama-3.1-405B-Instruct | 3.50 | 3.50 |  |  |
| Qwen3 235B A22B Thinking 2507 FP8 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 0.65 | 3.00 |  |  |
| Meta Llama 3.2 1B Instruct | meta-llama/Llama-3.2-1B-Instruct | 0.06 | 0.06 |  |  |
| Qwen 2.5 14B Instruct | Qwen/Qwen2.5-14B-Instruct | 0.80 | 0.80 |  |  |
| Meta Llama 3 8B Instruct | meta-llama/Meta-Llama-3-8B-Instruct | 0.20 | 0.20 |  |  |
| Mistral (7B) Instruct v0.2 | mistralai/Mistral-7B-Instruct-v0.2 | 0.00 | 0.00 |  |  |
| OpenAI GPT-OSS 20B | openai/gpt-oss-20b | 0.05 | 0.20 |  |  |
| Qwen2.5 72B Instruct | Qwen/Qwen2.5-72B-Instruct | 1.20 | 1.20 |  |  |
| DeepSeek R1 Distill Llama 70B | deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 2.00 | 2.00 |  |  |
| Deepseek V3.1 | deepseek-ai/DeepSeek-V3.1 | 0.60 | 1.70 |  |  |
| FLUX.1 Schnell | black-forest-labs/FLUX.1-schnell | 0.00 | 0.00 |  |  |
| Qwen2.5 7B Instruct Turbo | Qwen/Qwen2.5-7B-Instruct-Turbo | 0.30 | 0.30 |  |  |
| Kimi K2.5 | moonshotai/Kimi-K2.5 | 0.50 | 2.80 |  |  |
| Wan 2.6 Image | Wan-AI/Wan2.6-image | 0.00 | 0.00 |  |  |
| nim/nvidia/llama-3.3-nemotron-super-49b-v1 | nim/nvidia/llama-3.3-nemotron-super-49b-v1 | 0.00 | 0.00 |  |  |
| Qwen3 14B | Qwen/Qwen3-14B | 0.00 | 0.00 |  |  |
| nim/meta/llama-3.1-8b-instruct | nim/meta/llama-3.1-8b-instruct | 0.00 | 0.00 |  |  |
| Qwen2.5 7B Instruct | Qwen/Qwen2.5-7B-Instruct | 0.00 | 0.00 |  |  |
| Mixtral 8X22b Instruct V0.1 | mistralai/Mixtral-8x22B-Instruct-v0.1 | 0.00 | 0.00 |  |  |
| Meta Llama 3 8B | meta-llama/Meta-Llama-3-8B | 0.00 | 0.00 |  |  |
| Magistral Small 2506 | mistralai/Magistral-Small-2506 | 0.00 | 0.00 |  |  |
| Gemma 3 4b it | google/gemma-3-4b-it | 0.00 | 0.00 |  |  |
| Meta Llama 3.1 8B Instruct Reference | meta-llama/Meta-Llama-3.1-8B-Instruct | 0.00 | 0.00 |  |  |
| DeepSeek R1 Distill Qwen 1.5B | deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | 0.00 | 0.00 |  |  |
| Llama 3.1 Nemotron 70B Instruct HF | nvidia/Llama-3.1-Nemotron-70B-Instruct-HF | 0.00 | 0.00 |  |  |
| Qwen2 72B Instruct | Qwen/Qwen2-72B-Instruct | 0.00 | 0.00 |  |  |
| nim/nvidia/llama-3.1-nemotron-70b-instruct | nim/nvidia/llama-3.1-nemotron-70b-instruct | 0.00 | 0.00 |  |  |
| Deepseek V3.1 Terminus | deepseek-ai/DeepSeek-V3.1-Terminus | 0.00 | 0.00 |  |  |
| Gemma 2B It | google/gemma-2b-it | 0.00 | 0.00 |  |  |
| Gemma 2 9B It | google/gemma-2-9b-it | 0.00 | 0.00 |  |  |
| nim/meta/llama-3.2-11b-vision-instruct | nim/meta/llama-3.2-11b-vision-instruct | 0.00 | 0.00 |  |  |
| Qwen3 0.6B | Qwen/Qwen3-0.6B | 0.00 | 0.00 |  |  |
| Meta Llama 3.2 3B Instruct | meta-llama/Llama-3.2-3B-Instruct | 0.00 | 0.00 |  |  |
| Deepseek R1 0528 | deepseek-ai/DeepSeek-R1-0528 | 0.00 | 0.00 |  |  |
| meta-llama/Llama-3.3-70B-Instruct | meta-llama/Llama-3.3-70B-Instruct | 0.00 | 0.00 |  |  |
| nim/meta/llama-3.3-70b-instruct | nim/meta/llama-3.3-70b-instruct | 0.00 | 0.00 |  |  |
| Nous Hermes 2 Mixtral 8X7B Dpo | NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO | 0.00 | 0.00 |  |  |
| Gemma-2 Instruct (27B) | google/gemma-2-27b-it | 0.00 | 0.00 |  |  |
| Gemma 3 1b it | google/gemma-3-1b-it | 0.00 | 0.00 |  |  |
| Meta Llama 3.1 70B Instruct Turbo | meta-llama/Meta-Llama-3.1-70B-Instruct | 0.00 | 0.00 |  |  |
| Qwen3 Coder 30B A3b Instruct | Qwen/Qwen3-Coder-30B-A3B-Instruct | 0.00 | 0.00 |  |  |
| DeepSeek R1 Distill Qwen 7B | deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 0.00 | 0.00 |  |  |
| Mistral (7B) Instruct v0.1 | mistralai/Mistral-7B-Instruct-v0.1 | 0.00 | 0.00 |  |  |
| Qwen3 1.7B | Qwen/Qwen3-1.7B | 0.00 | 0.00 |  |  |
| Deepseek V3.1 Base | deepseek-ai/DeepSeek-V3.1-Base | 0.00 | 0.00 |  |  |
| Deepseek V3 Base | deepseek-ai/DeepSeek-V3-Base | 0.00 | 0.00 |  |  |
| Qwen QwQ-32B-Preview | Qwen/QwQ-32B-Preview | 0.00 | 0.00 |  |  |
| nim/meta/llama-3.1-70b-instruct | nim/meta/llama-3.1-70b-instruct | 0.00 | 0.00 |  |  |
| nim/mistralai/mixtral-8x7b-instruct-v01 | nim/mistralai/mixtral-8x7b-instruct-v01 | 0.00 | 0.00 |  |  |
| Llama 3.1 405B | meta-llama/Llama-3.1-405B | 0.00 | 0.00 |  |  |
| Minimax M1 40K | MiniMaxAI/MiniMax-M1-40k | 0.00 | 0.00 |  |  |
| Qwen3 30B A3b | Qwen/Qwen3-30B-A3B | 0.00 | 0.00 |  |  |
| nim/mistralai/mixtral-8x22b-instruct-v01 | nim/mistralai/mixtral-8x22b-instruct-v01 | 0.00 | 0.00 |  |  |
| MiniMax M2 | MiniMaxAI/MiniMax-M2 | 0.00 | 0.00 |  |  |
| DeepSeek R1 Distill Qwen 14B | deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | 0.00 | 0.00 |  |  |
| Qwen 2.5 Coder 32B Instruct | Qwen/Qwen2.5-Coder-32B-Instruct | 0.00 | 0.00 |  |  |
| Minimax M1 80K | MiniMaxAI/MiniMax-M1-80k | 0.00 | 0.00 |  |  |
| nim/meta/llama-3.2-90b-vision-instruct | nim/meta/llama-3.2-90b-vision-instruct | 0.00 | 0.00 |  |  |
| Molmo 7B D 0924 | allenai/Molmo-7B-D-0924 | 0.00 | 0.00 |  |  |
| Devstral Small 2505 | mistralai/Devstral-Small-2505 | 0.00 | 0.00 |  |  |
| Upstage SOLAR Instruct v1 (11B) | upstage/SOLAR-10.7B-Instruct-v1.0 | 0.00 | 0.00 |  |  |
| Qwen2.5 32B Instruct | Qwen/Qwen2.5-32B-Instruct | 0.00 | 0.00 |  |  |
| Qwen3 32B | Qwen/Qwen3-32B | 0.00 | 0.00 |  |  |
| Qwen3 8B | Qwen/Qwen3-8B | 0.00 | 0.00 |  |  |
| Glm 4.5V | zai-org/GLM-4.5V | 0.00 | 0.00 |  |  |
| Qwen QwQ-32B | Qwen/QwQ-32B | 0.00 | 0.00 |  |  |
|  | Qwen/Qwen3-4B | 0.00 | 0.00 |  |  |
| Gemma 3 27B It | google/gemma-3-27b-it | 0.00 | 0.00 |  |  |
| Deepseek V3 0324 | deepseek-ai/DeepSeek-V3-0324 | 0.00 | 0.00 |  |  |
| Qwen2.5 72B Instruct Turbo | Qwen/Qwen2.5-72B-Instruct-Turbo | 0.00 | 0.00 |  |  |
| Deepseek V3.2 Exp | deepseek-ai/DeepSeek-V3.2-Exp | 0.00 | 0.00 |  |  |
| Gemma 3 12B It | google/gemma-3-12b-it | 0.00 | 0.00 |  |  |
| Qwen3 30B A3b Instruct 2507 | Qwen/Qwen3-30B-A3B-Instruct-2507 | 0.00 | 0.00 |  |  |
| Nvidia Nemotron 3 Nano 30B A3b Bf16 | nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 | 0.00 | 0.00 |  |  |
| Deepseek V3.2 | deepseek-ai/DeepSeek-V3.2 | 0.00 | 0.00 |  |  |
| Minimax M2.1 | MiniMaxAI/MiniMax-M2.1 | 0.00 | 0.00 |  |  |
| Qwen2.5-VL (72B) Instruct | Qwen/Qwen2.5-VL-72B-Instruct | 0.00 | 0.00 |  |  |
| Llama 3.1 8B Instruct | meta-llama/Llama-3.1-8B-Instruct | 0.00 | 0.00 |  |  |
| GLM-5-FP4 | zai-org/GLM-5 | 1.00 | 3.20 |  |  |
| MiniMax M2.5 FP4 | MiniMaxAI/MiniMax-M2.5 | 0.30 | 1.20 |  |  |
| Qwen3.5 397B A17b | Qwen/Qwen3.5-397B-A17B | 0.60 | 3.60 |  |  |
| Trinity Large Preview | arcee-ai/trinity-large-preview | 0.00 | 0.00 |  |  |
| LFM2-24B-A2B | LiquidAI/LFM2-24B-A2B | 0.03 | 0.12 |  |  |

---

[← Back to all providers](/llm.txt)