Together AI
Together AI is an AI Acceleration Cloud platform that provides API access to more than 200 open-source large language models, including Meta's Llama family, Google's Gemma, Mistral, and Qwen. The platform removes the need to manage infrastructure and offers fine-tuning for customizing models with your own data. It is aimed at developers and enterprises that need fast, low-cost, scalable inference without running their own infrastructure.
Browse the 119 models available from Together AI and compare prices and features.
Models (119)
| Organization | Model Name | Original Model | Input | Output | Free |
|---|---|---|---|---|---|
| qwen | Qwen3.5-397B-A17B | Qwen/Qwen3.5-397B-A17B | $0.60 | $3.60 | |
| Moonshot AI | Kimi K2.5 | moonshotai/Kimi-K2.5 | $0.50 | $2.80 | |
| Z.ai | GLM-4.7 | zai-org/GLM-4.7 | $0.45 | $2.00 | |
| qwen | Qwen3-235B-A22B-Thinking-2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | $0.65 | $3.00 | |
| Z.ai | GLM-4.6 | zai-org/GLM-4.6 | $0.60 | $2.20 | |
| DeepSeek | DeepSeek-R1-0528 | deepseek-ai/DeepSeek-R1-0528 | $0.00 | $0.00 | |
| Minimax | MiniMax M2.1 | MiniMaxAI/MiniMax-M2.1 | $0.00 | $0.00 | |
| OpenAI | GPT OSS 120B | openai/gpt-oss-120b | $0.15 | $0.60 | |
| DeepSeek | DeepSeek-V3.2-Exp | deepseek-ai/DeepSeek-V3.2-Exp | $0.00 | $0.00 | |
| Minimax | MiniMax M2 | MiniMaxAI/MiniMax-M2 | $0.00 | $0.00 | |
| qwen | Qwen3-Next-80B-A3B-Thinking | Qwen/Qwen3-Next-80B-A3B-Thinking | $0.15 | $1.50 | |
| Moonshot AI | Kimi K2 Instruct | moonshotai/Kimi-K2-Instruct | $1.00 | $3.00 | |
| Moonshot AI | Kimi K2-Instruct-0905 | moonshotai/Kimi-K2-Instruct-0905 | $1.00 | $3.00 | |
| DeepSeek | DeepSeek-V3.1 | deepseek-ai/DeepSeek-V3-1 | $0.60 | $1.70 | |
| DeepSeek | DeepSeek-V3.1 | deepseek-ai/DeepSeek-V3.1 | $0.60 | $1.70 | |
| qwen | Qwen3-Next-80B-A3B-Instruct | Qwen/Qwen3-Next-80B-A3B-Instruct | $0.15 | $1.50 | |
| OpenAI | GPT OSS 20B | openai/gpt-oss-20b | $0.05 | $0.20 | |
| Minimax | MiniMax M1 80K | MiniMaxAI/MiniMax-M1-80k | $0.00 | $0.00 | |
| Minimax | MiniMax M1 40K | MiniMaxAI/MiniMax-M1-40k | $0.00 | $0.00 | |
| qwen | Qwen3 VL 32B Instruct | Qwen/Qwen3-VL-32B-Instruct | $0.50 | $1.50 | |
| DeepSeek | DeepSeek-V3 0324 | deepseek-ai/DeepSeek-V3-0324 | $0.00 | $0.00 | |
| Mistral | Magistral Small 2506 | mistralai/Magistral-Small-2506 | $0.00 | $0.00 | |
| Nvidia | Llama-3.3 Nemotron Super 49B v1 | nim/nvidia/llama-3.3-nemotron-super-49b-v1 | $0.00 | $0.00 | |
| qwen | Qwen3 30B A3B | Qwen/Qwen3-30B-A3B | $0.00 | $0.00 | |
| DeepSeek | DeepSeek R1 Distill Llama 70B | deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $2.00 | $2.00 | |
| qwen | QwQ-32B-Preview | Qwen/QwQ-32B-Preview | $0.00 | $0.00 | |
| qwen | QwQ-32B | Qwen/QwQ-32B | $0.00 | $0.00 | |
| Nvidia | Nemotron Nano 9B v2 | nvidia/NVIDIA-Nemotron-Nano-9B-v2 | $0.06 | $0.25 | |
| DeepSeek | DeepSeek-V3 | deepseek-ai/DeepSeek-V3 | $1.25 | $1.25 | |
| DeepSeek | DeepSeek R1 Distill Qwen 14B | deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | $0.00 | $0.00 | |
| Meta | Llama 3.1 405B Instruct | meta-llama/Llama-3.1-405B-Instruct | $3.50 | $3.50 | |
| Meta | Llama 3.3 70B Instruct | meta-llama/Llama-3.3-70B-Instruct | $0.00 | $0.00 | |
| Meta | Llama 3.3 70B Instruct | nim/meta/llama-3.3-70b-instruct | $0.00 | $0.00 | |
| qwen | Qwen2.5 32B Instruct | Qwen/Qwen2.5-32B-Instruct | $0.00 | $0.00 | |
| DeepSeek | DeepSeek R1 Distill Qwen 7B | deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | $0.00 | $0.00 | |
| qwen | Qwen2.5 72B Instruct | Qwen/Qwen2.5-72B-Instruct | $1.20 | $1.20 | |
| qwen | Qwen2.5 14B Instruct | Qwen/Qwen2.5-14B-Instruct | $0.80 | $0.80 | |
| Mistral | Mistral Small 3 24B Instruct | mistralai/Mistral-Small-24B-Instruct-2501 | $0.10 | $0.30 | |
| qwen | Qwen2 72B Instruct | Qwen/Qwen2-72B-Instruct | $0.00 | $0.00 | |
| | Gemma 3 27B | google/gemma-3-27b-it | $0.00 | $0.00 | |
| Meta | Llama 3.1 70B Instruct | meta-llama/Meta-Llama-3.1-70B-Instruct | $0.00 | $0.00 | |
| Meta | Llama 3.1 70B Instruct | nim/meta/llama-3.1-70b-instruct | $0.00 | $0.00 | |
| | Gemma 3 12B | google/gemma-3-12b-it | $0.00 | $0.00 | |
| qwen | Qwen2.5 7B Instruct | Qwen/Qwen2.5-7B-Instruct | $0.00 | $0.00 | |
| DeepSeek | DeepSeek R1 Distill Qwen 1.5B | deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | $0.00 | $0.00 | |
| Meta | Llama 3.2 3B Instruct | meta-llama/Llama-3.2-3B-Instruct | $0.00 | $0.00 | |
| | Gemma 3 4B | google/gemma-3-4b-it | $0.00 | $0.00 | |
| Meta | Llama 3.1 8B Instruct | nim/meta/llama-3.1-8b-instruct | $0.00 | $0.00 | |
| Meta | Llama 3.1 8B Instruct | meta-llama/Meta-Llama-3.1-8B-Instruct | $0.00 | $0.00 | |
| Meta | Llama 3.1 8B Instruct | meta-llama/Llama-3.1-8B-Instruct | $0.00 | $0.00 | |
| | Gemma 3n E4B Instructed | google/gemma-3n-E4B-it | $0.02 | $0.04 | |
| | Gemma 3 1B | google/gemma-3-1b-it | $0.00 | $0.00 | |
| Moonshot AI | Kimi K2 Thinking | moonshotai/Kimi-K2-Thinking | $1.20 | $4.00 | |
| DeepSeek | DeepSeek-R1 | deepseek-ai/DeepSeek-R1 | $3.00 | $7.00 | |
| Black Forest Labs | Flux 2 Flex | black-forest-labs/FLUX.2-flex | $0.00 | $0.00 | |
| qwen | Qwen Image | Qwen/Qwen-Image | $0.00 | $0.00 | |
| Minimax | Hailuo 02 | minimax/hailuo-02 | $0.00 | $0.00 | |
| Azure | Llama 4 Maverick 17B 128E Instruct FP8 | meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | $0.27 | $0.85 | |
| Groq | Llama Guard 4 12B | meta-llama/Llama-Guard-4-12B | $0.20 | $0.20 | |
| Meta | LlamaGuard 2 8B | meta-llama/LlamaGuard-2-8b | $0.20 | $0.20 | |
| Meta | llama-4-scout-17b-16e-instruct | meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.18 | $0.59 | |
| Arcee AI | Trinity Mini (free) | arcee-ai/trinity-mini | $0.05 | $0.15 | |
| Black Forest Labs | flux-1-kontext-max | black-forest-labs/FLUX.1-kontext-max | $0.00 | $0.00 | |
| Mistral | Mistral 7B Instruct v0.3 | mistralai/Mistral-7B-Instruct-v0.3 | $0.20 | $0.20 | |
| Mistral | MiniStral 3 (14B Instruct 2512) | mistralai/Ministral-3-14B-Instruct-2512 | $0.00 | $0.00 | |
| Black Forest Labs | Flux 1.1 Pro | black-forest-labs/FLUX.1.1-pro | $0.00 | $0.00 | |
| Black Forest Labs | flux-1-kontext-pro | black-forest-labs/FLUX.1-kontext-pro | $0.00 | $0.00 | |
| OpenAI | Sora 2 | openai/sora-2 | $0.00 | $0.00 | |
| OpenAI | Sora 2 Pro | openai/sora-2-pro | $0.00 | $0.00 | |
| | Gemini 3 Pro Image | google/gemini-3-pro-image | $0.00 | $0.00 | |
| Black Forest Labs | Flux 2 Pro | black-forest-labs/FLUX.2-pro | $0.00 | $0.00 | |
| Black Forest Labs | flux-2-dev | black-forest-labs/FLUX.2-dev | $0.00 | $0.00 | |
| Mistral | mixtral-8x7b-instruct-v0.1 | mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.60 | $0.60 | |
| Nvidia | Whisper Large v3 | openai/whisper-large-v3 | $0.27 | $0.85 | |
| qwen | Qwen3 VL 8B Instruct | Qwen/Qwen3-VL-8B-Instruct | $0.18 | $0.68 | |
| Meta | llama-3.2-1b-instruct | meta-llama/Llama-3.2-1B-Instruct | $0.06 | $0.06 | |
| Meta | llama-3-8b-instruct | meta-llama/Meta-Llama-3-8B-Instruct | $0.20 | $0.20 | |
| Mistral | mistral-7b-instruct-v0.2 | mistralai/Mistral-7B-Instruct-v0.2 | $0.00 | $0.00 | |
| Black Forest Labs | FLUX.1-schnell | black-forest-labs/FLUX.1-schnell | $0.00 | $0.00 | |
| qwen | Qwen2.5-7B-Instruct-Turbo | Qwen/Qwen2.5-7B-Instruct-Turbo | $0.30 | $0.30 | |
| Alibaba | wan2.6-image | Wan-AI/Wan2.6-image | $0.00 | $0.00 | |
| Alibaba | Qwen3 14B | Qwen/Qwen3-14B | $0.00 | $0.00 | |
| Mistral | mixtral-8x22b-instruct-v0.1 | mistralai/Mixtral-8x22B-Instruct-v0.1 | $0.00 | $0.00 | |
| Meta | Llama 3 8B (Base) | meta-llama/Meta-Llama-3-8B | $0.00 | $0.00 | |
| Nvidia | Llama-3.1-Nemotron-70B-Instruct-HF | nvidia/Llama-3.1-Nemotron-70B-Instruct-HF | $0.00 | $0.00 | |
| Nvidia | Llama 3.1 Nemotron 70B Instruct | nim/nvidia/llama-3.1-nemotron-70b-instruct | $0.00 | $0.00 | |
| DeepSeek | deepseek-v3.1-terminus | deepseek-ai/DeepSeek-V3.1-Terminus | $0.00 | $0.00 | |
| | gemma-2b-it | google/gemma-2b-it | $0.00 | $0.00 | |
| | Gemma 2 9B | google/gemma-2-9b-it | $0.00 | $0.00 | |
| Nvidia | Llama 3.2 11b Vision Instruct | nim/meta/llama-3.2-11b-vision-instruct | $0.00 | $0.00 | |
| qwen | Qwen3 0.6B | Qwen/Qwen3-0.6B | $0.00 | $0.00 | |
| NousResearch | nous-hermes-2-mixtral-8x7b-dpo | NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO | $0.00 | $0.00 | |
| | Gemma 2 27B | google/gemma-2-27b-it | $0.00 | $0.00 | |
| Alibaba | Qwen3-Coder 30B-A3B Instruct | Qwen/Qwen3-Coder-30B-A3B-Instruct | $0.00 | $0.00 | |
| Mistral | Mistral 7B Instruct v0.1 | mistralai/Mistral-7B-Instruct-v0.1 | $0.00 | $0.00 | |
| qwen | Qwen3 1.7B | Qwen/Qwen3-1.7B | $0.00 | $0.00 | |
| DeepSeek | DeepSeek V3.1 Base | deepseek-ai/DeepSeek-V3.1-Base | $0.00 | $0.00 | |
| DeepSeek | DeepSeek V3 Base | deepseek-ai/DeepSeek-V3-Base | $0.00 | $0.00 | |
| Mistral | mixtral-8x7b-instruct-v0.1 | nim/mistralai/mixtral-8x7b-instruct-v01 | $0.00 | $0.00 | |
| Meta | Llama 3.1 405B (base) | meta-llama/Llama-3.1-405B | $0.00 | $0.00 | |
| Mistral | mixtral-8x22b-instruct-v0.1 | nim/mistralai/mixtral-8x22b-instruct-v01 | $0.00 | $0.00 | |
| qwen | Qwen2.5-Coder 32B Instruct | Qwen/Qwen2.5-Coder-32B-Instruct | $0.00 | $0.00 | |
| Azure | Llama-3.2-90B-Vision-Instruct | nim/meta/llama-3.2-90b-vision-instruct | $0.00 | $0.00 | |
| Allen Institute for AI | molmo-7b-d-0924 | allenai/Molmo-7B-D-0924 | $0.00 | $0.00 | |
| Mistral | Devstral Small 2505 | mistralai/Devstral-Small-2505 | $0.00 | $0.00 | |
| Upstage | SOLAR-10.7B-Instruct-v1.0 | upstage/SOLAR-10.7B-Instruct-v1.0 | $0.00 | $0.00 | |
| qwen | Qwen3 32B | Qwen/Qwen3-32B | $0.00 | $0.00 | |
| Alibaba | Qwen3 8B | Qwen/Qwen3-8B | $0.00 | $0.00 | |
| Z.ai | GLM-4.5V | zai-org/GLM-4.5V | $0.00 | $0.00 | |
| qwen | Qwen3 4B (free) | Qwen/Qwen3-4B | $0.00 | $0.00 | |
| qwen | Qwen2.5-72B-Instruct-Turbo | Qwen/Qwen2.5-72B-Instruct-Turbo | $0.00 | $0.00 | |
| Alibaba | qwen3-30b-a3b-instruct-2507 | Qwen/Qwen3-30B-A3B-Instruct-2507 | $0.00 | $0.00 | |
| Nvidia | nvidia-nemotron-3-nano-30b-a3b-bf16 | nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 | $0.00 | $0.00 | |
| DeepSeek | deepseek-v3.2-thinking | deepseek-ai/DeepSeek-V3.2 | $0.00 | $0.00 | |
| Alibaba | qwen2.5-vl-72b-instruct | Qwen/Qwen2.5-VL-72B-Instruct | $0.00 | $0.00 | |
| Z.ai | GLM-5 | zai-org/GLM-5 | $1.00 | $3.20 | |
| Minimax | MiniMax M2.5 | MiniMaxAI/MiniMax-M2.5 | $0.30 | $1.20 | |
| Arcee AI | Trinity Large Preview (free) | arcee-ai/trinity-large-preview | $0.00 | $0.00 | |
| Liquid AI | LFM2-24B-A2B | LiquidAI/LFM2-24B-A2B | $0.03 | $0.12 | |
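
To compare models on price, it can help to turn the Input and Output columns into an estimated cost for a given workload. The sketch below is illustrative only. The table does not state its pricing unit, so the code assumes the figures are US dollars per 1 million tokens; `PRICES`, `TOKENS_PER_UNIT`, and `estimate_cost` are hypothetical names introduced here, and only three rows from the table are copied in.

```python
from decimal import Decimal

# (input price, output price) in USD, copied from three rows of the table.
# ASSUMPTION: prices are quoted per 1M tokens; the table does not say so.
PRICES = {
    "openai/gpt-oss-120b": (Decimal("0.15"), Decimal("0.60")),
    "zai-org/GLM-4.7": (Decimal("0.45"), Decimal("2.00")),
    "LiquidAI/LFM2-24B-A2B": (Decimal("0.03"), Decimal("0.12")),
}

# Assumed pricing unit: one million tokens.
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of a workload under the assumed unit."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Compare the same workload (2M input tokens, 500k output tokens) across models.
    for model in PRICES:
        print(model, estimate_cost(model, 2_000_000, 500_000))
```

`Decimal` is used instead of `float` so that small per-token prices add up without binary rounding error. If the listing turns out to use a different unit, only `TOKENS_PER_UNIT` needs to change.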