Chutes.ai
chutes
Updated 25 minutes ago
Chutes.ai is an AI inference platform that provides access to a wide variety of open-source large language models through an OpenAI-compatible API. The platform offers models from leading providers including Qwen, DeepSeek, Mistral, Google, NousResearch, Meta, and others. Chutes.ai features transparent pricing per 1M tokens, context length information, and model capabilities including JSON mode, tools/function calling, structured outputs, and reasoning. The platform supports both standard and confidential compute variants of models.
Browse 25 LLM models available from Chutes.ai. Compare prices and features.
Models (25)
| Organization | Model Name | Original Model | Input | Output | Free | |||
|---|---|---|---|---|---|---|---|---|
|
|
qwen | Qwen3-235B-A22B-Thinking-2507 |
Qwen/Qwen3-235B-A22B-Thinking-2507
|
$0.11 | $0.60 | |||
|
|
qwen | Qwen3-Next-80B-A3B-Instruct |
Qwen/Qwen3-Next-80B-A3B-Instruct
|
$0.10 | $0.80 | |||
|
|
OpenAI | GPT OSS 20B |
openai/gpt-oss-20b
|
$0.04 | $0.15 | |||
|
|
qwen | Qwen3 30B A3B |
Qwen/Qwen3-30B-A3B
|
$0.06 | $0.22 | |||
|
|
DeepSeek | DeepSeek R1 Distill Llama 70B |
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
|
$0.03 | $0.11 | |||
|
|
DeepSeek | DeepSeek-V3 |
deepseek-ai/DeepSeek-V3
|
$0.30 | $1.20 | |||
|
|
qwen | Qwen2.5 72B Instruct |
Qwen/Qwen2.5-72B-Instruct
|
$0.30 | $1.20 | |||
|
|
Mistral | Mistral Small 3.2 24B Instruct |
chutesai/Mistral-Small-3.2-24B-Instruct-2506
|
$0.06 | $0.18 | |||
|
|
Mistral | Mistral Small 3.1 24B Instruct |
chutesai/Mistral-Small-3.1-24B-Instruct-2503
|
$0.03 | $0.11 | |||
|
|
Mistral | Mistral Small 3 24B Instruct |
unsloth/Mistral-Small-24B-Instruct-2501
|
$0.07 | $0.30 | |||
|
|
Gemma 3 27B |
unsloth/gemma-3-27b-it
|
$0.04 | $0.15 | ||||
|
|
Gemma 3 12B |
unsloth/gemma-3-12b-it
|
$0.03 | $0.10 | ||||
|
|
Meta | Llama 3.2 3B Instruct |
unsloth/Llama-3.2-3B-Instruct
|
$0.01 | $0.01 | |||
|
|
Gemma 3 4B |
unsloth/gemma-3-4b-it
|
$0.01 | $0.03 | ||||
|
|
qwen | Qwen3 32B |
Qwen/Qwen3-32B
|
$0.08 | $0.24 | |||
|
|
Mistral | Mistral NeMo Instruct |
unsloth/Mistral-Nemo-Instruct-2407
|
$0.02 | $0.04 | |||
|
|
Alibaba | Qwen3 14B |
Qwen/Qwen3-14B
|
$0.05 | $0.22 | |||
|
|
qwen | Qwen2.5-Coder 32B Instruct |
Qwen/Qwen2.5-Coder-32B-Instruct
|
$0.03 | $0.11 | |||
|
|
qwen | Qwen3 VL 235B A22B Instruct |
Qwen/Qwen3-VL-235B-A22B-Instruct
|
$0.30 | $1.20 | |||
|
|
Z.ai | glm-4.6v |
zai-org/GLM-4.6V
|
$0.30 | $0.90 | |||
|
|
Alibaba | qwen2.5-vl-32b-instruct |
Qwen/Qwen2.5-VL-32B-Instruct
|
$0.05 | $0.22 | |||
|
|
Nvidia | nvidia-nemotron-3-nano-30b-a3b-bf16 |
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
|
$0.06 | $0.24 | |||
|
|
Z.ai | GLM-4.7 |
zai-org/GLM-4.7-FP8
|
$0.30 | $1.20 | |||
|
|
Meta | llama-3.2-1b-instruct |
unsloth/Llama-3.2-1B-Instruct
|
$0.01 | $0.01 | |||
|
|
NousResearch | DeepHermes 3 Mistral 24B Preview |
NousResearch/DeepHermes-3-Mistral-24B-Preview
|
$0.02 | $0.10 |