# Kilo Code

Kilo Code provides access to a wide range of AI models through its unified API, including free models from providers such as MiniMax, Z.ai (GLM), and MoonshotAI. The platform offers models optimized for coding, reasoning, and agentic workflows, with features such as image support, prompt caching, and tool integration.

## Provider Information

- **Website**: <https://kilo.ai/>
- **Available Models**: 317

## Models

| Name | Model ID | Input Price ($ per 1M tokens) | Output Price ($ per 1M tokens) | Free | Link |
|------|----------|-------------------------------|--------------------------------|------|------|
| Arcee AI: Trinity Large Preview (free) | arcee-ai/trinity-large-preview:free | 0.00 | 0.00 | Yes |  |
| Anthropic: Claude Opus 4.5 | anthropic/claude-opus-4.5 | 5.00 | 25.00 |  |  |
| Anthropic: Claude Sonnet 4.5 | anthropic/claude-sonnet-4.5 | 3.00 | 15.00 |  |  |
| Anthropic: Claude Haiku 4.5 | anthropic/claude-haiku-4.5 | 1.00 | 5.00 |  |  |
| OpenAI: GPT-5.2 | openai/gpt-5.2 | 1.75 | 14.00 |  |  |
| OpenAI: GPT-5.2-Codex | openai/gpt-5.2-codex | 1.75 | 14.00 |  |  |
| Google: Gemini 3 Pro Preview | google/gemini-3-pro-preview | 2.00 | 12.00 |  |  |
| Google: Gemini 3 Flash Preview | google/gemini-3-flash-preview | 0.50 | 3.00 |  |  |
| xAI: Grok Code Fast 1 | x-ai/grok-code-fast-1 | 0.20 | 1.50 |  |  |
| MoonshotAI: Kimi K2.5 | moonshotai/kimi-k2.5 | 0.23 | 3.00 |  |  |
| MiniMax: MiniMax M2-her | minimax/minimax-m2-her | 0.30 | 1.20 |  | [View](https://kilo.ai/models/minimax/minimax-m2-her) |
| OpenAI: GPT Audio | openai/gpt-audio | 2.50 | 10.00 |  | [View](https://kilo.ai/models/openai/gpt-audio) |
| OpenAI: GPT Audio Mini | openai/gpt-audio-mini | 0.60 | 2.40 |  | [View](https://kilo.ai/models/openai/gpt-audio-mini) |
| Z.ai: GLM 4.7 Flash | z-ai/glm-4.7-flash | 0.06 | 0.40 |  |  |
| AllenAI: Olmo 3.1 32B Instruct | allenai/olmo-3.1-32b-instruct | 0.20 | 0.60 |  | [View](https://kilo.ai/models/allenai/olmo-3.1-32b-instruct) |
| ByteDance Seed: Seed 1.6 Flash | bytedance-seed/seed-1.6-flash | 0.08 | 0.30 |  | [View](https://kilo.ai/models/bytedance-seed/seed-1.6-flash) |
| ByteDance Seed: Seed 1.6 | bytedance-seed/seed-1.6 | 0.25 | 2.00 |  |  |
| MiniMax: MiniMax M2.1 | minimax/minimax-m2.1 | 0.27 | 0.95 |  |  |
| Z.ai: GLM 4.7 | z-ai/glm-4.7 | 0.40 | 1.50 |  |  |
| Mistral: Mistral Small Creative | mistralai/mistral-small-creative | 0.10 | 0.30 |  | [View](https://kilo.ai/models/mistralai/mistral-small-creative) |
| AllenAI: Olmo 3.1 32B Think | allenai/olmo-3.1-32b-think | 0.15 | 0.50 |  | [View](https://kilo.ai/models/allenai/olmo-3.1-32b-think) |
| Xiaomi: MiMo-V2-Flash | xiaomi/mimo-v2-flash | 0.09 | 0.29 |  |  |
| NVIDIA: Nemotron 3 Nano 30B A3B | nvidia/nemotron-3-nano-30b-a3b | 0.05 | 0.20 |  | [View](https://kilo.ai/models/nvidia/nemotron-3-nano-30b-a3b) |
| OpenAI: GPT-5.2 Chat | openai/gpt-5.2-chat | 1.75 | 14.00 |  |  |
| OpenAI: GPT-5.2 Pro | openai/gpt-5.2-pro | 21.00 | 168.00 |  |  |
| Mistral: Devstral 2 2512 | mistralai/devstral-2512 | 0.05 | 0.22 |  |  |
| Relace: Relace Search | relace/relace-search | 1.00 | 3.00 |  | [View](https://kilo.ai/models/relace/relace-search) |
| Z.ai: GLM 4.6V | z-ai/glm-4.6v | 0.30 | 0.90 |  |  |
| Nex AGI: DeepSeek V3.1 Nex N1 | nex-agi/deepseek-v3.1-nex-n1 | 0.27 | 1.00 |  |  |
| OpenAI: GPT-5.1-Codex-Max | openai/gpt-5.1-codex-max | 1.25 | 10.00 |  |  |
| Amazon: Nova 2 Lite | amazon/nova-2-lite-v1 | 0.30 | 2.50 |  |  |
| Mistral: Ministral 3 14B 2512 | mistralai/ministral-14b-2512 | 0.20 | 0.20 |  |  |
| Mistral: Ministral 3 8B 2512 | mistralai/ministral-8b-2512 | 0.15 | 0.15 |  | [View](https://kilo.ai/models/mistralai/ministral-8b-2512) |
| Mistral: Ministral 3 3B 2512 | mistralai/ministral-3b-2512 | 0.10 | 0.10 |  | [View](https://kilo.ai/models/mistralai/ministral-3b-2512) |
| Mistral: Mistral Large 3 2512 | mistralai/mistral-large-2512 | 0.50 | 1.50 |  |  |
| Arcee AI: Trinity Mini | arcee-ai/trinity-mini | 0.05 | 0.15 |  |  |
| DeepSeek: DeepSeek V3.2 Speciale | deepseek/deepseek-v3.2-speciale | 0.27 | 0.41 |  |  |
| DeepSeek: DeepSeek V3.2 | deepseek/deepseek-v3.2 | 0.25 | 0.38 |  |  |
| TNG: R1T Chimera | tngtech/tng-r1t-chimera | 0.25 | 0.85 |  |  |
| AllenAI: Olmo 3 32B Think | allenai/olmo-3-32b-think | 0.15 | 0.50 |  | [View](https://kilo.ai/models/allenai/olmo-3-32b-think) |
| AllenAI: Olmo 3 7B Instruct | allenai/olmo-3-7b-instruct | 0.10 | 0.20 |  | [View](https://kilo.ai/models/allenai/olmo-3-7b-instruct) |
| AllenAI: Olmo 3 7B Think | allenai/olmo-3-7b-think | 0.12 | 0.20 |  | [View](https://kilo.ai/models/allenai/olmo-3-7b-think) |
| Google: Nano Banana Pro (Gemini 3 Pro Image Preview) | google/gemini-3-pro-image-preview | 2.00 | 12.00 |  | [View](https://kilo.ai/models/google/gemini-3-pro-image-preview) |
| xAI: Grok 4.1 Fast | x-ai/grok-4.1-fast | 0.20 | 0.50 |  |  |
| OpenAI: GPT-5.1 | openai/gpt-5.1 | 1.25 | 10.00 |  |  |
| OpenAI: GPT-5.1 Chat | openai/gpt-5.1-chat | 1.25 | 10.00 |  |  |
| OpenAI: GPT-5.1-Codex | openai/gpt-5.1-codex | 1.25 | 10.00 |  |  |
| OpenAI: GPT-5.1-Codex-Mini | openai/gpt-5.1-codex-mini | 0.25 | 2.00 |  |  |
| MoonshotAI: Kimi K2 Thinking | moonshotai/kimi-k2-thinking | 0.40 | 1.75 |  |  |
| Amazon: Nova Premier 1.0 | amazon/nova-premier-v1 | 2.50 | 12.50 |  | [View](https://kilo.ai/models/amazon/nova-premier-v1) |
| Perplexity: Sonar Pro Search | perplexity/sonar-pro-search | 3.00 | 15.00 |  | [View](https://kilo.ai/models/perplexity/sonar-pro-search) |
| Mistral: Voxtral Small 24B 2507 | mistralai/voxtral-small-24b-2507 | 0.10 | 0.30 |  |  |
| OpenAI: gpt-oss-safeguard-20b | openai/gpt-oss-safeguard-20b | 0.08 | 0.30 |  |  |
| NVIDIA: Nemotron Nano 12B 2 VL | nvidia/nemotron-nano-12b-v2-vl | 0.20 | 0.60 |  | [View](https://kilo.ai/models/nvidia/nemotron-nano-12b-v2-vl) |
| MiniMax: MiniMax M2 | minimax/minimax-m2 | 0.26 | 1.00 |  |  |
| Qwen: Qwen3 VL 32B Instruct | qwen/qwen3-vl-32b-instruct | 0.10 | 0.42 |  |  |
| LiquidAI: LFM2-8B-A1B | liquid/lfm2-8b-a1b | 0.01 | 0.02 |  | [View](https://kilo.ai/models/liquid/lfm2-8b-a1b) |
| LiquidAI: LFM2-2.6B | liquid/lfm-2.2-6b | 0.01 | 0.02 |  | [View](https://kilo.ai/models/liquid/lfm-2.2-6b) |
| OpenAI: GPT-5 Image Mini | openai/gpt-5-image-mini | 2.50 | 2.00 |  | [View](https://kilo.ai/models/openai/gpt-5-image-mini) |
| Qwen: Qwen3 VL 8B Thinking | qwen/qwen3-vl-8b-thinking | 0.12 | 1.37 |  |  |
| Qwen: Qwen3 VL 8B Instruct | qwen/qwen3-vl-8b-instruct | 0.08 | 0.50 |  |  |
| OpenAI: GPT-5 Image | openai/gpt-5-image | 10.00 | 10.00 |  | [View](https://kilo.ai/models/openai/gpt-5-image) |
| OpenAI: o3 Deep Research | openai/o3-deep-research | 10.00 | 40.00 |  |  |
| OpenAI: o4 Mini Deep Research | openai/o4-mini-deep-research | 2.00 | 8.00 |  |  |
| NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 | nvidia/llama-3.3-nemotron-super-49b-v1.5 | 0.10 | 0.40 |  |  |
| Baidu: ERNIE 4.5 21B A3B Thinking | baidu/ernie-4.5-21b-a3b-thinking | 0.07 | 0.28 |  |  |
| Google: Nano Banana (Gemini 2.5 Flash Image) | google/gemini-2.5-flash-image | 0.30 | 2.50 |  | [View](https://kilo.ai/models/google/gemini-2.5-flash-image) |
| Qwen: Qwen3 VL 30B A3B Thinking | qwen/qwen3-vl-30b-a3b-thinking | 0.00 | 0.00 | Yes |  |
| Qwen: Qwen3 VL 30B A3B Instruct | qwen/qwen3-vl-30b-a3b-instruct | 0.13 | 0.52 |  |  |
| OpenAI: GPT-5 Pro | openai/gpt-5-pro | 15.00 | 120.00 |  |  |
| Z.ai: GLM 4.6 | z-ai/glm-4.6 | 0.35 | 1.71 |  | [View](https://kilo.ai/models/z-ai/glm-4.6) |
| Z.ai: GLM 4.6 (exacto) | z-ai/glm-4.6:exacto | 0.44 | 1.76 |  |  |
| DeepSeek: DeepSeek V3.2 Exp | deepseek/deepseek-v3.2-exp | 0.27 | 0.41 |  |  |
| Relace: Relace Apply 3 | relace/relace-apply-3 | 0.85 | 1.25 |  | [View](https://kilo.ai/models/relace/relace-apply-3) |
| Google: Gemini 2.5 Flash Preview 09-2025 | google/gemini-2.5-flash-preview-09-2025 | 0.30 | 2.50 |  |  |
| Google: Gemini 2.5 Flash Lite Preview 09-2025 | google/gemini-2.5-flash-lite-preview-09-2025 | 0.10 | 0.40 |  |  |
| Qwen: Qwen3 VL 235B A22B Thinking | qwen/qwen3-vl-235b-a22b-thinking | 0.00 | 0.00 | Yes |  |
| Qwen: Qwen3 VL 235B A22B Instruct | qwen/qwen3-vl-235b-a22b-instruct | 0.20 | 0.88 |  |  |
| Qwen: Qwen3 Max | qwen/qwen3-max | 1.20 | 6.00 |  |  |
| Qwen: Qwen3 Coder Plus | qwen/qwen3-coder-plus | 1.00 | 5.00 |  |  |
| OpenAI: GPT-5 Codex | openai/gpt-5-codex | 1.25 | 10.00 |  |  |
| DeepSeek: DeepSeek V3.1 Terminus (exacto) | deepseek/deepseek-v3.1-terminus:exacto | 0.21 | 0.79 |  |  |
| DeepSeek: DeepSeek V3.1 Terminus | deepseek/deepseek-v3.1-terminus | 0.21 | 0.79 |  | [View](https://kilo.ai/models/deepseek/deepseek-v3.1-terminus) |
| xAI: Grok 4 Fast | x-ai/grok-4-fast | 0.20 | 0.50 |  |  |
| Tongyi DeepResearch 30B A3B | alibaba/tongyi-deepresearch-30b-a3b | 0.09 | 0.45 |  | [View](https://kilo.ai/models/alibaba/tongyi-deepresearch-30b-a3b) |
| Qwen: Qwen3 Coder Flash | qwen/qwen3-coder-flash | 0.30 | 1.50 |  |  |
| Qwen: Qwen3 Next 80B A3B Thinking | qwen/qwen3-next-80b-a3b-thinking | 0.15 | 1.20 |  |  |
| Qwen: Qwen3 Next 80B A3B Instruct | qwen/qwen3-next-80b-a3b-instruct | 0.09 | 1.10 |  | [View](https://kilo.ai/models/qwen/qwen3-next-80b-a3b-instruct) |
| Qwen: Qwen Plus 0728 | qwen/qwen-plus-2025-07-28 | 0.40 | 1.20 |  | [View](https://kilo.ai/models/qwen/qwen-plus-2025-07-28) |
| Qwen: Qwen Plus 0728 (thinking) | qwen/qwen-plus-2025-07-28:thinking | 0.40 | 1.20 |  | [View](https://kilo.ai/models/qwen/qwen-plus-2025-07-28) |
| NVIDIA: Nemotron Nano 9B V2 | nvidia/nemotron-nano-9b-v2 | 0.04 | 0.16 |  | [View](https://kilo.ai/models/nvidia/nemotron-nano-9b-v2) |
| MoonshotAI: Kimi K2 0905 | moonshotai/kimi-k2-0905 | 0.40 | 2.00 |  |  |
| MoonshotAI: Kimi K2 0905 (exacto) | moonshotai/kimi-k2-0905:exacto | 0.60 | 2.50 |  | [View](https://kilo.ai/models/moonshotai/kimi-k2-0905) |
| Qwen: Qwen3 30B A3B Thinking 2507 | qwen/qwen3-30b-a3b-thinking-2507 | 0.05 | 0.34 |  |  |
| Nous: Hermes 4 70B | nousresearch/hermes-4-70b | 0.11 | 0.38 |  |  |
| Nous: Hermes 4 405B | nousresearch/hermes-4-405b | 1.00 | 3.00 |  |  |
| DeepSeek: DeepSeek V3.1 | deepseek/deepseek-chat-v3.1 | 0.15 | 0.75 |  |  |
| OpenAI: GPT-4o Audio | openai/gpt-4o-audio-preview | 2.50 | 10.00 |  | [View](https://kilo.ai/models/openai/gpt-4o-audio-preview) |
| Mistral: Mistral Medium 3.1 | mistralai/mistral-medium-3.1 | 0.40 | 2.00 |  |  |
| Baidu: ERNIE 4.5 21B A3B | baidu/ernie-4.5-21b-a3b | 0.07 | 0.28 |  |  |
| Baidu: ERNIE 4.5 VL 28B A3B | baidu/ernie-4.5-vl-28b-a3b | 0.14 | 0.56 |  |  |
| Z.ai: GLM 4.5V | z-ai/glm-4.5v | 0.60 | 1.80 |  |  |
| AI21: Jamba Large 1.7 | ai21/jamba-large-1.7 | 2.00 | 8.00 |  | [View](https://kilo.ai/models/ai21/jamba-large-1.7) |
| OpenAI: GPT-5 Chat | openai/gpt-5-chat | 1.25 | 10.00 |  |  |
| OpenAI: GPT-5 | openai/gpt-5 | 1.25 | 10.00 |  |  |
| OpenAI: GPT-5 Mini | openai/gpt-5-mini | 0.25 | 2.00 |  |  |
| OpenAI: GPT-5 Nano | openai/gpt-5-nano | 0.05 | 0.40 |  |  |
| OpenAI: gpt-oss-120b | openai/gpt-oss-120b | 0.04 | 0.19 |  | [View](https://kilo.ai/models/openai/gpt-oss-120b) |
| OpenAI: gpt-oss-120b (exacto) | openai/gpt-oss-120b:exacto | 0.04 | 0.19 |  |  |
| OpenAI: gpt-oss-20b | openai/gpt-oss-20b | 0.03 | 0.14 |  | [View](https://kilo.ai/models/openai/gpt-oss-20b) |
| Anthropic: Claude Opus 4.1 | anthropic/claude-opus-4.1 | 15.00 | 75.00 |  |  |
| Mistral: Codestral 2508 | mistralai/codestral-2508 | 0.30 | 0.90 |  |  |
| Qwen: Qwen3 Coder 30B A3B Instruct | qwen/qwen3-coder-30b-a3b-instruct | 0.07 | 0.27 |  |  |
| Qwen: Qwen3 30B A3B Instruct 2507 | qwen/qwen3-30b-a3b-instruct-2507 | 0.08 | 0.33 |  |  |
| Z.ai: GLM 4.5 | z-ai/glm-4.5 | 0.35 | 1.55 |  |  |
| Z.ai: GLM 4.5 Air | z-ai/glm-4.5-air | 0.13 | 0.85 |  |  |
| Qwen: Qwen3 235B A22B Thinking 2507 | qwen/qwen3-235b-a22b-thinking-2507 | 0.00 | 0.00 | Yes |  |
| Z.ai: GLM 4 32B | z-ai/glm-4-32b | 0.10 | 0.10 |  | [View](https://kilo.ai/models/z-ai/glm-4-32b) |
| Qwen: Qwen3 Coder 480B A35B | qwen/qwen3-coder | 0.22 | 1.00 |  | [View](https://kilo.ai/models/qwen/qwen3-coder) |
| Qwen: Qwen3 Coder 480B A35B (exacto) | qwen/qwen3-coder:exacto | 0.22 | 1.80 |  | [View](https://kilo.ai/models/qwen/qwen3-coder) |
| ByteDance: UI-TARS 7B | bytedance/ui-tars-1.5-7b | 0.10 | 0.20 |  | [View](https://kilo.ai/models/bytedance/ui-tars-1.5-7b) |
| Google: Gemini 2.5 Flash Lite | google/gemini-2.5-flash-lite | 0.10 | 0.40 |  |  |
| Qwen: Qwen3 235B A22B Instruct 2507 | qwen/qwen3-235b-a22b-2507 | 0.07 | 0.10 |  |  |
| Switchpoint Router | switchpoint/router | 0.85 | 3.40 |  | [View](https://kilo.ai/models/switchpoint/router) |
| MoonshotAI: Kimi K2 0711 | moonshotai/kimi-k2 | 0.50 | 2.40 |  |  |
| Mistral: Devstral Medium | mistralai/devstral-medium | 0.40 | 2.00 |  |  |
| Mistral: Devstral Small 1.1 | mistralai/devstral-small | 0.10 | 0.30 |  |  |
| xAI: Grok 4 | x-ai/grok-4 | 3.00 | 15.00 |  |  |
| Tencent: Hunyuan A13B Instruct | tencent/hunyuan-a13b-instruct | 0.14 | 0.57 |  |  |
| TNG: DeepSeek R1T2 Chimera | tngtech/deepseek-r1t2-chimera | 0.25 | 0.85 |  |  |
| Morph: Morph V3 Large | morph/morph-v3-large | 0.90 | 1.90 |  |  |
| Morph: Morph V3 Fast | morph/morph-v3-fast | 0.80 | 1.20 |  |  |
| Baidu: ERNIE 4.5 VL 424B A47B | baidu/ernie-4.5-vl-424b-a47b | 0.42 | 1.25 |  |  |
| Baidu: ERNIE 4.5 300B A47B | baidu/ernie-4.5-300b-a47b | 0.28 | 1.10 |  |  |
| Inception: Mercury | inception/mercury | 0.25 | 1.00 |  |  |
| Mistral: Mistral Small 3.2 24B | mistralai/mistral-small-3.2-24b-instruct | 0.06 | 0.18 |  |  |
| MiniMax: MiniMax M1 | minimax/minimax-m1 | 0.40 | 2.20 |  |  |
| Google: Gemini 2.5 Flash | google/gemini-2.5-flash | 0.30 | 2.50 |  |  |
| Google: Gemini 2.5 Pro | google/gemini-2.5-pro | 1.25 | 10.00 |  |  |
| OpenAI: o3 Pro | openai/o3-pro | 20.00 | 80.00 |  |  |
| xAI: Grok 3 Mini | x-ai/grok-3-mini | 0.30 | 0.50 |  |  |
| xAI: Grok 3 | x-ai/grok-3 | 3.00 | 15.00 |  |  |
| Google: Gemini 2.5 Pro Preview 06-05 | google/gemini-2.5-pro-preview | 1.25 | 10.00 |  |  |
| DeepSeek: R1 0528 | deepseek/deepseek-r1-0528 | 0.40 | 1.75 |  | [View](https://kilo.ai/models/deepseek/deepseek-r1-0528) |
| Anthropic: Claude Opus 4 | anthropic/claude-opus-4 | 15.00 | 75.00 |  |  |
| Anthropic: Claude Sonnet 4 | anthropic/claude-sonnet-4 | 3.00 | 15.00 |  |  |
| Google: Gemma 3n 4B | google/gemma-3n-e4b-it | 0.02 | 0.04 |  | [View](https://kilo.ai/models/google/gemma-3n-e4b-it) |
| Mistral: Mistral Medium 3 | mistralai/mistral-medium-3 | 0.40 | 2.00 |  |  |
| Google: Gemini 2.5 Pro Preview 05-06 | google/gemini-2.5-pro-preview-05-06 | 1.25 | 10.00 |  |  |
| Arcee AI: Spotlight | arcee-ai/spotlight | 0.18 | 0.18 |  | [View](https://kilo.ai/models/arcee-ai/spotlight) |
| Arcee AI: Maestro Reasoning | arcee-ai/maestro-reasoning | 0.90 | 3.30 |  | [View](https://kilo.ai/models/arcee-ai/maestro-reasoning) |
| Arcee AI: Virtuoso Large | arcee-ai/virtuoso-large | 0.75 | 1.20 |  | [View](https://kilo.ai/models/arcee-ai/virtuoso-large) |
| Arcee AI: Coder Large | arcee-ai/coder-large | 0.50 | 0.80 |  | [View](https://kilo.ai/models/arcee-ai/coder-large) |
| Inception: Mercury Coder | inception/mercury-coder | 0.25 | 1.00 |  |  |
| Meta: Llama Guard 4 12B | meta-llama/llama-guard-4-12b | 0.18 | 0.18 |  |  |
| Qwen: Qwen3 30B A3B | qwen/qwen3-30b-a3b | 0.06 | 0.22 |  |  |
| Qwen: Qwen3 8B | qwen/qwen3-8b | 0.05 | 0.40 |  |  |
| Qwen: Qwen3 14B | qwen/qwen3-14b | 0.05 | 0.22 |  |  |
| Qwen: Qwen3 32B | qwen/qwen3-32b | 0.08 | 0.24 |  |  |
| Qwen: Qwen3 235B A22B | qwen/qwen3-235b-a22b | 0.30 | 1.20 |  |  |
| TNG: DeepSeek R1T Chimera | tngtech/deepseek-r1t-chimera | 0.30 | 1.20 |  |  |
| OpenAI: o4 Mini High | openai/o4-mini-high | 1.10 | 4.40 |  | [View](https://kilo.ai/models/openai/o4-mini-high) |
| OpenAI: o3 | openai/o3 | 2.00 | 8.00 |  |  |
| OpenAI: o4 Mini | openai/o4-mini | 1.10 | 4.40 |  |  |
| Qwen: Qwen2.5 Coder 7B Instruct | qwen/qwen2.5-coder-7b-instruct | 0.03 | 0.09 |  |  |
| OpenAI: GPT-4.1 | openai/gpt-4.1 | 2.00 | 8.00 |  |  |
| OpenAI: GPT-4.1 Mini | openai/gpt-4.1-mini | 0.40 | 1.60 |  |  |
| OpenAI: GPT-4.1 Nano | openai/gpt-4.1-nano | 0.10 | 0.40 |  |  |
| xAI: Grok 3 Mini Beta | x-ai/grok-3-mini-beta | 0.30 | 0.50 |  |  |
| xAI: Grok 3 Beta | x-ai/grok-3-beta | 3.00 | 15.00 |  |  |
| NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 | nvidia/llama-3.1-nemotron-ultra-253b-v1 | 0.60 | 1.80 |  |  |
| Meta: Llama 4 Maverick | meta-llama/llama-4-maverick | 0.15 | 0.60 |  |  |
| Meta: Llama 4 Scout | meta-llama/llama-4-scout | 0.08 | 0.30 |  |  |
| Qwen: Qwen2.5 VL 32B Instruct | qwen/qwen2.5-vl-32b-instruct | 0.05 | 0.22 |  |  |
| DeepSeek: DeepSeek V3 0324 | deepseek/deepseek-chat-v3-0324 | 0.19 | 0.87 |  |  |
| OpenAI: o1-pro | openai/o1-pro | 150.00 | 600.00 |  |  |
| Mistral: Mistral Small 3.1 24B | mistralai/mistral-small-3.1-24b-instruct | 0.03 | 0.11 |  |  |
| AllenAI: Olmo 2 32B Instruct | allenai/olmo-2-0325-32b-instruct | 0.05 | 0.20 |  | [View](https://kilo.ai/models/allenai/olmo-2-0325-32b-instruct) |
| Google: Gemma 3 4B | google/gemma-3-4b-it | 0.04 | 0.08 |  | [View](https://kilo.ai/models/google/gemma-3-4b-it) |
| Google: Gemma 3 12B | google/gemma-3-12b-it | 0.04 | 0.13 |  | [View](https://kilo.ai/models/google/gemma-3-12b-it) |
| Cohere: Command A | cohere/command-a | 2.50 | 10.00 |  |  |
| OpenAI: GPT-4o-mini Search Preview | openai/gpt-4o-mini-search-preview | 0.15 | 0.60 |  |  |
| OpenAI: GPT-4o Search Preview | openai/gpt-4o-search-preview | 2.50 | 10.00 |  | [View](https://kilo.ai/models/openai/gpt-4o-search-preview) |
| Google: Gemma 3 27B | google/gemma-3-27b-it | 0.04 | 0.15 |  |  |
| Perplexity: Sonar Reasoning Pro | perplexity/sonar-reasoning-pro | 2.00 | 8.00 |  |  |
| Perplexity: Sonar Pro | perplexity/sonar-pro | 3.00 | 15.00 |  |  |
| Perplexity: Sonar Deep Research | perplexity/sonar-deep-research | 2.00 | 8.00 |  |  |
| Qwen: QwQ 32B | qwen/qwq-32b | 0.15 | 0.40 |  |  |
| Google: Gemini 2.0 Flash Lite | google/gemini-2.0-flash-lite-001 | 0.08 | 0.30 |  |  |
| Anthropic: Claude 3.7 Sonnet (thinking) | anthropic/claude-3.7-sonnet:thinking | 3.00 | 15.00 |  | [View](https://kilo.ai/models/anthropic/claude-3.7-sonnet) |
| Anthropic: Claude 3.7 Sonnet | anthropic/claude-3.7-sonnet | 3.00 | 15.00 |  |  |
| Mistral: Saba | mistralai/mistral-saba | 0.20 | 0.60 |  | [View](https://kilo.ai/models/mistralai/mistral-saba) |
| Llama Guard 3 8B | meta-llama/llama-guard-3-8b | 0.02 | 0.06 |  |  |
| OpenAI: o3 Mini High | openai/o3-mini-high | 1.10 | 4.40 |  |  |
| Google: Gemini 2.0 Flash | google/gemini-2.0-flash-001 | 0.10 | 0.40 |  |  |
| Qwen: Qwen VL Plus | qwen/qwen-vl-plus | 0.21 | 0.63 |  |  |
| AionLabs: Aion-1.0 | aion-labs/aion-1.0 | 4.00 | 8.00 |  | [View](https://kilo.ai/models/aion-labs/aion-1.0) |
| AionLabs: Aion-1.0-Mini | aion-labs/aion-1.0-mini | 0.70 | 1.40 |  | [View](https://kilo.ai/models/aion-labs/aion-1.0-mini) |
| AionLabs: Aion-RP 1.0 (8B) | aion-labs/aion-rp-llama-3.1-8b | 0.80 | 1.60 |  | [View](https://kilo.ai/models/aion-labs/aion-rp-llama-3.1-8b) |
| Qwen: Qwen VL Max | qwen/qwen-vl-max | 0.80 | 3.20 |  |  |
| Qwen: Qwen-Turbo | qwen/qwen-turbo | 0.05 | 0.20 |  |  |
| Qwen: Qwen2.5 VL 72B Instruct | qwen/qwen2.5-vl-72b-instruct | 0.15 | 0.60 |  |  |
| Qwen: Qwen-Plus | qwen/qwen-plus | 0.40 | 1.20 |  |  |
| Qwen: Qwen-Max | qwen/qwen-max | 1.60 | 6.40 |  |  |
| OpenAI: o3 Mini | openai/o3-mini | 1.10 | 4.40 |  |  |
| Mistral: Mistral Small 3 | mistralai/mistral-small-24b-instruct-2501 | 0.05 | 0.08 |  |  |
| DeepSeek: R1 Distill Qwen 32B | deepseek/deepseek-r1-distill-qwen-32b | 0.29 | 0.29 |  |  |
| Perplexity: Sonar | perplexity/sonar | 1.00 | 1.00 |  |  |
| DeepSeek: R1 Distill Llama 70B | deepseek/deepseek-r1-distill-llama-70b | 0.03 | 0.11 |  |  |
| DeepSeek: R1 | deepseek/deepseek-r1 | 0.70 | 2.50 |  |  |
| MiniMax: MiniMax-01 | minimax/minimax-01 | 0.20 | 1.10 |  |  |
| Microsoft: Phi 4 | microsoft/phi-4 | 0.06 | 0.14 |  |  |
| DeepSeek: DeepSeek V3 | deepseek/deepseek-chat | 0.30 | 1.20 |  |  |
| OpenAI: o1 | openai/o1 | 15.00 | 60.00 |  |  |
| Cohere: Command R7B (12-2024) | cohere/command-r7b-12-2024 | 0.04 | 0.15 |  |  |
| Meta: Llama 3.3 70B Instruct | meta-llama/llama-3.3-70b-instruct | 0.10 | 0.32 |  |  |
| Amazon: Nova Lite 1.0 | amazon/nova-lite-v1 | 0.06 | 0.24 |  | [View](https://kilo.ai/models/amazon/nova-lite-v1) |
| Amazon: Nova Micro 1.0 | amazon/nova-micro-v1 | 0.04 | 0.14 |  | [View](https://kilo.ai/models/amazon/nova-micro-v1) |
| Amazon: Nova Pro 1.0 | amazon/nova-pro-v1 | 0.80 | 3.20 |  |  |
| OpenAI: GPT-4o (2024-11-20) | openai/gpt-4o-2024-11-20 | 2.50 | 10.00 |  |  |
| Mistral Large 2411 | mistralai/mistral-large-2411 | 2.00 | 6.00 |  |  |
| Mistral Large 2407 | mistralai/mistral-large-2407 | 2.00 | 6.00 |  | [View](https://kilo.ai/models/mistralai/mistral-large-2407) |
| Mistral: Pixtral Large 2411 | mistralai/pixtral-large-2411 | 2.00 | 6.00 |  | [View](https://kilo.ai/models/mistralai/pixtral-large-2411) |
| Qwen2.5 Coder 32B Instruct | qwen/qwen-2.5-coder-32b-instruct | 0.03 | 0.11 |  |  |
| Anthropic: Claude 3.5 Haiku | anthropic/claude-3.5-haiku | 0.80 | 4.00 |  |  |
| Anthropic: Claude 3.5 Sonnet | anthropic/claude-3.5-sonnet | 6.00 | 30.00 |  |  |
| Qwen: Qwen2.5 7B Instruct | qwen/qwen-2.5-7b-instruct | 0.04 | 0.10 |  |  |
| NVIDIA: Llama 3.1 Nemotron 70B Instruct | nvidia/llama-3.1-nemotron-70b-instruct | 1.20 | 1.20 |  |  |
| Inflection: Inflection 3 Pi | inflection/inflection-3-pi | 2.50 | 10.00 |  | [View](https://kilo.ai/models/inflection/inflection-3-pi) |
| Inflection: Inflection 3 Productivity | inflection/inflection-3-productivity | 2.50 | 10.00 |  | [View](https://kilo.ai/models/inflection/inflection-3-productivity) |
| Meta: Llama 3.2 3B Instruct | meta-llama/llama-3.2-3b-instruct | 0.02 | 0.02 |  |  |
| Meta: Llama 3.2 11B Vision Instruct | meta-llama/llama-3.2-11b-vision-instruct | 0.05 | 0.05 |  |  |
| Meta: Llama 3.2 1B Instruct | meta-llama/llama-3.2-1b-instruct | 0.03 | 0.20 |  |  |
| Qwen2.5 72B Instruct | qwen/qwen-2.5-72b-instruct | 0.12 | 0.39 |  |  |
| Cohere: Command R (08-2024) | cohere/command-r-08-2024 | 0.15 | 0.60 |  |  |
| Cohere: Command R+ (08-2024) | cohere/command-r-plus-08-2024 | 2.50 | 10.00 |  |  |
| Qwen: Qwen2.5-VL 7B Instruct | qwen/qwen-2.5-vl-7b-instruct | 0.20 | 0.20 |  |  |
| OpenAI: ChatGPT-4o | openai/chatgpt-4o-latest | 5.00 | 15.00 |  |  |
| OpenAI: GPT-4o (2024-08-06) | openai/gpt-4o-2024-08-06 | 2.50 | 10.00 |  |  |
| Meta: Llama 3.1 405B (base) | meta-llama/llama-3.1-405b | 4.00 | 4.00 |  | [View](https://kilo.ai/models/meta-llama/llama-3.1-405b) |
| Meta: Llama 3.1 8B Instruct | meta-llama/llama-3.1-8b-instruct | 0.02 | 0.05 |  |  |
| Meta: Llama 3.1 405B Instruct | meta-llama/llama-3.1-405b-instruct | 4.00 | 4.00 |  |  |
| Meta: Llama 3.1 70B Instruct | meta-llama/llama-3.1-70b-instruct | 0.40 | 0.40 |  |  |
| Mistral: Mistral Nemo | mistralai/mistral-nemo | 0.02 | 0.04 |  |  |
| OpenAI: GPT-4o-mini (2024-07-18) | openai/gpt-4o-mini-2024-07-18 | 0.15 | 0.60 |  | [View](https://kilo.ai/models/openai/gpt-4o-mini-2024-07-18) |
| OpenAI: GPT-4o-mini | openai/gpt-4o-mini | 0.15 | 0.60 |  |  |
| Google: Gemma 2 27B | google/gemma-2-27b-it | 0.65 | 0.65 |  |  |
| Google: Gemma 2 9B | google/gemma-2-9b-it | 0.03 | 0.09 |  |  |
| Mistral: Mistral 7B Instruct v0.3 | mistralai/mistral-7b-instruct-v0.3 | 0.20 | 0.20 |  |  |
| Mistral: Mistral 7B Instruct | mistralai/mistral-7b-instruct | 0.20 | 0.20 |  |  |
| OpenAI: GPT-4o (2024-05-13) | openai/gpt-4o-2024-05-13 | 5.00 | 15.00 |  |  |
| Meta: LlamaGuard 2 8B | meta-llama/llama-guard-2-8b | 0.20 | 0.20 |  | [View](https://kilo.ai/models/meta-llama/llama-guard-2-8b) |
| OpenAI: GPT-4o | openai/gpt-4o | 2.50 | 10.00 |  | [View](https://kilo.ai/models/openai/gpt-4o) |
| OpenAI: GPT-4o (extended) | openai/gpt-4o:extended | 6.00 | 18.00 |  |  |
| Meta: Llama 3 70B Instruct | meta-llama/llama-3-70b-instruct | 0.51 | 0.74 |  |  |
| Meta: Llama 3 8B Instruct | meta-llama/llama-3-8b-instruct | 0.03 | 0.04 |  |  |
| Mistral: Mixtral 8x22B Instruct | mistralai/mixtral-8x22b-instruct | 2.00 | 6.00 |  |  |
| WizardLM-2 8x22B | microsoft/wizardlm-2-8x22b | 0.62 | 0.62 |  |  |
| OpenAI: GPT-4 Turbo | openai/gpt-4-turbo | 10.00 | 30.00 |  |  |
| Anthropic: Claude 3 Haiku | anthropic/claude-3-haiku | 0.25 | 1.25 |  |  |
| Mistral Large | mistralai/mistral-large | 2.00 | 6.00 |  |  |
| OpenAI: GPT-3.5 Turbo (older v0613) | openai/gpt-3.5-turbo-0613 | 1.00 | 2.00 |  |  |
| OpenAI: GPT-4 Turbo Preview | openai/gpt-4-turbo-preview | 10.00 | 30.00 |  | [View](https://kilo.ai/models/openai/gpt-4-turbo-preview) |
| Mistral: Mixtral 8x7B Instruct | mistralai/mixtral-8x7b-instruct | 0.54 | 0.54 |  | [View](https://kilo.ai/models/mistralai/mixtral-8x7b-instruct) |
| Auto Router | openrouter/auto |  |  |  | [View](https://kilo.ai/models/openrouter/auto) |
| OpenAI: GPT-4 Turbo (older v1106) | openai/gpt-4-1106-preview | 10.00 | 30.00 |  | [View](https://kilo.ai/models/openai/gpt-4-1106-preview) |
| Mistral: Mistral 7B Instruct v0.1 | mistralai/mistral-7b-instruct-v0.1 | 0.11 | 0.19 |  |  |
| OpenAI: GPT-3.5 Turbo Instruct | openai/gpt-3.5-turbo-instruct | 1.50 | 2.00 |  |  |
| OpenAI: GPT-3.5 Turbo 16k | openai/gpt-3.5-turbo-16k | 3.00 | 4.00 |  | [View](https://kilo.ai/models/openai/gpt-3.5-turbo-16k) |
| Mancer: Weaver (alpha) | mancer/weaver | 0.75 | 1.00 |  | [View](https://kilo.ai/models/mancer/weaver) |
| OpenAI: GPT-3.5 Turbo | openai/gpt-3.5-turbo | 0.50 | 1.50 |  |  |
| OpenAI: GPT-4 (older v0314) | openai/gpt-4-0314 | 30.00 | 60.00 |  | [View](https://kilo.ai/models/openai/gpt-4-0314) |
| OpenAI: GPT-4 | openai/gpt-4 | 30.00 | 60.00 |  |  |
| Nous: DeepHermes 3 Mistral 24B Preview | nousresearch/deephermes-3-mistral-24b-preview | 0.02 | 0.10 |  |  |
| Nous: Hermes 3 70B Instruct | nousresearch/hermes-3-llama-3.1-70b | 0.30 | 0.30 |  | [View](https://kilo.ai/models/nousresearch/hermes-3-llama-3.1-70b) |
| Nous: Hermes 3 405B Instruct | nousresearch/hermes-3-llama-3.1-405b | 1.00 | 1.00 |  |  |
| NousResearch: Hermes 2 Pro - Llama-3 8B | nousresearch/hermes-2-pro-llama-3-8b | 0.14 | 0.14 |  |  |
| Qwen: Qwen3 Coder Next | qwen/qwen3-coder-next | 0.07 | 0.30 |  |  |
| Kilo: Auto | kilo/auto | 1.00 | 1.00 |  |  |
| Anthropic: Claude Opus 4.6 | anthropic/claude-opus-4.6 | 5.00 | 25.00 |  |  |
| Qwen: Qwen3 Max Thinking | qwen/qwen3-max-thinking | 1.20 | 6.00 |  |  |
| StepFun: Step 3.5 Flash (free) | stepfun/step-3.5-flash:free | 0.00 | 0.00 | Yes |  |
| LiquidAI: LFM2.5-1.2B-Thinking (free) | liquid/lfm-2.5-1.2b-thinking:free | 0.00 | 0.00 | Yes |  |
| LiquidAI: LFM2.5-1.2B-Instruct (free) | liquid/lfm-2.5-1.2b-instruct:free | 0.00 | 0.00 | Yes |  |
| NVIDIA: Nemotron Nano 12B 2 VL (free) | nvidia/nemotron-nano-12b-v2-vl:free | 0.00 | 0.00 | Yes |  |
| OpenAI: gpt-oss-20b (free) | openai/gpt-oss-20b:free | 0.00 | 0.00 | Yes |  |
| Google: Gemma 3n 2B (free) | google/gemma-3n-e2b-it:free | 0.00 | 0.00 | Yes |  |
| Google: Gemma 3 12B (free) | google/gemma-3-12b-it:free | 0.00 | 0.00 | Yes |  |
| StepFun: Step 3.5 Flash | stepfun/step-3.5-flash | 0.10 | 0.30 |  | [View](https://kilo.ai/models/stepfun/step-3.5-flash) |
| Z.ai: GLM 5 | z-ai/glm-5 | 0.30 | 2.55 |  |  |
| MiniMax: MiniMax M2.5 (free) | minimax/minimax-m2.5:free | 0.00 | 0.00 | Yes |  |
| MiniMax: MiniMax M2.5 | minimax/minimax-m2.5 | 0.30 | 1.10 |  | [View](https://kilo.ai/models/minimax/minimax-m2.5) |
| AllenAI: Molmo2 8B | allenai/molmo-2-8b | 0.20 | 0.20 |  |  |
| Meituan: LongCat Flash Chat | meituan/longcat-flash-chat | 0.20 | 0.80 |  |  |
| Qwen: Qwen3.5 Plus 2026-02-15 | qwen/qwen3.5-plus-02-15 | 0.40 | 2.40 |  |  |
| Qwen: Qwen3.5 397B A17B | qwen/qwen3.5-397b-a17b | 0.60 | 3.60 |  |  |
| Anthropic: Claude Sonnet 4.6 | anthropic/claude-sonnet-4.6 | 3.00 | 15.00 |  | [View](https://kilo.ai/models/anthropic/claude-sonnet-4.6) |
| Google: Gemini 3.1 Pro Preview | google/gemini-3.1-pro-preview | 2.00 | 12.00 |  | [View](https://kilo.ai/models/google/gemini-3.1-pro-preview) |
| OpenAI: GPT-5.3-Codex (new) | openai/gpt-5.3-codex | 1.75 | 14.00 |  | [View](https://kilo.ai/models/openai/gpt-5.3-codex) |
| AionLabs: Aion-2.0 | aion-labs/aion-2.0 | 0.80 | 1.60 |  | [View](https://kilo.ai/models/aion-labs/aion-2.0) |
| LiquidAI: LFM2-24B-A2B | liquid/lfm-2-24b-a2b | 0.03 | 0.12 |  | [View](https://kilo.ai/models/liquid/lfm-2-24b-a2b) |
| Google: Gemini 3.1 Pro Preview Custom Tools | google/gemini-3.1-pro-preview-customtools | 2.00 | 12.00 |  | [View](https://kilo.ai/models/google/gemini-3.1-pro-preview-customtools) |
| Qwen: Qwen3.5-35B-A3B | qwen/qwen3.5-35b-a3b | 0.25 | 2.00 |  | [View](https://kilo.ai/models/qwen/qwen3.5-35b-a3b) |
| Qwen: Qwen3.5-27B | qwen/qwen3.5-27b | 0.30 | 2.40 |  | [View](https://kilo.ai/models/qwen/qwen3.5-27b) |
| Qwen: Qwen3.5-122B-A10B | qwen/qwen3.5-122b-a10b | 0.40 | 3.20 |  | [View](https://kilo.ai/models/qwen/qwen3.5-122b-a10b) |
| Qwen: Qwen3.5-Flash | qwen/qwen3.5-flash-02-23 | 0.10 | 0.40 |  | [View](https://kilo.ai/models/qwen/qwen3.5-flash-02-23) |
| NVIDIA: Nemotron Nano 9B V2 (free) | nvidia/nemotron-nano-9b-v2:free | 0.00 | 0.00 | Yes |  |
| NVIDIA: Nemotron 3 Nano 30B A3B (free) | nvidia/nemotron-3-nano-30b-a3b:free | 0.00 | 0.00 | Yes |  |
| DeepSeek: R1 0528 (free) | deepseek/deepseek-r1-0528:free | 0.00 | 0.00 | Yes |  |
| Google: Gemma 3n 4B (free) | google/gemma-3n-e4b-it:free | 0.00 | 0.00 | Yes |  |
| Google: Gemma 3 4B (free) | google/gemma-3-4b-it:free | 0.00 | 0.00 | Yes |  |
| Qwen: Qwen3 Coder 480B A35B (free) | qwen/qwen3-coder:free | 0.00 | 0.00 | Yes |  |
| Qwen: Qwen3 4B | qwen/qwen3-4b | 0.07 | 0.27 |  |  |
| Qwen: Qwen3 Next 80B A3B Instruct (free) | qwen/qwen3-next-80b-a3b-instruct:free | 0.00 | 0.00 | Yes |  |
| MoonshotAI: Kimi K2.5 (free) | moonshotai/kimi-k2.5:free | 0.00 | 0.00 | Yes | [View](https://kilo.ai/models/moonshotai/kimi-k2.5) |
| Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) | google/gemini-3.1-flash-image-preview | 0.25 | 1.50 |  | [View](https://kilo.ai/models/google/gemini-3.1-flash-image-preview) |
| ByteDance Seed: Seed-2.0-Mini | bytedance-seed/seed-2.0-mini | 0.10 | 0.40 |  | [View](https://kilo.ai/models/bytedance-seed/seed-2.0-mini) |
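
The prices in the table can be turned into a rough per-request cost estimate. The sketch below assumes that prices are quoted in US dollars per 1M tokens and that billing is linear in token count; it does not model prompt-caching discounts or any other adjustments, which this page does not describe. The `PRICES` dictionary and the `estimate_cost` helper are hypothetical names introduced for illustration, with values copied from three rows of the table.

```python
from decimal import Decimal

# Prices in USD per 1M tokens as (input, output), copied from the table.
# Assumption: billing is linear in tokens; caching discounts are not modeled.
PRICES = {
    "anthropic/claude-sonnet-4.5": (Decimal("3.00"), Decimal("15.00")),
    "openai/gpt-5.2": (Decimal("1.75"), Decimal("14.00")),
    "minimax/minimax-m2.5:free": (Decimal("0.00"), Decimal("0.00")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


# 200,000 input tokens and 50,000 output tokens on Claude Sonnet 4.5:
# 0.2 * 3.00 + 0.05 * 15.00 = 0.60 + 0.75 = 1.35
cost = estimate_cost("anthropic/claude-sonnet-4.5", 200_000, 50_000)
print(f"${cost:.2f}")  # prints "$1.35"
```

`Decimal` is used instead of `float` so that small per-token amounts add up without binary rounding drift; for quick comparisons between models, plain floats would likely be adequate.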
