GMI Cloud
gmi
Updated 47 minutes ago
GMI Cloud is an AI model hosting platform that provides access to leading large language models including Kimi K2.5, Claude (Haiku 4.5, Opus 4.1, Sonnet 4, 3.7 Sonnet), GPT-5.1, Gemini 2.5, Grok 2, and DeepSeek models. The platform offers serverless deployment with transparent pricing per 1M tokens, GPU hardware options (H200), and model metadata including context lengths, quantization (int4, fp8), and provider information. GMI Cloud features an OpenAI-compatible API at api.gmi-serving.com for easy integration.
Browse 69 LLM models available from GMI Cloud. Compare prices and features.
Models (69)
| Organization | Model Name | Original Model | Input | Output | Free | |||
|---|---|---|---|---|---|---|---|---|
|
|
OpenAI | GPT-5.5 |
openai/gpt-5.5
|
$3.50 | $21.00 |
|
||
|
|
DeepSeek | DeepSeek-V4-Pro-Max |
deepseek-ai/DeepSeek-V4-Pro
|
$1.39 | $2.78 |
|
||
|
|
DeepSeek | DeepSeek V4 Flash |
deepseek-ai/DeepSeek-V4-Flash
|
$0.11 | $0.22 |
|
||
|
|
Anthropic | Claude Opus 4.7 |
anthropic/claude-opus-4.7
|
$4.50 | $22.50 |
|
||
|
|
Moonshot AI | Kimi K2.6 |
moonshotai/Kimi-K2.6
|
$0.86 | $3.60 |
|
||
|
|
Minimax | MiniMax M2.7 |
MiniMaxAI/MiniMax-M2.7
|
$0.30 | $1.20 |
|
||
|
|
qwen | Qwen3.6 Plus |
Qwen/Qwen3.6-Plus
|
$0.50 | $3.00 |
|
||
|
|
qwen | Qwen3.6 Plus |
Qwen/Qwen3.6-Plus-2026-04-02
|
$0.50 | $3.00 |
|
||
|
|
OpenAI | GPT-5.4 |
openai/gpt-5.4
|
$2.50 | $15.00 |
|
||
|
|
Xiaomi | MiMo-V2.5-Pro |
XiaomiMiMo/MiMo-V2.5-Pro
|
$0.80 | $2.40 |
|
||
|
|
Gemma 4 31B |
google/gemma-4-31b-it
|
$0.14 | $0.40 |
|
|||
|
|
Gemma 4 26B-A4B |
google/gemma-4-26b-a4b-it
|
$0.13 | $0.40 | ||||
|
|
Gemini 3.1 Pro |
google/gemini-3.1-pro-preview
|
$2.00 | $12.00 |
|
|||
|
|
OpenAI | GPT-5.4 mini |
openai/gpt-5.4-mini
|
$0.75 | $4.50 |
|
||
|
|
OpenAI | GPT-5.4 nano |
openai/gpt-5.4-nano
|
$0.20 | $1.25 |
|
||
|
|
OpenAI | GPT-5.4 Pro |
openai/gpt-5.4-pro
|
$30.00 | $180.00 | |||
|
|
Anthropic | Claude Opus 4.6 |
anthropic/claude-opus-4.6
|
$5.00 | $25.00 |
|
||
|
|
qwen | Qwen3.5-27B |
Qwen/Qwen3.5-27B
|
$0.30 | $2.40 | |||
|
|
Anthropic | Claude Sonnet 4.6 |
anthropic/claude-sonnet-4.6
|
$3.00 | $15.00 |
|
||
|
|
qwen | Qwen3.5-122B-A10B |
Qwen/Qwen3.5-122B-A10B
|
$0.40 | $3.20 | |||
|
|
Minimax | MiniMax M2.5 |
MiniMaxAI/MiniMax-M2.5
|
$0.30 | $1.20 |
|
||
|
|
qwen | Qwen3.5-35B-A3B |
Qwen/Qwen3.5-35B-A3B
|
$0.25 | $2.00 | |||
|
|
Gemini 3.1 Flash-Lite |
google/gemini-3.1-flash-lite-preview
|
$0.25 | $1.50 |
|
|||
|
|
qwen | Qwen3.5-397B-A17B |
Qwen/Qwen3.5-397B-A17B
|
$0.60 | $3.60 | |||
|
|
Moonshot AI | Kimi K2.5 |
moonshotai/Kimi-K2.5
|
$0.60 | $3.00 | |||
|
|
OpenAI | GPT-5.2 |
openai/gpt-5.2
|
$1.75 | $14.00 |
|
||
|
|
OpenAI | GPT-5.3 Codex |
openai/gpt-5.3-codex
|
$1.75 | $14.00 |
|
||
|
|
Minimax | MiniMax M2.1 |
MiniMaxAI/MiniMax-M2.1
|
$0.30 | $1.20 | |||
|
|
DeepSeek | DeepSeek-V3.2-Speciale |
deepseek-ai/DeepSeek-V3.2-Speciale
|
$0.28 | $0.40 | |||
|
|
Anthropic | Claude Opus 4.5 |
anthropic/claude-opus-4.5
|
$5.00 | $25.00 |
|
||
|
|
OpenAI | GPT-5.1 Thinking |
openai/gpt-5.1
|
$1.25 | $10.00 | |||
|
|
Z.ai | GLM-4.7-Flash |
zai-org/GLM-4.7-Flash
|
$0.07 | $0.40 | |||
|
|
Anthropic | Claude 4.5 Haiku |
anthropic/claude-haiku-4.5
|
$1.00 | $5.00 |
|
||
|
|
Anthropic | Claude 4.5 Sonnet |
anthropic/claude-sonnet-4.5
|
$3.00 | $15.00 |
|
||
|
|
Minimax | MiniMax M2 |
MiniMaxAI/MiniMax-M2
|
$0.30 | $1.20 | |||
|
|
Z.ai | GLM-4.6 |
zai-org/GLM-4.6
|
$0.60 | $2.00 | |||
|
|
DeepSeek | DeepSeek-V3.2-Exp |
deepseek-ai/DeepSeek-V3.2-Exp
|
$0.27 | $0.41 | |||
|
|
OpenAI | GPT-5 |
openai/gpt-5
|
$1.25 | $10.00 | |||
|
|
qwen | Qwen3-Next-80B-A3B-Thinking |
Qwen/Qwen3-Next-80B-A3B-Thinking
|
$0.15 | $1.50 | |||
|
|
Anthropic | Claude 4.1 Opus |
anthropic/claude-opus-4.1
|
$15.00 | $75.00 | |||
|
|
Anthropic | Claude 4 Sonnet |
anthropic/claude-sonnet-4
|
$3.00 | $15.00 | |||
|
|
DeepSeek | DeepSeek-R1-0528 |
deepseek-ai/DeepSeek-R1-0528
|
$0.57 | $2.29 | |||
|
|
Moonshot AI | Kimi K2-Instruct-0905 |
moonshotai/Kimi-K2-Instruct-0905
|
$0.57 | $2.29 | |||
|
|
qwen | Qwen3-Next-80B-A3B-Instruct |
Qwen/Qwen3-Next-80B-A3B-Instruct
|
$0.15 | $1.50 | |||
|
|
qwen | Qwen3 30B A3B |
Qwen/Qwen3-30B-A3B
|
$0.08 | $0.25 | |||
|
|
DeepSeek | DeepSeek-V3.1 |
deepseek-ai/DeepSeek-V3.1
|
$0.27 | $1.00 | |||
|
|
OpenAI | GPT-4o mini |
openai/gpt-4o-mini
|
$0.15 | $0.60 |
|
||
|
|
DeepSeek | DeepSeek-V3 0324 |
deepseek-ai/DeepSeek-V3-0324
|
$0.29 | $1.14 | |||
|
|
OpenAI | GPT-4o |
openai/gpt-4o
|
$2.50 | $10.00 | |||
|
|
DeepSeek | DeepSeek R1 Distill Llama 70B |
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
|
$0.25 | $0.75 | |||
|
|
DeepSeek | DeepSeek R1 Distill Qwen 32B |
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
|
$0.50 | $0.90 | |||
|
|
DeepSeek | DeepSeek R1 Distill Qwen 14B |
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
|
$0.20 | $0.20 | |||
|
|
DeepSeek | DeepSeek R1 Distill Qwen 7B |
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
|
$0.10 | $0.20 | |||
|
|
DeepSeek | DeepSeek R1 Distill Llama 8B |
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
|
$0.14 | $0.39 | |||
|
|
Meta | Llama 3.3 70B Instruct |
meta-llama/Llama-3.3-70B-Instruct
|
$0.25 | $0.75 | |||
|
|
qwen | Qwen: Qwen3.6 Max Preview |
Qwen/Qwen3.6-Max-Preview
|
$1.30 | $7.80 | |||
|
|
Xiaomi | MiMo-V2.5 |
XiaomiMiMo/MiMo-V2.5
|
$0.32 | $1.60 | |||
|
|
ByteDance Seed | Seed-2.0-Mini |
bytedance/seed-2.0-mini
|
$0.10 | $0.40 | |||
|
|
OpenAI | GPT-5.2 Codex |
openai/gpt-5.2-codex
|
$1.75 | $14.00 | |||
|
|
Azure | GPT-5.2 Chat |
openai/gpt-5.2-chat
|
$1.75 | $14.00 | |||
|
|
Azure | GPT-5.1 Chat |
openai/gpt-5.1-chat
|
$1.25 | $10.00 | |||
|
|
Moonshot AI | Kimi K2 Thinking |
moonshotai/Kimi-K2-Thinking
|
$0.80 | $1.20 | |||
|
|
DeepSeek | deepseek-v3.1-terminus |
deepseek-ai/DeepSeek-V3.1-Terminus
|
$0.27 | $1.00 | |||
|
|
DeepSeek | DeepSeek R1 |
deepseek-ai/DeepSeek-R1
|
$0.50 | $2.18 | |||
|
|
DeepSeek | deepseek-v3.2 |
deepseek-ai/DeepSeek-V3.2
|
$0.29 | $0.43 | |||
|
|
WandB | GLM 5 |
zai-org/GLM-5-FP8
|
$0.65 | $2.08 | |||
|
|
Z.ai | GLM-4.7 |
zai-org/GLM-4.7-FP8
|
$0.60 | $2.20 | |||
|
|
Azure | Llama 4 Maverick 17B 128E Instruct FP8 |
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
|
$0.25 | $0.80 | |||
|
|
Meta | llama-4-scout-17b-16e-instruct |
meta-llama/Llama-4-Scout-17B-16E-Instruct
|
$0.08 | $0.50 |