GMI Cloud icon

GMI Cloud

gmi

Updated 12 minutes ago

GMI Cloud is an AI model hosting platform that provides access to leading large language models including Kimi K2.5, Claude (Haiku 4.5, Opus 4.1, Sonnet 4, 3.7 Sonnet), GPT-5.1, Gemini 2.5, Grok 2, and DeepSeek models. The platform offers serverless deployment with transparent pricing per 1M tokens, GPU hardware options (H200), and model metadata including context lengths, quantization (int4, fp8), and provider information. GMI Cloud features an OpenAI-compatible API at api.gmi-serving.com for easy integration.

Browse 78 LLM models available from GMI Cloud. Compare prices and features.

Models (78)

Organization Model Name Original Model Input Output Free
Anthropic
Anthropic Claude Opus 4.8 anthropic/claude-opus-4.8 $5.00 $25.00
Minimax
Minimax MiniMax M3 MiniMaxAI/MiniMax-M3 $0.60 $2.40
qwen
qwen Qwen3.7 Max Qwen/Qwen3.7-Max $2.50 $7.50
Moonshot AI
Moonshot AI Kimi K2.7 Code moonshotai/kimi-k2.7-code $0.95 $4.00
OpenAI
OpenAI GPT-5.5 openai/gpt-5.5 $5.00 $30.00
Xiaomi
Xiaomi MiMo-V2.5 XiaomiMiMo/MiMo-V2.5 $0.32 $1.60
Nvidia
Nvidia Nemotron 3 Ultra (550B A55B) nvidia/nemotron-3-ultra-550b-a55b $0.80 $2.60
DeepSeek
DeepSeek DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro $1.39 $2.78
google
google Gemini 3.5 Flash google/gemini-3.5-flash $1.50 $9.00
Anthropic
Anthropic Claude Opus 4.7 anthropic/claude-opus-4.7 $4.50 $22.50
DeepSeek
DeepSeek DeepSeek V4 Flash deepseek-ai/DeepSeek-V4-Flash $0.11 $0.22
Moonshot AI
Moonshot AI Kimi K2.6 moonshotai/Kimi-K2.6 $0.86 $3.60
Xiaomi
Xiaomi MiMo-V2.5-Pro XiaomiMiMo/MiMo-V2.5-Pro $0.80 $2.40
Minimax
Minimax MiniMax M2.7 MiniMaxAI/MiniMax-M2.7 $0.30 $1.20
qwen
qwen Qwen3.6 Plus Qwen/Qwen3.6-Plus $0.50 $3.00
qwen
qwen Qwen3.6 Plus Qwen/Qwen3.6-Plus-2026-04-02 $0.50 $3.00
OpenAI
OpenAI GPT-5.4 openai/gpt-5.4 $2.50 $15.00
google
google Gemini 3.1 Pro google/gemini-3.1-pro-preview $2.00 $12.00
qwen
qwen Qwen3.6 35B A3B Qwen/Qwen3.6-35B-A3B $0.25 $1.49
OpenAI
OpenAI GPT-5.4 mini openai/gpt-5.4-mini $0.75 $4.50
google
google Gemma 4 31B google/gemma-4-31b-it $0.14 $0.40
Anthropic
Anthropic Claude Opus 4.6 anthropic/claude-opus-4.6 $5.00 $25.00
OpenAI
OpenAI GPT-5.4 nano openai/gpt-5.4-nano $0.20 $1.25
Anthropic
Anthropic Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 $3.00 $15.00
google
google Gemma 4 26B-A4B google/gemma-4-26b-a4b-it $0.13 $0.40
Minimax
Minimax MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 $0.30 $1.20
qwen
qwen Qwen3.5-122B-A10B Qwen/Qwen3.5-122B-A10B $0.40 $3.20
qwen
qwen Qwen3.5-27B Qwen/Qwen3.5-27B $0.30 $2.40
qwen
qwen Qwen3.5-35B-A3B Qwen/Qwen3.5-35B-A3B $0.25 $2.00
Moonshot AI
Moonshot AI Kimi K2.5 moonshotai/Kimi-K2.5 $0.60 $3.00
qwen
qwen Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B $0.60 $3.60
google
google Gemini 3.1 Flash-Lite google/gemini-3.1-flash-lite-preview $0.25 $1.50
OpenAI
OpenAI GPT-5.2 openai/gpt-5.2 $1.75 $14.00
Anthropic
Anthropic Claude Opus 4.5 anthropic/claude-opus-4.5 $5.00 $25.00
DeepSeek
DeepSeek DeepSeek-V3.2-Speciale deepseek-ai/DeepSeek-V3.2-Speciale $0.28 $0.40
Minimax
Minimax MiniMax M2.1 MiniMaxAI/MiniMax-M2.1 $0.30 $1.20
OpenAI
OpenAI GPT-5.1 Thinking openai/gpt-5.1 $1.25 $10.00
DeepSeek
DeepSeek DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 $0.29 $0.43
Z.ai
Z.ai GLM-4.7-Flash zai-org/GLM-4.7-Flash $0.07 $0.40
Anthropic
Anthropic Claude 4.5 Sonnet anthropic/claude-sonnet-4.5 $3.00 $15.00
Anthropic
Anthropic Claude 4.5 Haiku anthropic/claude-haiku-4.5 $1.00 $5.00
Minimax
Minimax MiniMax M2 MiniMaxAI/MiniMax-M2 $0.30 $1.20
Z.ai
Z.ai GLM-4.6 zai-org/GLM-4.6 $0.60 $2.00
DeepSeek
DeepSeek DeepSeek-V3.2-Exp deepseek-ai/DeepSeek-V3.2-Exp $0.27 $0.41
OpenAI
OpenAI GPT-5 openai/gpt-5 $1.25 $10.00
Anthropic
Anthropic Claude 4.1 Opus anthropic/claude-opus-4.1 $15.00 $75.00
qwen
qwen Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking $0.15 $1.50
Anthropic
Anthropic Claude 4 Sonnet anthropic/claude-sonnet-4 $3.00 $15.00
qwen
qwen Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct $0.15 $1.50
DeepSeek
DeepSeek DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 $0.57 $2.29
Moonshot AI
Moonshot AI Kimi K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 $0.57 $2.29
qwen
qwen Qwen3 30B A3B Qwen/Qwen3-30B-A3B $0.08 $0.25
DeepSeek
DeepSeek DeepSeek-V3.1 deepseek-ai/DeepSeek-V3.1 $0.27 $1.00
DeepSeek
DeepSeek DeepSeek-V3 0324 deepseek-ai/DeepSeek-V3-0324 $0.29 $1.14
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 70B deepseek-ai/DeepSeek-R1-Distill-Llama-70B $0.25 $0.75
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 32B deepseek-ai/DeepSeek-R1-Distill-Qwen-32B $0.50 $0.90
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 14B deepseek-ai/DeepSeek-R1-Distill-Qwen-14B $0.20 $0.20
OpenAI
OpenAI GPT-4o openai/gpt-4o $2.50 $10.00
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 7B deepseek-ai/DeepSeek-R1-Distill-Qwen-7B $0.10 $0.20
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 8B deepseek-ai/DeepSeek-R1-Distill-Llama-8B $0.14 $0.39
OpenAI
OpenAI GPT-4o mini openai/gpt-4o-mini $0.15 $0.60
Meta
Meta Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct $0.25 $0.75
qwen
qwen Qwen: Qwen3.6 Max Preview Qwen/Qwen3.6-Max-Preview $1.30 $7.80
Tencent
Tencent Hy3-preview tencent/hy3-preview $0.18 $0.60
OpenAI
OpenAI GPT-5.4 Pro openai/gpt-5.4-pro $30.00 $180.00
ByteDance Seed
ByteDance Seed Seed-2.0-Mini bytedance/seed-2.0-mini $0.10 $0.40
OpenAI
OpenAI GPT-5.3 Codex openai/gpt-5.3-codex $1.75 $14.00
OpenAI
OpenAI GPT-5.2 Codex openai/gpt-5.2-codex $1.75 $14.00
Azure
Azure GPT-5.2 Chat openai/gpt-5.2-chat $1.75 $14.00
Azure
Azure GPT-5.1 Chat openai/gpt-5.1-chat $1.25 $10.00
Moonshot AI
Moonshot AI Kimi K2 Thinking moonshotai/Kimi-K2-Thinking $0.80 $1.20
DeepSeek
DeepSeek DeepSeek V3.1 Terminus deepseek-ai/DeepSeek-V3.1-Terminus $0.27 $1.00
DeepSeek
DeepSeek DeepSeek R1 deepseek-ai/DeepSeek-R1 $0.50 $2.18
WandB
WandB GLM 5 zai-org/GLM-5-FP8 $0.60 $1.92
Z.ai
Z.ai GLM-4.7 zai-org/GLM-4.7-FP8 $0.60 $2.20
Ambient GLM-5.1 zai-org/GLM-5.1-FP8 $0.98 $3.08
Azure
Azure Llama 4 Maverick 17B 128E Instruct FP8 meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 $0.25 $0.80
Meta
Meta llama-4-scout-17b-16e-instruct meta-llama/Llama-4-Scout-17B-16E-Instruct $0.08 $0.50