GMI Cloud icon

GMI Cloud

gmi

Updated 47 minutes ago

GMI Cloud is an AI model hosting platform that provides access to leading large language models including Kimi K2.5, Claude (Haiku 4.5, Opus 4.1, Sonnet 4, 3.7 Sonnet), GPT-5.1, Gemini 2.5, Grok 2, and DeepSeek models. The platform offers serverless deployment with transparent pricing per 1M tokens, GPU hardware options (H200), and model metadata including context lengths, quantization (int4, fp8), and provider information. GMI Cloud features an OpenAI-compatible API at api.gmi-serving.com for easy integration.

Browse 69 LLM models available from GMI Cloud. Compare prices and features.

Models (69)

Organization Model Name Original Model Input Output Free
OpenAI
OpenAI GPT-5.5 openai/gpt-5.5 $3.50 $21.00
DeepSeek
DeepSeek DeepSeek-V4-Pro-Max deepseek-ai/DeepSeek-V4-Pro $1.39 $2.78
DeepSeek
DeepSeek DeepSeek V4 Flash deepseek-ai/DeepSeek-V4-Flash $0.11 $0.22
Anthropic
Anthropic Claude Opus 4.7 anthropic/claude-opus-4.7 $4.50 $22.50
Moonshot AI
Moonshot AI Kimi K2.6 moonshotai/Kimi-K2.6 $0.86 $3.60
Minimax
Minimax MiniMax M2.7 MiniMaxAI/MiniMax-M2.7 $0.30 $1.20
qwen
qwen Qwen3.6 Plus Qwen/Qwen3.6-Plus $0.50 $3.00
qwen
qwen Qwen3.6 Plus Qwen/Qwen3.6-Plus-2026-04-02 $0.50 $3.00
OpenAI
OpenAI GPT-5.4 openai/gpt-5.4 $2.50 $15.00
Xiaomi
Xiaomi MiMo-V2.5-Pro XiaomiMiMo/MiMo-V2.5-Pro $0.80 $2.40
google
google Gemma 4 31B google/gemma-4-31b-it $0.14 $0.40
google
google Gemma 4 26B-A4B google/gemma-4-26b-a4b-it $0.13 $0.40
google
google Gemini 3.1 Pro google/gemini-3.1-pro-preview $2.00 $12.00
OpenAI
OpenAI GPT-5.4 mini openai/gpt-5.4-mini $0.75 $4.50
OpenAI
OpenAI GPT-5.4 nano openai/gpt-5.4-nano $0.20 $1.25
OpenAI
OpenAI GPT-5.4 Pro openai/gpt-5.4-pro $30.00 $180.00
Anthropic
Anthropic Claude Opus 4.6 anthropic/claude-opus-4.6 $5.00 $25.00
qwen
qwen Qwen3.5-27B Qwen/Qwen3.5-27B $0.30 $2.40
Anthropic
Anthropic Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 $3.00 $15.00
qwen
qwen Qwen3.5-122B-A10B Qwen/Qwen3.5-122B-A10B $0.40 $3.20
Minimax
Minimax MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 $0.30 $1.20
qwen
qwen Qwen3.5-35B-A3B Qwen/Qwen3.5-35B-A3B $0.25 $2.00
google
google Gemini 3.1 Flash-Lite google/gemini-3.1-flash-lite-preview $0.25 $1.50
qwen
qwen Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B $0.60 $3.60
Moonshot AI
Moonshot AI Kimi K2.5 moonshotai/Kimi-K2.5 $0.60 $3.00
OpenAI
OpenAI GPT-5.2 openai/gpt-5.2 $1.75 $14.00
OpenAI
OpenAI GPT-5.3 Codex openai/gpt-5.3-codex $1.75 $14.00
Minimax
Minimax MiniMax M2.1 MiniMaxAI/MiniMax-M2.1 $0.30 $1.20
DeepSeek
DeepSeek DeepSeek-V3.2-Speciale deepseek-ai/DeepSeek-V3.2-Speciale $0.28 $0.40
Anthropic
Anthropic Claude Opus 4.5 anthropic/claude-opus-4.5 $5.00 $25.00
OpenAI
OpenAI GPT-5.1 Thinking openai/gpt-5.1 $1.25 $10.00
Z.ai
Z.ai GLM-4.7-Flash zai-org/GLM-4.7-Flash $0.07 $0.40
Anthropic
Anthropic Claude 4.5 Haiku anthropic/claude-haiku-4.5 $1.00 $5.00
Anthropic
Anthropic Claude 4.5 Sonnet anthropic/claude-sonnet-4.5 $3.00 $15.00
Minimax
Minimax MiniMax M2 MiniMaxAI/MiniMax-M2 $0.30 $1.20
Z.ai
Z.ai GLM-4.6 zai-org/GLM-4.6 $0.60 $2.00
DeepSeek
DeepSeek DeepSeek-V3.2-Exp deepseek-ai/DeepSeek-V3.2-Exp $0.27 $0.41
OpenAI
OpenAI GPT-5 openai/gpt-5 $1.25 $10.00
qwen
qwen Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking $0.15 $1.50
Anthropic
Anthropic Claude 4.1 Opus anthropic/claude-opus-4.1 $15.00 $75.00
Anthropic
Anthropic Claude 4 Sonnet anthropic/claude-sonnet-4 $3.00 $15.00
DeepSeek
DeepSeek DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 $0.57 $2.29
Moonshot AI
Moonshot AI Kimi K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 $0.57 $2.29
qwen
qwen Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct $0.15 $1.50
qwen
qwen Qwen3 30B A3B Qwen/Qwen3-30B-A3B $0.08 $0.25
DeepSeek
DeepSeek DeepSeek-V3.1 deepseek-ai/DeepSeek-V3.1 $0.27 $1.00
OpenAI
OpenAI GPT-4o mini openai/gpt-4o-mini $0.15 $0.60
DeepSeek
DeepSeek DeepSeek-V3 0324 deepseek-ai/DeepSeek-V3-0324 $0.29 $1.14
OpenAI
OpenAI GPT-4o openai/gpt-4o $2.50 $10.00
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 70B deepseek-ai/DeepSeek-R1-Distill-Llama-70B $0.25 $0.75
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 32B deepseek-ai/DeepSeek-R1-Distill-Qwen-32B $0.50 $0.90
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 14B deepseek-ai/DeepSeek-R1-Distill-Qwen-14B $0.20 $0.20
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 7B deepseek-ai/DeepSeek-R1-Distill-Qwen-7B $0.10 $0.20
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 8B deepseek-ai/DeepSeek-R1-Distill-Llama-8B $0.14 $0.39
Meta
Meta Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct $0.25 $0.75
qwen
qwen Qwen: Qwen3.6 Max Preview Qwen/Qwen3.6-Max-Preview $1.30 $7.80
Xiaomi
Xiaomi MiMo-V2.5 XiaomiMiMo/MiMo-V2.5 $0.32 $1.60
ByteDance Seed
ByteDance Seed Seed-2.0-Mini bytedance/seed-2.0-mini $0.10 $0.40
OpenAI
OpenAI GPT-5.2 Codex openai/gpt-5.2-codex $1.75 $14.00
Azure
Azure GPT-5.2 Chat openai/gpt-5.2-chat $1.75 $14.00
Azure
Azure GPT-5.1 Chat openai/gpt-5.1-chat $1.25 $10.00
Moonshot AI
Moonshot AI Kimi K2 Thinking moonshotai/Kimi-K2-Thinking $0.80 $1.20
DeepSeek
DeepSeek deepseek-v3.1-terminus deepseek-ai/DeepSeek-V3.1-Terminus $0.27 $1.00
DeepSeek
DeepSeek DeepSeek R1 deepseek-ai/DeepSeek-R1 $0.50 $2.18
DeepSeek
DeepSeek deepseek-v3.2 deepseek-ai/DeepSeek-V3.2 $0.29 $0.43
WandB
WandB GLM 5 zai-org/GLM-5-FP8 $0.65 $2.08
Z.ai
Z.ai GLM-4.7 zai-org/GLM-4.7-FP8 $0.60 $2.20
Azure
Azure Llama 4 Maverick 17B 128E Instruct FP8 meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 $0.25 $0.80
Meta
Meta llama-4-scout-17b-16e-instruct meta-llama/Llama-4-Scout-17B-16E-Instruct $0.08 $0.50