GMI Cloud icon

GMI Cloud

gmi

Updated 1 hour ago

GMI Cloud is an AI model hosting platform that provides access to leading large language models including Kimi K2.5, Claude (Haiku 4.5, Opus 4.1, Sonnet 4, 3.7 Sonnet), GPT-5.1, Gemini 2.5, Grok 2, and DeepSeek models. The platform offers serverless deployment with transparent pricing per 1M tokens, GPU hardware options (H200), and model metadata including context lengths, quantization (int4, fp8), and provider information. GMI Cloud features an OpenAI-compatible API at api.gmi-serving.com for easy integration.

Browse 55 LLM models available from GMI Cloud. Compare prices and features.

Models (55)

Organization Model Name Original Model Input Output Free
google
google Gemini 3.1 Pro google/gemini-3.1-pro-preview $2.00 $12.00
OpenAI
OpenAI GPT-5.4 openai/gpt-5.4 $2.50 $15.00
OpenAI
OpenAI GPT-5.2 openai/gpt-5.2 $1.75 $14.00
Anthropic
Anthropic Claude Opus 4.6 anthropic/claude-opus-4.6 $5.00 $25.00
Anthropic
Anthropic Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 $3.00 $15.00
qwen
qwen Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B $0.60 $3.60
OpenAI
OpenAI GPT-5.1 Thinking openai/gpt-5.1 $1.25 $10.00
Moonshot AI
Moonshot AI Kimi K2.5 moonshotai/Kimi-K2.5 $0.60 $3.00
Anthropic
Anthropic Claude Opus 4.5 anthropic/claude-opus-4.5 $5.00 $25.00
google
google Gemini 3.1 Flash-Lite google/gemini-3.1-flash-lite-preview $0.25 $1.50
qwen
qwen Qwen3.5-122B-A10B Qwen/Qwen3.5-122B-A10B $0.40 $3.20
OpenAI
OpenAI GPT-5 openai/gpt-5 $1.25 $10.00
qwen
qwen Qwen3.5-27B Qwen/Qwen3.5-27B $0.30 $2.40
qwen
qwen Qwen3.5-35B-A3B Qwen/Qwen3.5-35B-A3B $0.25 $2.00
Anthropic
Anthropic Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 $3.00 $15.00
Minimax
Minimax MiniMax M2.1 MiniMaxAI/MiniMax-M2.1 $0.30 $1.20
Z.ai
Z.ai GLM-4.6 zai-org/GLM-4.6 $0.60 $2.00
DeepSeek
DeepSeek DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 $0.40 $1.80
Anthropic
Anthropic Claude Opus 4.1 anthropic/claude-opus-4.1 $15.00 $75.00
DeepSeek
DeepSeek DeepSeek-V3.2-Exp deepseek-ai/DeepSeek-V3.2-Exp $0.27 $0.41
Minimax
Minimax MiniMax M2 MiniMaxAI/MiniMax-M2 $0.30 $1.20
qwen
qwen Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking $0.15 $1.50
Anthropic
Anthropic Claude Sonnet 4 anthropic/claude-sonnet-4 $3.00 $15.00
Z.ai
Z.ai GLM-4.7-Flash zai-org/GLM-4.7-Flash $0.07 $0.40
Moonshot AI
Moonshot AI Kimi K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 $0.30 $1.70
DeepSeek
DeepSeek DeepSeek-V3.1 deepseek-ai/DeepSeek-V3.1 $0.27 $1.00
Anthropic
Anthropic Claude Haiku 4.5 anthropic/claude-haiku-4.5 $1.00 $5.00
qwen
qwen Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct $0.15 $1.50
OpenAI
OpenAI GPT-4o openai/gpt-4o $2.50 $10.00
DeepSeek
DeepSeek DeepSeek-V3 0324 deepseek-ai/DeepSeek-V3-0324 $0.18 $0.60
qwen
qwen Qwen3 30B A3B Qwen/Qwen3-30B-A3B $0.08 $0.25
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 70B deepseek-ai/DeepSeek-R1-Distill-Llama-70B $0.25 $0.75
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 32B deepseek-ai/DeepSeek-R1-Distill-Qwen-32B $0.50 $0.90
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 14B deepseek-ai/DeepSeek-R1-Distill-Qwen-14B $0.20 $0.20
Meta
Meta Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct $0.25 $0.75
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 7B deepseek-ai/DeepSeek-R1-Distill-Qwen-7B $0.10 $0.20
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 8B deepseek-ai/DeepSeek-R1-Distill-Llama-8B $0.14 $0.39
OpenAI
OpenAI GPT-4o mini openai/gpt-4o-mini $0.15 $0.60
Azure
Azure GPT-5.1 Chat openai/gpt-5.1-chat $1.25 $10.00
Azure
Azure GPT-5.2 Chat openai/gpt-5.2-chat $1.75 $14.00
DeepSeek
DeepSeek DeepSeek-V3.2-Speciale deepseek-ai/DeepSeek-V3.2-Speciale $0.28 $0.40
DeepSeek
DeepSeek deepseek-v3.2 deepseek-ai/DeepSeek-V3.2 $0.20 $0.32
Moonshot AI
Moonshot AI Kimi K2 Thinking moonshotai/Kimi-K2-Thinking $0.80 $1.20
DeepSeek
DeepSeek DeepSeek-R1 deepseek-ai/DeepSeek-R1 $0.50 $2.18
DeepSeek
DeepSeek deepseek-v3.1-terminus deepseek-ai/DeepSeek-V3.1-Terminus $0.27 $1.00
Azure
Azure Llama 4 Maverick 17B 128E Instruct FP8 meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 $0.25 $0.80
Meta
Meta llama-4-scout-17b-16e-instruct meta-llama/Llama-4-Scout-17B-16E-Instruct $0.08 $0.50
Z.ai
Z.ai GLM-4.7 zai-org/GLM-4.7-FP8 $0.33 $1.50
Minimax
Minimax MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 $0.30 $1.20
OpenAI
OpenAI GPT-5.2 Codex openai/gpt-5.2-codex $1.75 $14.00
OpenAI
OpenAI GPT-5.3 Codex openai/gpt-5.3-codex $1.75 $14.00
OpenAI
OpenAI GPT-5.4 Pro openai/gpt-5.4-pro $30.00 $180.00
WandB
WandB GLM 5 zai-org/GLM-5-FP8 $1.00 $3.20
Minimax
Minimax MiniMax-M2.7 MiniMaxAI/MiniMax-M2.7 $0.30 $1.20
ByteDance Seed
ByteDance Seed Seed-2.0-Mini bytedance/seed-2.0-mini $0.10 $0.40