Ollama Cloud
ollama
Updated 1 hour ago
Ollama is a platform that enables users to run large language models locally on their own hardware, providing complete data privacy and security without requiring cloud services or API keys. It offers an OpenAI-compatible RESTful API for easy integration and supports a wide variety of open-source models including Llama, Olmo, and many others. Ollama enables multimodal support for text chat, PDF integration with RAG (Retrieval Augmented Generation), voice chat, and image-based interactions. The platform eliminates API costs, works offline after initial model download, and is ideal for privacy-sensitive work and local prototyping.
Browse 17 LLM models available from Ollama Cloud. Compare prices and features.
Models (17)
| Organization | Model Name | Original Model | Input | Output | Free | |||
|---|---|---|---|---|---|---|---|---|
|
|
Gemini 3 Flash |
gemini-3-flash-preview
|
- | - |
|
|||
|
|
Moonshot AI | Kimi K2.5 |
kimi-k2.5
|
- | - |
|
||
|
|
Z.ai | GLM-4.7 |
glm-4.7
|
- | - |
|
||
|
|
Z.ai | GLM-4.6 |
glm-4.6
|
- | - | |||
|
|
Minimax | MiniMax M2.1 |
minimax-m2.1
|
- | - | |||
|
|
Minimax | MiniMax M2 |
minimax-m2
|
- | - | |||
|
|
DeepSeek | DeepSeek-V3.1 |
deepseek-v3.1:671b
|
- | - | |||
|
|
Moonshot AI | Kimi K2 0711 (free) |
kimi-k2:1t
|
- | - | |||
|
|
Moonshot AI | Kimi K2 Thinking |
kimi-k2-thinking
|
- | - | |||
|
|
qwen | Qwen3-Coder |
qwen3-coder:480b
|
- | - | |||
|
|
DeepSeek | deepseek-v3.2 |
deepseek-v3.2
|
- | - | |||
|
|
Mistral | mistral-large-3 |
mistral-large-3:675b
|
- | - | |||
|
|
Mistral | devstral-2 |
devstral-2:123b
|
- | - | |||
|
|
qwen | Qwen3 Coder Next |
qwen3-coder-next
|
- | - | |||
|
|
Z.ai | GLM-5 |
glm-5
|
- | - |
|
||
|
|
Minimax | MiniMax M2.5 |
minimax-m2.5
|
- | - |
|
||
|
|
BaseTen | Nemotron 3 Super |
nemotron-3-super
|
- | - |