# MiMo-V2-Omni

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step planning, tool use, and code execution - making it well-suited for complex real-world tasks that span modalities. 256K context window.

## Model Information

- **Organization**: [Xiaomi](/llm.txt)
- **Slug**: mimo-v2-omni
- **Available at Providers**: 3
- **Release Date**: March 18, 2026

## Providers

| Provider | Name | $ Input (per 1M) | $ Output (per 1M) | Free | Link |
|----------|------|-----------------|------------------|------|------|
| [Kilo Code](/llm/kilocode.txt) | Xiaomi: MiMo-V2-Omni | 0.40 | 2.00 |  | [View](https://kilo.ai/models/xiaomi/mimo-v2-omni) |
| [OpenRouter](/llm/openrouter.txt) | MiMo-V2-Omni | 0.40 | 2.00 |  | [View](https://openrouter.ai/xiaomi/mimo-v2-omni-20260318) |
| [OpenCode Zen](/llm/opencode.txt) | mimo-v2-omni | 0.00 | 0.00 | Yes |  |

---

[← Back to all providers](/llm.txt)