# MiMo-V2-Omni

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capabilities (visual grounding, multi-step planning, tool use, and code execution), making it well suited to complex real-world tasks that span modalities. It has a 256K context window.

## Model Information

- **Organization**: [Xiaomi](/llm.txt)
- **Slug**: mimo-v2-omni
- **Available at Providers**: 3
- **Release Date**: March 18, 2026

## Providers

| Provider | Name | Input ($ per 1M tokens) | Output ($ per 1M tokens) | Free | Link |
|----------|------|-------------------------|--------------------------|------|------|
| [Kilo Code](/llm/kilocode.txt) | Xiaomi: MiMo-V2-Omni | 0.40 | 2.00 | | [View](https://kilo.ai/models/xiaomi/mimo-v2-omni) |
| [OpenRouter](/llm/openrouter.txt) | MiMo-V2-Omni | 0.40 | 2.00 | | [View](https://openrouter.ai/xiaomi/mimo-v2-omni-20260318) |
| [OpenCode Zen](/llm/opencode.txt) | mimo-v2-omni | 0.00 | 0.00 | Yes | |
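
As a rough illustration of how the listed prices translate into a per-request cost, the sketch below multiplies token counts by the rates from the Providers table. It assumes the prices are quoted per 1M tokens and that billing is linear in token count; the `estimate_cost` helper and the example token counts are hypothetical and not part of any provider's API.

```python
# Prices in USD as (input, output), copied from the Providers table.
# Assumption: rates are per 1M tokens and billing is linear in token count.
PRICES_PER_MILLION = {
    "Kilo Code": (0.40, 2.00),
    "OpenRouter": (0.40, 2.00),
    "OpenCode Zen": (0.00, 0.00),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    input_price, output_price = PRICES_PER_MILLION[provider]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 200,000 input tokens and 10,000 output tokens.
    for name in PRICES_PER_MILLION:
        print(f"{name}: ${estimate_cost(name, 200_000, 10_000):.4f}")
```

Actual charges may differ from this estimate, for example if a provider prices image, video, or audio inputs separately, so the figures are best treated as a lower-bound approximation for text-only usage.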