MAI-Voice-2

Model Information
Slug mai-voice-2
Release Date June 2, 2026
Organization Microsoft
Model Description
MAI-Voice-2 is a high-fidelity, expressive text-to-speech model from Microsoft, powered by Azure AI Speech. It synthesizes natural-sounding speech in more than 10 languages, supports expressive SSML styles such as cheerful, sad, and excited, and offers speed control from 0.5× to 2×. Voice names follow the Azure locale format (e.g., en-US-Harper:MAI-Voice-2). Output is available in MP3 and PCM at 24 kHz.
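As an illustration of how the voice name, style, and speed settings described here could fit together, the following is a minimal Python sketch that builds an SSML string. It assumes MAI-Voice-2 accepts the standard Azure AI Speech SSML elements (`voice`, `mstts:express-as`, `prosody`), which this listing does not confirm. The `build_ssml` helper and its parameters are hypothetical names introduced for the example, and the mapping from a speed multiplier to a percentage `rate` is one possible encoding.

```python
from xml.sax.saxutils import escape, quoteattr

# Speed range taken from the model description (0.5x to 2x).
MIN_SPEED, MAX_SPEED = 0.5, 2.0


def build_ssml(text, voice="en-US-Harper:MAI-Voice-2", style=None, speed=1.0):
    """Hypothetical helper: wrap text in Azure-style SSML.

    Assumes the model accepts Azure AI Speech SSML; not confirmed by the listing.
    """
    if not MIN_SPEED <= speed <= MAX_SPEED:
        raise ValueError(f"speed must be between {MIN_SPEED} and {MAX_SPEED}")

    # The locale is the first two hyphen-separated parts of the voice name,
    # e.g. "en-US" in "en-US-Harper:MAI-Voice-2".
    locale = "-".join(voice.split("-")[:2])

    body = escape(text)  # escape &, <, > so the text is valid XML
    if speed != 1.0:
        # Express the multiplier as a signed percentage, e.g. 1.5 -> "+50%".
        rate = f"{round((speed - 1.0) * 100):+d}%"
        body = f'<prosody rate="{rate}">{body}</prosody>'
    if style is not None:
        body = f"<mstts:express-as style={quoteattr(style)}>{body}</mstts:express-as>"

    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        'xmlns:mstts="https://www.w3.org/2001/mstts" '
        f"xml:lang={quoteattr(locale)}>"
        f"<voice name={quoteattr(voice)}>{body}</voice></speak>"
    )


print(build_ssml("Welcome back!", style="cheerful", speed=1.5))
```

The resulting string would be sent as the synthesis input through whichever provider API is used; the request format itself depends on the provider and is not shown here.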
Available at 2 Providers
Provider    Type       Model Name   Original Model         Input ($/1M)  Output ($/1M)
OpenRouter  Chat Code  MAI-Voice-2  microsoft/mai-voice-2  $22.00        $0.00
AIMLAPI     -          MAI-Voice 2  microsoft/mai-voice-2  $28.60        $0.00
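
To compare the two providers for a given workload, the listed input prices can be scaled linearly. The sketch below assumes that "$/1M" means dollars per one million input units; the listing does not say whether a unit is a character or a token, so `input_units` is deliberately generic. The `estimate_cost` function is a hypothetical helper, not part of any provider SDK.

```python
# Input prices copied from the provider listing, in dollars per 1M input units.
# Assumption: billing is linear in input size; the unit (characters or tokens)
# is not stated in the listing.
INPUT_PRICE_PER_MILLION = {
    "OpenRouter": 22.00,
    "AIMLAPI": 28.60,
}


def estimate_cost(provider, input_units):
    """Return the estimated input cost in dollars for one provider."""
    return INPUT_PRICE_PER_MILLION[provider] * input_units / 1_000_000


for provider in INPUT_PRICE_PER_MILLION:
    print(f"{provider}: ${estimate_cost(provider, 50_000):.2f}")
```

Under these assumptions, 50,000 input units come to about $1.10 on OpenRouter and $1.43 on AIMLAPI, a difference of 30%. The listed output price is $0.00 for both providers, so only the input side contributes to the estimate.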