MAI-Voice-2

microsoft • mai-voice-2

Model Information

Slug	mai-voice-2
LLMs.txt	View
Release Date	June 2, 2026

Organization

Name	microsoft
Website	https://ai.azure.com/

Model Description

MAI-Voice-2 is a high-fidelity, expressive text-to-speech model from Microsoft, powered by Azure AI Speech. It synthesizes natural-sounding speech across 10+ languages with support for expressive SSML styles (cheerful, sad, excited, etc.) and speed control (0.5×–2×). Voice names follow the Azure locale format (e.g., en-US-Harper:MAI-Voice-2). Output is available in MP3 and PCM at 24 kHz.

Available at 2 Providers

	Provider	Type	Model Name	Original Model	Input ($/1M)	Output ($/1M)	Free	Actions
	OpenRouter	Chat Code	MAI-Voice-2	`microsoft/mai-voice-2`	$22.00	$0.00		Model
	AIMLAPI		MAI-Voice 2	`microsoft/mai-voice-2`	$28.60	$4.00		Visit

Back to Models View Organization