google

Gemini 3.1 Flash TTS Preview

google gemini-3-1-flash-tts
Model Information
Slug gemini-3-1-flash-tts
LLM.txt View
Release Date April 24, 2026 New
Organization
Model Description
Gemini 3.1 Flash TTS Preview is a text-to-speech model from Google, and a substantial generational step up from Gemini 2.5 Flash TTS. It takes text input and produces audio output across 70+ languages — nearly 3× the language coverage of its predecessor.

The headline addition is a system of 200+ inline audio tags (e.g. `[whispers]`, `[laughs]`, `[excited]`) that let developers steer delivery, emotion, and pacing mid-sentence, alongside a "director's chair" workflow in Google AI Studio for defining per-character Audio Profiles and scene-level context. It supports up to two speakers with independent voice and style configuration per speaker, outputs PCM audio at 24 kHz / 16-bit mono, and automatically watermarks all output with SynthID. Context window is 32k tokens.
Available at 4 Providers
Provider Type Model Name Original Model Input ($/1M) Output ($/1M) Free Actions
Nous Research
Nous Research
Google: Gemini 3.1 Flash TTS Preview
google/gemini-3.1-flash-tts-preview $1.00 $20.00
OpenRouter
OpenRouter
Chat Code
Gemini 3.1 Flash TTS Preview
google/gemini-3.1-flash-tts-preview $1.00 $20.00
Poe
Poe
Gemini-3.1-Flash-TTS
gemini-3.1-flash-tts - -
Runware
Runware
Gemini 3.1 Flash TTS
google-gemini-3-1-flash-tts - -