Alibaba

Qwen3-ASR Flash

Alibaba qwen3-asr-flash
Model Information
Slug qwen3-asr-flash
LLMs.txt View
Release Date May 14, 2026
Organization
Model Description
Qwen3-ASR-Flash is Alibaba's automatic speech recognition service, built on the Qwen3-Omni foundation and trained on tens of millions of hours of multimodal speech data. The model handles 11 languages — including Chinese (with Cantonese, Sichuanese, Minnan, and Wu dialects), English, Arabic, French, German, Spanish, Italian, Portuguese, Russian, Japanese, and Korean — with automatic language detection so no manual configuration is needed for mixed-language audio.

The model is designed for difficult acoustic conditions: it transcribes lyrics over background music, handles noisy and far-field recordings, filters silence and non-speech audio, and accepts arbitrary context text (names, jargon, domain terminology) to bias recognition toward specific vocabulary.
Available at 3 Providers
Provider Type Model Name Original Model Input ($/1M) Output ($/1M) Free Actions
Alibaba (China)
Alibaba (China)
Qwen3-ASR Flash
qwen3-asr-flash $0.03 $0.03
Alibaba
Alibaba
Qwen3-ASR Flash
qwen3-asr-flash $0.04 $0.04
OpenRouter
OpenRouter
Chat Code
Qwen3 ASR Flash
qwen/qwen3-asr-flash-2026-02-10 $35.00 $0.00