Model Information
| Slug | qwen3-asr-flash |
|---|---|
| LLMs.txt | View |
| Release Date | May 14, 2026 |
Organization
| Name | Alibaba |
|---|
Model Description
Qwen3-ASR-Flash is Alibaba's automatic speech recognition service, built on the Qwen3-Omni foundation and trained on tens of millions of hours of multimodal speech data. The model handles 11 languages — including Chinese (with Cantonese, Sichuanese, Minnan, and Wu dialects), English, Arabic, French, German, Spanish, Italian, Portuguese, Russian, Japanese, and Korean — with automatic language detection so no manual configuration is needed for mixed-language audio.
The model is designed for difficult acoustic conditions: it transcribes lyrics over background music, handles noisy and far-field recordings, filters silence and non-speech audio, and accepts arbitrary context text (names, jargon, domain terminology) to bias recognition toward specific vocabulary.
The model is designed for difficult acoustic conditions: it transcribes lyrics over background music, handles noisy and far-field recordings, filters silence and non-speech audio, and accepts arbitrary context text (names, jargon, domain terminology) to bias recognition toward specific vocabulary.
Available at 3 Providers
| Provider | Type | Model Name | Original Model | Input ($/1M) | Output ($/1M) | Free | Actions | |
|---|---|---|---|---|---|---|---|---|
|
|
Alibaba (China) |
Qwen3-ASR Flash
|
qwen3-asr-flash
|
$0.03 | $0.03 | |||
|
|
Alibaba |
Qwen3-ASR Flash
|
qwen3-asr-flash
|
$0.04 | $0.04 | |||
|
|
OpenRouter |
Chat
Code
|
Qwen3 ASR Flash
|
qwen/qwen3-asr-flash-2026-02-10
|
$35.00 | $0.00 |