Qwen3-ASR Flash

Alibaba • qwen3-asr-flash

Model Information

Slug	qwen3-asr-flash
LLMs.txt	View
Release Date	May 14, 2026

Organization

Name	Alibaba

Model Description

Qwen3-ASR-Flash is Alibaba's automatic speech recognition service, built on the Qwen3-Omni foundation and trained on tens of millions of hours of multimodal speech data. The model handles 11 languages — including Chinese (with Cantonese, Sichuanese, Minnan, and Wu dialects), English, Arabic, French, German, Spanish, Italian, Portuguese, Russian, Japanese, and Korean — with automatic language detection so no manual configuration is needed for mixed-language audio.

The model is designed for difficult acoustic conditions: it transcribes lyrics over background music, handles noisy and far-field recordings, filters silence and non-speech audio, and accepts arbitrary context text (names, jargon, domain terminology) to bias recognition toward specific vocabulary.

Available at 4 Providers

Provider	Type	Model Name	Original Model	Input ($/1M)	Output ($/1M)	Actions
ZenMUX	Chat Code	Qwen: Qwen3 ASR Flash	`qwen/qwen3-asr-flash`	$0.00	$0.00	Visit
Alibaba (China)		Qwen3-ASR Flash	`qwen3-asr-flash`	$0.03	$0.03	Visit
Alibaba		Qwen3-ASR Flash	`qwen3-asr-flash`	$0.04	$0.04	Visit
OpenRouter	Chat Code	Qwen3 ASR Flash	`qwen/qwen3-asr-flash-2026-02-10`	$35.00	$0.00	Model

Back to Models View Organization