Qwen3 235B A22B

Alibaba Cloud / Qwen Team qwen3-235b-a22b

Model Information
Slug: qwen3-235b-a22b
Release Date: April 29, 2025
Aliases: qwen3-235b-a22b
Organization
Name: Alibaba Cloud / Qwen Team
Description

Qwen3-235B-A22B is a 235B-parameter mixture-of-experts (MoE) model developed by Qwen that activates 22B parameters per forward pass. It can switch between a "thinking" mode for complex reasoning, math, and code tasks and a "non-thinking" mode for faster general conversation. The model offers strong reasoning, multilingual support (100+ languages and dialects), advanced instruction following, and agent tool calling. It natively handles a 32K-token context window, which extends to 131K tokens with YaRN-based scaling.

Context: 40960 tokens
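
The context figures in the description imply a fixed extension ratio for YaRN. The sketch below works through that arithmetic. It assumes "32K" means 32,768 tokens and "131K" means 131,072 tokens, and the dictionary keys follow a Hugging Face-style `rope_scaling` layout, which is an assumption about how a serving stack might be configured rather than something this listing specifies.

```python
# Assumed exact token counts behind the rounded "32K" and "131K" figures.
NATIVE_CONTEXT = 32_768
EXTENDED_CONTEXT = 131_072


def yarn_factor(native: int, target: int) -> float:
    """Return the context-extension ratio target / native."""
    if native <= 0:
        raise ValueError("native context must be positive")
    if target < native:
        raise ValueError("target context must be at least the native context")
    return target / native


def rope_scaling_config(native: int, target: int) -> dict:
    """Build an illustrative YaRN scaling config (key names are assumed)."""
    return {
        "rope_type": "yarn",
        "factor": yarn_factor(native, target),
        "original_max_position_embeddings": native,
    }


if __name__ == "__main__":
    print(rope_scaling_config(NATIVE_CONTEXT, EXTENDED_CONTEXT))
```

Under these assumptions the ratio is 4.0, meaning the extended window is four times the native one. Whether a given provider enables the extension is not stated here, so check the provider's own context limit before relying on it.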

Available at 9 Providers
Provider             Input Price ($/1M tokens)  Output Price ($/1M tokens)  Free
Nvidia               $0.00                      $0.00
OpenRouter           $0.18                      $0.54
Fireworks AI         $0.22                      $0.88
AIHubMix             $0.28                      $1.12
Alibaba (China)      $0.29                      $1.15
Chutes               $0.30                      $1.20
SiliconFlow (China)  $0.35                      $1.42
SiliconFlow          $0.35                      $1.42
Alibaba              $0.70                      $2.80
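
Because prices are quoted per million tokens, the cost of a single request depends on its mix of input and output tokens. The following is a minimal sketch of that calculation using the prices listed above. The function names `request_cost` and `cheapest_provider` are hypothetical helpers, not part of any provider's API, and the listed prices may change.

```python
# (input $/1M tokens, output $/1M tokens), copied from the provider table.
PRICES = {
    "Nvidia": (0.00, 0.00),
    "OpenRouter": (0.18, 0.54),
    "Fireworks AI": (0.22, 0.88),
    "AIHubMix": (0.28, 1.12),
    "Alibaba (China)": (0.29, 1.15),
    "Chutes": (0.30, 1.20),
    "SiliconFlow (China)": (0.35, 1.42),
    "SiliconFlow": (0.35, 1.42),
    "Alibaba": (0.70, 2.80),
}

TOKENS_PER_UNIT = 1_000_000  # prices are per one million tokens


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the dollar cost of one request at the listed prices."""
    input_price, output_price = PRICES[provider]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_UNIT


def cheapest_provider(input_tokens: int, output_tokens: int,
                      include_free: bool = False) -> str:
    """Return the provider with the lowest cost for this token mix.

    Providers listed at $0.00 for both input and output are skipped
    unless include_free is True.
    """
    candidates = [
        name for name, (inp, out) in PRICES.items()
        if include_free or inp > 0 or out > 0
    ]
    return min(candidates,
               key=lambda name: request_cost(name, input_tokens, output_tokens))


if __name__ == "__main__":
    for name in PRICES:
        print(f"{name}: ${request_cost(name, 10_000, 2_000):.5f}")
    print("Cheapest paid:", cheapest_provider(10_000, 2_000))
```

Output tokens cost three to four times as much as input tokens at every paid provider in the table, so generation-heavy workloads, such as long "thinking" mode responses, are priced mostly by the output rate.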