Qwen3 32B

Alibaba Cloud / Qwen Team qwen3-32b

Model Information
Slug: qwen3-32b
Release Date: April 29, 2025
Aliases: qwen3-32b
Organization: Alibaba Cloud / Qwen Team
Description

Qwen3-32B is a dense, 32.8B-parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It can switch between a "thinking" mode for tasks such as math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model performs strongly in instruction following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K-token contexts and can extend to 131K tokens using YaRN-based scaling.

Context: 40960
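The jump from a 32K native context to 131K with YaRN comes down to a single scaling factor. The sketch below computes that factor and packs it into a dictionary. It assumes "32K" and "131K" mean 32,768 and 131,072 tokens, and that the serving stack accepts a Hugging Face-style `rope_scaling` entry with the keys shown. The function name `yarn_rope_scaling` is hypothetical, and the exact key names should be checked against the framework in use.

```python
def yarn_rope_scaling(native_ctx: int, target_ctx: int) -> dict:
    """Build a YaRN-style rope scaling entry for extending the context window.

    The key names below are an assumption modeled on Hugging Face-style
    configs; verify them against the inference framework you actually use.
    """
    if target_ctx <= native_ctx:
        raise ValueError("target context must exceed the native context")
    # YaRN stretches positions by the ratio of target to native length.
    factor = target_ctx / native_ctx
    return {
        "rope_type": "yarn",
        "factor": factor,
        "original_max_position_embeddings": native_ctx,
    }


# Assumed token counts: 32K = 32,768 and 131K = 131,072.
config = yarn_rope_scaling(32_768, 131_072)
print(config)
```

Under these assumptions the factor is 4.0. A smaller target window gives a smaller factor, which may be preferable when inputs rarely approach the full 131K tokens.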

Available at 13 Providers
Provider                 Input Price ($/1M)   Output Price ($/1M)   Free
iFlow                    $0.00                $0.00
Chutes                   $0.08                $0.24
OpenRouter               $0.08                $0.24
OVHcloud AI Endpoints    $0.09                $0.25
Cortecs                  $0.10                $0.33
SiliconFlow (China)      $0.14                $0.57
SiliconFlow              $0.14                $0.57
Alibaba (China)          $0.29                $1.15
Helicone                 $0.29                $0.59
AIHubMix                 $0.32                $3.20
Alibaba                  $0.70                $2.80
Fireworks AI             $0.90                $0.90
Friendli                 -                    -
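
Because input and output are priced separately, the cheapest provider depends on the ratio of prompt tokens to generated tokens. The following sketch estimates the cost of a request from a few of the prices listed above. It assumes "$/1M" means US dollars per one million tokens; the names `PRICES` and `estimate_cost` are hypothetical, and providers without a listed price are marked `None`.

```python
# (input, output) prices in USD per 1M tokens, copied from the table above.
# None marks a provider whose price is not listed.
PRICES = {
    "iFlow": (0.00, 0.00),
    "OpenRouter": (0.08, 0.24),
    "Helicone": (0.29, 0.59),
    "AIHubMix": (0.32, 3.20),
    "Fireworks AI": (0.90, 0.90),
    "Friendli": None,
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed prices."""
    prices = PRICES[provider]
    if prices is None:
        raise ValueError(f"no listed price for {provider}")
    input_price, output_price = prices
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Compare two providers on an output-heavy request (2K in, 8K out).
for name in ("Helicone", "AIHubMix"):
    print(name, f"${estimate_cost(name, 2_000, 8_000):.4f}")
```

Helicone and AIHubMix have similar input prices, but their output prices differ by more than a factor of five, so the gap widens for the long responses typical of thinking mode.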