Qwen: Qwen3 8B

Alibaba Cloud / Qwen Team qwen3-8b

Model Information
Slug qwen3-8b
Aliases qwen3-8b
Organization
Name Alibaba Cloud / Qwen Team
Description

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling. Context: 128000

Available at 7 Providers
Provider Input Price ($/1M) Output Price ($/1M) Free
OpenRouter $0.04 $0.14
SiliconFlow (China) $0.06 $0.06
SiliconFlow $0.06 $0.06
Alibaba (China) $0.07 $0.29
AIHubMix $0.08 $0.80
Alibaba $0.18 $0.70
Fireworks AI $0.20 $0.20