Weights & Biases
wandb
Updated 11 minutes ago
Weights & Biases (W&B) is an AI/ML platform that provides model training tracking, experiment management, and inference services. Their W&B Inference API offers an OpenAI-compatible interface to access curated open-source language models including DeepSeek, Qwen, Meta Llama, Google Gemma, MiniMax, Moonshot AI (Kimi), NVIDIA Nemotron, Microsoft Phi, and others. The platform is known for its MLOps tooling and has expanded into hosted inference with competitive pricing.
Browse 43 LLM models available from Weights & Biases. Compare prices and features.
Models (43)
| Organization | Model Name | Original Model | Input | Output | Free | |||
|---|---|---|---|---|---|---|---|---|
|
|
DeepSeek | DeepSeek V4 Pro |
DeepSeek-V4-Pro
|
- | - |
|
||
|
|
DeepSeek | DeepSeek V4 Flash |
DeepSeek-V4-Flash
|
- | - |
|
||
|
|
Moonshot AI | Kimi K2.6 |
Kimi-K2.6
|
- | - |
|
||
|
|
Alibaba | Qwen3.6 27B |
Qwen3.6-27B
|
- | - | |||
|
|
Z.ai | GLM-5.1 |
zai-org/GLM-5.1
|
$1.40 | $4.40 |
|
||
|
|
Z.ai | GLM-5.1 |
GLM-5.1
|
- | - |
|
||
|
|
qwen | Qwen3.6 35B A3B |
Qwen3.6-35B-A3B
|
- | - | |||
|
|
Minimax | MiniMax M2.5 |
MiniMaxAI/MiniMax-M2.5
|
$0.30 | $1.20 | |||
|
|
Minimax | MiniMax M2.5 |
MiniMax-M2.5
|
- | - | |||
|
|
qwen | Qwen3.5-27B |
Qwen3.5-27B
|
- | - | |||
|
|
Gemma 4 31B |
gemma-4-31B-it
|
- | - |
|
|||
|
|
qwen | Qwen3.5-35B-A3B |
Qwen3.5-35B-A3B
|
- | - | |||
|
|
Moonshot AI | Kimi K2.5 |
moonshotai/Kimi-K2.5
|
$0.50 | $2.85 |
|
||
|
|
Moonshot AI | Kimi K2.5 |
Kimi-K2.5
|
- | - |
|
||
|
|
OpenAI | GPT OSS 120B |
openai/gpt-oss-120b
|
$0.15 | $0.60 |
|
||
|
|
OpenAI | GPT OSS 120B |
gpt-oss-120b
|
- | - |
|
||
|
|
OpenAI | GPT OSS 20B |
openai/gpt-oss-20b
|
$0.05 | $0.20 | |||
|
|
OpenAI | GPT OSS 20B |
gpt-oss-20b
|
- | - | |||
|
|
qwen | Qwen3-235B-A22B-Thinking-2507 |
Qwen/Qwen3-235B-A22B-Thinking-2507
|
$0.10 | $0.10 | |||
|
|
qwen | Qwen3-235B-A22B-Thinking-2507 |
Qwen3-235B-A22B-Thinking-2507
|
- | - | |||
|
|
qwen | Qwen3-235B-A22B-Instruct-2507 |
Qwen/Qwen3-235B-A22B-Instruct-2507
|
$0.10 | $0.10 | |||
|
|
qwen | Qwen3-235B-A22B-Instruct-2507 |
Qwen3-235B-A22B-Instruct-2507
|
- | - | |||
|
|
qwen | Qwen3-Coder 480B A35B Instruct |
Qwen/Qwen3-Coder-480B-A35B-Instruct
|
$1.00 | $1.50 | |||
|
|
qwen | Qwen3-Coder 480B A35B Instruct |
Qwen3-Coder-480B-A35B-Instruct
|
- | - | |||
|
|
DeepSeek | DeepSeek-V3.1 |
deepseek-ai/DeepSeek-V3.1
|
$0.55 | $1.65 | |||
|
|
DeepSeek | DeepSeek-V3.1 |
DeepSeek-V3.1
|
- | - | |||
|
|
Meta | Llama 3.1 70B Instruct |
meta-llama/Llama-3.1-70B-Instruct
|
$0.80 | $0.80 | |||
|
|
Meta | Llama 3.1 70B Instruct |
Llama-3.1-70B-Instruct
|
- | - | |||
|
|
Meta | Llama 3.3 70B Instruct |
meta-llama/Llama-3.3-70B-Instruct
|
$0.71 | $0.71 | |||
|
|
Meta | Llama 3.3 70B Instruct |
Llama-3.3-70B-Instruct
|
- | - | |||
|
|
Meta | Llama 3.1 8B Instruct |
meta-llama/Llama-3.1-8B-Instruct
|
$0.22 | $0.22 | |||
|
|
Meta | Llama 3.1 8B Instruct |
Llama-3.1-8B-Instruct
|
- | - | |||
|
|
Nvidia | Phi-4-Mini |
microsoft/Phi-4-mini-instruct
|
$0.08 | $0.35 | |||
|
|
Nvidia | Phi-4-Mini |
Phi-4-mini-instruct
|
- | - | |||
|
|
Alibaba | qwen3-30b-a3b-instruct-2507 |
Qwen/Qwen3-30B-A3B-Instruct-2507
|
$0.10 | $0.30 | |||
|
|
Alibaba | qwen3-30b-a3b-instruct-2507 |
Qwen3-30B-A3B-Instruct-2507
|
- | - | |||
|
|
WandB | GLM 5 |
zai-org/GLM-5-FP8
|
$1.00 | $3.20 | |||
|
|
IBM | Granite 4.1 8B |
granite-4.1-8b
|
- | - | |||
|
|
Meta | llama-4-scout-17b-16e-instruct |
meta-llama/Llama-4-Scout-17B-16E-Instruct
|
$0.17 | $0.66 | |||
|
|
WandB | NVIDIA Nemotron 3 Super 120B |
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
|
$0.20 | $0.80 | |||
|
|
WandB | NVIDIA Nemotron 3 Super 120B |
NVIDIA-Nemotron-3-Super-120B-A12B-FP8
|
- | - | |||
|
|
WandB | OpenPipe Qwen3 14B Instruct |
OpenPipe/Qwen3-14B-Instruct
|
$0.05 | $0.22 | |||
|
|
WandB | OpenPipe Qwen3 14B Instruct |
Qwen3-14B-Instruct
|
- | - |