Nebius Token Factory
nebius
Updated 28 minutes ago
Nebius is a cloud platform that provides access to AI models through its Token Factory inference service. The platform offers a wide range of open-source and proprietary models, including DeepSeek, MiniMax, Kimi, Qwen, and others. Nebius focuses on fast, cost-effective AI inference, with prices quoted per 1M tokens and several quantization options (fp4, fp8) for trading off performance against cost.
Browse 22 LLM models available from Nebius Token Factory. Compare prices and features.
Models (22)
| Organization | Model Name | Original Model | Input (per 1M tokens) | Output (per 1M tokens) | Free |
|---|---|---|---|---|---|
| DeepSeek | DeepSeek V4 Pro | deepseek-ai/DeepSeek-V4-Pro | $1.75 | $3.50 | |
| Moonshot AI | Kimi K2.6 | moonshotai/Kimi-K2.6 | $0.95 | $4.00 | |
| Z.ai | GLM-5.1 | zai-org/GLM-5.1 | $1.40 | $4.40 | |
| Nvidia | Nemotron 3 Super (120B A12B) | nvidia/nemotron-3-super-120b-a12b | $0.30 | $0.90 | |
| Minimax | MiniMax M2.5 | MiniMaxAI/MiniMax-M2.5 | $0.30 | $1.20 | |
| qwen | Qwen3.5-397B-A17B | Qwen/Qwen3.5-397B-A17B | $0.60 | $3.60 | |
| Z.ai | GLM-5 | zai-org/GLM-5 | $1.00 | $3.20 | |
| qwen | Qwen3-Next-80B-A3B-Thinking | Qwen/Qwen3-Next-80B-A3B-Thinking | $0.15 | $1.20 | |
| OpenAI | GPT OSS 120B | openai/gpt-oss-120b | $0.15 | $0.60 | |
| qwen | Qwen3 32B | Qwen/Qwen3-32B | $0.10 | $0.30 | |
| qwen | Qwen3-235B-A22B-Instruct-2507 | Qwen/Qwen3-235B-A22B-Instruct-2507 | $0.20 | $0.60 | |
| Nvidia | Llama 3.1 Nemotron Ultra 253B v1 | nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 | $0.60 | $1.80 | |
| | Gemma 3 27B | google/gemma-3-27b-it | $0.10 | $0.30 | |
| Meta | Llama 3.3 70B Instruct | meta-llama/Llama-3.3-70B-Instruct | $0.13 | $0.40 | |
| qwen | Qwen3 Embedding 8B | Qwen/Qwen3-Embedding-8B | $0.01 | $0.00 | |
| Nebius | Hermes 4 70B | NousResearch/Hermes-4-70B | $0.13 | $0.40 | |
| Nebius | Hermes-4 405B | NousResearch/Hermes-4-405B | $1.00 | $3.00 | |
| Alibaba | qwen3-30b-a3b-instruct-2507 | Qwen/Qwen3-30B-A3B-Instruct-2507 | $0.10 | $0.30 | |
| Alibaba | qwen2.5-vl-72b-instruct | Qwen/Qwen2.5-VL-72B-Instruct | $0.25 | $0.75 | |
| DeepSeek | deepseek-v3.2 | deepseek-ai/DeepSeek-V3.2 | $0.30 | $0.45 | |
| DigitalOcean | Nemotron Nano 3 Omni | nvidia/Nemotron-3-Nano-Omni | $0.06 | $0.24 | |
| Nvidia | NVIDIA Nemotron 3 Nano | nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B | $0.06 | $0.24 | |
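
Because the prices listed above are quoted per 1M tokens, a rough cost estimate for a workload is simple arithmetic. The sketch below is illustrative only: `PRICES` and `estimate_cost` are hypothetical names, only three models are copied in, and it assumes the figures are USD and that billing scales linearly with token count (no caching discounts, minimums, or other adjustments).

```python
from decimal import Decimal

# Input and output prices per 1M tokens, copied from the listing above.
# Assumption: amounts are in USD.
PRICES = {
    "deepseek-ai/DeepSeek-V4-Pro": (Decimal("1.75"), Decimal("3.50")),
    "openai/gpt-oss-120b": (Decimal("0.15"), Decimal("0.60")),
    "Qwen/Qwen3-32B": (Decimal("0.10"), Decimal("0.30")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of a workload, assuming strictly linear per-token billing."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # 2M input tokens and 500K output tokens on GPT OSS 120B.
    print(estimate_cost("openai/gpt-oss-120b", 2_000_000, 500_000))
```

`Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding noise when summed over many requests.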