Nebius Token Factory

Updated 1 hour ago

Nebius is a cloud platform that provides access to AI models through its Token Factory inference service. The platform offers a wide range of open-source and proprietary models, including DeepSeek, MiniMax, Kimi, and Qwen. Nebius focuses on fast, cost-effective inference: models are priced per 1M tokens, with separate input and output rates, and quantization options (fp4, fp8) are available to trade off performance against cost.

Browse 23 LLM models available from Nebius Token Factory. Compare prices and features.

Models (23)

Organization	Model Name	Original Model	Input (per 1M tokens)	Output (per 1M tokens)
Z.ai	GLM-5.2	`zai-org/GLM-5.2`	$1.40	$4.40
MiniMax	MiniMax M3	`MiniMaxAI/MiniMax-M3`	$0.30	$1.20
Nvidia	Nemotron 3 Ultra (550B A55B)	`nvidia/Nemotron-3-Ultra-550b-a55b`	$1.00	$3.00
Moonshot AI	Kimi K2.7 Code	`moonshotai/Kimi-K2.7-Code`	$0.95	$4.00
DeepSeek	DeepSeek V4 Pro	`deepseek-ai/DeepSeek-V4-Pro`	$1.75	$3.50
Moonshot AI	Kimi K2.6	`moonshotai/Kimi-K2.6`	$0.95	$4.00
Z.ai	GLM-5.1	`zai-org/GLM-5.1`	$1.40	$4.40
Nvidia	Nemotron 3 Super (120B A12B)	`nvidia/nemotron-3-super-120b-a12b`	$0.30	$0.90
Qwen	Qwen3.5-397B-A17B	`Qwen/Qwen3.5-397B-A17B`	$0.60	$3.60
MiniMax	MiniMax M2.5	`MiniMaxAI/MiniMax-M2.5`	$0.30	$1.20
Qwen	Qwen3 Embedding 8B	`Qwen/Qwen3-Embedding-8B`	$0.01	$0.00
Qwen	Qwen3-Next-80B-A3B-Thinking	`Qwen/Qwen3-Next-80B-A3B-Thinking`	$0.15	$1.20
OpenAI	GPT OSS 120B	`openai/gpt-oss-120b`	$0.15	$0.60
Qwen	Qwen3 32B	`Qwen/Qwen3-32B`	$0.10	$0.30
Qwen	Qwen3-235B-A22B-Instruct-2507	`Qwen/Qwen3-235B-A22B-Instruct-2507`	$0.20	$0.60
Nvidia	Llama 3.1 Nemotron Ultra 253B v1	`nvidia/Llama-3_1-Nemotron-Ultra-253B-v1`	$0.60	$1.80
Google	Gemma 3 27B	`google/gemma-3-27b-it`	$0.10	$0.30
Meta	Llama 3.3 70B Instruct	`meta-llama/Llama-3.3-70B-Instruct`	$0.13	$0.40
Nebius	Hermes 4 405B	`NousResearch/Hermes-4-405B`	$1.00	$3.00
Nebius	Hermes 4 70B	`NousResearch/Hermes-4-70B`	$0.13	$0.40
Alibaba	qwen3-30b-a3b-instruct-2507	`Qwen/Qwen3-30B-A3B-Instruct-2507`	$0.10	$0.30
Alibaba	qwen2.5-vl-72b-instruct	`Qwen/Qwen2.5-VL-72B-Instruct`	$0.25	$0.75
Nvidia	NVIDIA Nemotron 3 Nano	`nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B`	$0.06	$0.24
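
Because input and output tokens are billed at different rates, the cost of a workload depends on its mix of prompt and completion tokens. The sketch below is a rough estimate only: it copies three rows from the table above and assumes the listed prices are USD per 1M tokens with no other fees or discounts. The `estimate_cost` helper and the `PRICES_PER_MILLION` dictionary are hypothetical names for illustration, not part of any Nebius SDK.

```python
from decimal import Decimal

# Prices as (input, output), copied from the table above.
# Assumption: USD per 1M tokens, with no additional fees or discounts.
PRICES_PER_MILLION = {
    "openai/gpt-oss-120b": (Decimal("0.15"), Decimal("0.60")),
    "deepseek-ai/DeepSeek-V4-Pro": (Decimal("1.75"), Decimal("3.50")),
    "Qwen/Qwen3-32B": (Decimal("0.10"), Decimal("0.30")),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost in USD of a workload (hypothetical helper)."""
    input_price, output_price = PRICES_PER_MILLION[model_id]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 2M prompt tokens and 500K completion tokens on GPT OSS 120B.
    print(estimate_cost("openai/gpt-oss-120b", 2_000_000, 500_000))
```

For the example workload, 2M input tokens at $0.15 and 500K output tokens at $0.60 each contribute $0.30, for an estimated $0.60 in total. Decimal arithmetic is used so that the small per-token amounts do not pick up floating-point rounding error.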