Nvidia

nvidia

Updated 1 hour ago

NVIDIA NIM (NVIDIA Inference Microservices) is a platform that provides optimized AI model inference containers featuring industry-leading APIs for running AI models across NVIDIA's accelerated infrastructure. NIM supports models from major providers including Meta (Llama), Google (Gemma), Mistral, xAI (Grok), DeepSeek, Microsoft (Phi), Qwen, and NVIDIA's own Nemotron family. The platform offers standard APIs across multiple deployment options including cloud, on-premises, and local workstations, with microservices optimized for NVIDIA GPUs. NIM provides an OpenAI-compatible API endpoint at integrate.api.nvidia.com for easy integration, featuring over 180 models from various AI companies hosted on NVIDIA's inference infrastructure.

Visit Website LLMs.txt

Browse 49 LLM models available from Nvidia. Compare prices and features.

Models (49)

Organization	Model Name	Original Model	Input	Output	Free
Z.ai	GLM-5.2	`z-ai/glm-5.2`	$0.00	$0.00	Free	Model
Minimax	MiniMax M3	`minimaxai/minimax-m3`	$0.00	$0.00	Free	Model
Nvidia	Nemotron 3 Ultra (550B A55B)	`nvidia/nemotron-3-ultra-550b-a55b`	$0.50	$2.50		Model
StepFun	Step 3.7 Flash	`stepfun-ai/step-3.7-flash`	$0.00	$0.00	Free	Model
DeepSeek	DeepSeek V4 Pro	`deepseek-ai/deepseek-v4-pro`	$0.44	$0.87		Model
DeepSeek	DeepSeek V4 Flash	`deepseek-ai/deepseek-v4-flash`	$0.14	$0.28		Model
google	DiffusionGemma 26B-A4B	`google/diffusiongemma-26b-a4b-it`	-	-		Model
Minimax	MiniMax M2.7	`minimaxai/minimax-m2.7`	$0.00	$0.00	Free	Model
google	Gemma 4 31B	`google/gemma-4-31b-it`	$0.00	$0.00	Free	Model
Nvidia	Nemotron 3 Super (120B A12B)	`nvidia/nemotron-3-super-120b-a12b`	$0.20	$0.80		Model
qwen	Qwen3.5-122B-A10B	`qwen/qwen3.5-122b-a10b`	$0.00	$0.00	Free	Model
qwen	Qwen3.5-397B-A17B	`qwen/qwen3.5-397b-a17b`	$0.00	$0.00	Free	Model
StepFun	Step-3.5-Flash	`stepfun-ai/step-3.5-flash`	$0.00	$0.00	Free	Model
Nvidia	Nemotron 3 Nano (30B A3B)	`nvidia/nemotron-3-nano-30b-a3b`	$0.00	$0.00	Free	Model
qwen	Qwen3-Next-80B-A3B-Instruct	`qwen/qwen3-next-80b-a3b-instruct`	$0.00	$0.00	Free	Model
OpenAI	GPT OSS 120B	`openai/gpt-oss-120b`	$0.00	$0.00	Free	Model
Nvidia	NVIDIA Nemotron Nano 9B V2	`nvidia/nvidia-nemotron-nano-9b-v2`	$0.00	$0.00	Free	Model
OpenAI	GPT OSS 20B	`openai/gpt-oss-20b`	$0.00	$0.00	Free	Model
Nvidia	Llama-3.3 Nemotron Super 49B v1	`nvidia/llama-3.3-nemotron-super-49b-v1`	-	-		Model
google	Gemma 3n E2B Instructed	`google/gemma-3n-e2b-it`	$0.00	$0.00	Free	Model
google	Gemma 3n E4B Instructed	`google/gemma-3n-e4b-it`	$0.00	$0.00	Free	Model
Nvidia	Llama 3.1 Nemotron Nano 8B V1	`nvidia/llama-3.1-nemotron-nano-8b-v1`	-	-		Model
Meta	Llama 3.3 70B Instruct	`meta/llama-3.3-70b-instruct`	$0.00	$0.00	Free	Model
Meta	Llama 3.2 3B Instruct	`meta/llama-3.2-3b-instruct`	$0.00	$0.00	Free	Model
Meta	Llama 3.1 70B Instruct	`meta/llama-3.1-70b-instruct`	$0.00	$0.00	Free	Model
Meta	Llama 3.1 8B Instruct	`meta/llama-3.1-8b-instruct`	$0.00	$0.00	Free	Model
Meta	llama-4-maverick-17b-128e-instruct	`meta/llama-4-maverick-17b-128e-instruct`	$0.00	$0.00	Free	Model
Nvidia	Whisper Large v3	`openai/whisper-large-v3`	$0.00	$0.00	Free	Model
Black Forest Labs	flux-2-klein-4b	`black-forest-labs/flux.2-klein-4b`	$0.00	$0.00	Free	Model
Nvidia	Nemotron Nano 12B 2 VL (free)	`nvidia/nemotron-nano-12b-v2-vl`	-	-		Model
Nvidia	Phi-4-Mini	`microsoft/phi-4-mini-instruct`	$0.00	$0.00	Free	Model
Nvidia	Llama 3.3 Nemotron Super 49b V1.5	`nvidia/llama-3.3-nemotron-super-49b-v1.5`	-	-		Model
SiliconFlow	Seed-OSS-36B-Instruct	`bytedance/seed-oss-36b-instruct`	$0.00	$0.00	Free	Model
Groq	Llama Guard 4 12B	`meta/llama-guard-4-12b`	$0.00	$0.00	Free	Model
microsoft	Phi-4-multimodal-instruct	`microsoft/phi-4-multimodal-instruct`	$0.00	$0.00	Free	Model
Nvidia	Llama 3.2 11b Vision Instruct	`meta/llama-3.2-11b-vision-instruct`	$0.00	$0.00	Free	Model
Meta	llama-3.2-1b-instruct	`meta/llama-3.2-1b-instruct`	$0.00	$0.00	Free	Model
Azure	Llama-3.2-90B-Vision-Instruct	`meta/llama-3.2-90b-vision-instruct`	$0.00	$0.00	Free	Model
Black Forest Labs	flux-1-kontext-dev	`black-forest-labs/FLUX.1-Kontext-dev`	$0.00	$0.00	Free	Model
Nvidia	FLUX.1-dev	`black-forest-labs/FLUX.1-dev`	$0.00	$0.00	Free	Model
Black Forest Labs	FLUX.1-schnell	`black-forest-labs/FLUX.1-schnell`	$0.00	$0.00	Free	Model
google	gemma-2-2b-it	`google/gemma-2-2b-it`	$0.00	$0.00	Free	Model
Nvidia	Ministral 3 14B Instruct 2512	`mistralai/ministral-14b-instruct-2512`	-	-		Model
Nvidia	Mistral Large 3 675B Instruct 2512	`mistralai/mistral-large-3-675b-instruct-2512`	$0.00	$0.00	Free	Model
Mistral	mixtral-8x7b-instruct-v0.1	`mistralai/mixtral-8x7b-instruct-v0.1`	-	-		Model
Nvidia	Nemotron 3 Nano Omni 30B A3B Reasoning	`nvidia/nemotron-3-nano-omni-30b-a3b-reasoning`	$0.00	$0.00	Free	Model
Nvidia	Parakeet TDT 0.6B v2	`nvidia/parakeet-tdt-0.6b-v2`	-	-		Model
qwen	Qwen Image	`qwen/qwen-image`	$0.00	$0.00	Free	Model
qwen	Qwen Image Edit	`qwen/qwen-image-edit`	$0.00	$0.00	Free	Model

Back to Providers Visit Website