Nvidia icon

Nvidia

nvidia

Updated 17 minutes ago

NVIDIA NIM (NVIDIA Inference Microservices) is a platform that provides optimized AI model inference containers featuring industry-leading APIs for running AI models across NVIDIA's accelerated infrastructure. NIM supports models from major providers including Meta (Llama), Google (Gemma), Mistral, xAI (Grok), DeepSeek, Microsoft (Phi), Qwen, and NVIDIA's own Nemotron family. The platform offers standard APIs across multiple deployment options including cloud, on-premises, and local workstations, with microservices optimized for NVIDIA GPUs. NIM provides an OpenAI-compatible API endpoint at integrate.api.nvidia.com for easy integration, featuring over 180 models from various AI companies hosted on NVIDIA's inference infrastructure.

Browse 85 LLM models available from Nvidia. Compare prices and features.

Models (85)

Organization Model Name Original Model Input Output Free
qwen
qwen Qwen3.5-397B-A17B qwen/qwen3.5-397b-a17b - -
Moonshot AI
Moonshot AI Kimi K2.5 moonshotai/kimi-k2.5 $0.00 $0.00 Free
Z.ai
Z.ai GLM-4.7 z-ai/glm4.7 $0.00 $0.00 Free
Minimax
Minimax MiniMax M2.1 minimaxai/minimax-m2.1 $0.00 $0.00 Free
OpenAI
OpenAI GPT OSS 120B openai/gpt-oss-120b $0.00 $0.00 Free
Minimax
Minimax MiniMax M2 minimaxai/minimax-m2 $0.00 $0.00 Free
qwen
qwen Qwen3-Next-80B-A3B-Thinking qwen/qwen3-next-80b-a3b-thinking $0.00 $0.00 Free
Nvidia
Nvidia Llama 3.1 Nemotron Ultra 253B v1 nvidia/llama-3.1-nemotron-ultra-253b-v1 $0.00 $0.00 Free
Moonshot AI
Moonshot AI Kimi K2-Instruct-0905 moonshotai/kimi-k2-instruct-0905 $0.00 $0.00 Free
Moonshot AI
Moonshot AI Kimi K2 Instruct moonshotai/kimi-k2-instruct $0.00 $0.00 Free
Nvidia
Nvidia Nemotron 3 Nano (30B A3B) nvidia/nemotron-3-nano-30b-a3b $0.00 $0.00 Free
DeepSeek
DeepSeek DeepSeek-V3.1 deepseek-ai/deepseek-v3.1 $0.00 $0.00 Free
qwen
qwen Qwen3-Next-80B-A3B-Instruct qwen/qwen3-next-80b-a3b-instruct $0.00 $0.00 Free
OpenAI
OpenAI GPT OSS 20B openai/gpt-oss-20b - -
Mistral
Mistral Magistral Small 2506 mistralai/magistral-small-2506 - -
Nvidia
Nvidia Llama-3.3 Nemotron Super 49B v1 nvidia/llama-3.3-nemotron-super-49b-v1 $0.00 $0.00 Free
qwen
qwen QwQ-32B qwen/qwq-32b $0.00 $0.00 Free
Nvidia
Nvidia Nemotron Nano 9B v2 nvidia/nvidia-nemotron-nano-9b-v2 $0.00 $0.00 Free
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 32B deepseek-ai/deepseek-r1-distill-qwen-32b - -
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 14B deepseek-ai/deepseek-r1-distill-qwen-14b - -
Nvidia
Nvidia Llama 3.1 Nemotron Nano 8B V1 nvidia/llama-3.1-nemotron-nano-8b-v1 - -
Meta
Meta Llama 3.1 405B Instruct meta/llama-3.1-405b-instruct $0.00 $0.00 Free
Meta
Meta Llama 3.3 70B Instruct meta/llama-3.3-70b-instruct $0.00 $0.00 Free
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 7B deepseek-ai/deepseek-r1-distill-qwen-7b - -
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 8B deepseek-ai/deepseek-r1-distill-llama-8b - -
qwen
qwen Qwen3 235B A22B qwen/qwen3-235b-a22b $0.00 $0.00 Free
Mistral
Mistral Mistral Small 3.1 24B Instruct mistralai/mistral-small-3.1-24b-instruct-2503 $0.00 $0.00 Free
google
google Gemma 3 27B google/gemma-3-27b-it $0.00 $0.00 Free
Meta
Meta Llama 3.1 70B Instruct meta/llama-3.1-70b-instruct $0.00 $0.00 Free
qwen
qwen Qwen2.5 7B Instruct qwen/qwen2.5-7b-instruct - -
Meta
Meta Llama 3.2 3B Instruct meta/llama-3.2-3b-instruct - -
Meta
Meta Llama 3.1 8B Instruct meta/llama-3.1-8b-instruct - -
Microsoft Phi-3.5-mini-instruct microsoft/phi-3.5-mini-instruct - -
qwen
qwen Qwen2 7B Instruct qwen/qwen2-7b-instruct - -
google
google Gemma 3n E2B Instructed google/gemma-3n-e2b-it $0.00 $0.00 Free
google
google Gemma 3n E4B Instructed google/gemma-3n-e4b-it $0.00 $0.00 Free
google
google Gemma 3 1B google/gemma-3-1b-it $0.00 $0.00 Free
Moonshot AI
Moonshot AI Kimi K2 Thinking moonshotai/kimi-k2-thinking $0.00 $0.00 Free
Nvidia
Nvidia Cosmos Nemotron 34B nvidia/cosmos-nemotron-34b $0.00 $0.00 Free
Nvidia
Nvidia Parakeet TDT 0.6B v2 nvidia/parakeet-tdt-0.6b-v2 $0.00 $0.00 Free
Nvidia
Nvidia NeMo Retriever OCR v1 nvidia/nemoretriever-ocr-v1 $0.00 $0.00 Free
Nvidia
Nvidia Llama 3.3 Nemotron Super 49b V1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5 $0.00 $0.00 Free
google
google gemma-2-2b-it google/gemma-2-2b-it $0.00 $0.00 Free
google
google Gemma 2 27B google/gemma-2-27b-it $0.00 $0.00 Free
Nvidia
Nvidia Phi 3 Medium 128k Instruct microsoft/phi-3-medium-128k-instruct $0.00 $0.00 Free
Nvidia
Nvidia Phi 3 Small 128k Instruct microsoft/phi-3-small-128k-instruct $0.00 $0.00 Free
Microsoft Phi-3.5-vision-instruct microsoft/phi-3.5-vision-instruct $0.00 $0.00 Free
Microsoft phi-3-small-8k-instruct microsoft/phi-3-small-8k-instruct $0.00 $0.00 Free
Nvidia
Nvidia Phi-4-Mini microsoft/phi-4-mini-instruct $0.00 $0.00 Free
Microsoft phi-3-medium-4k-instruct microsoft/phi-3-medium-4k-instruct $0.00 $0.00 Free
Nvidia
Nvidia Whisper Large v3 openai/whisper-large-v3 $0.00 $0.00 Free
qwen
qwen Qwen2.5-Coder 32B Instruct qwen/qwen2.5-coder-32b-instruct $0.00 $0.00 Free
qwen
qwen Qwen2.5-Coder 7B Instruct qwen/qwen2.5-coder-7b-instruct $0.00 $0.00 Free
qwen
qwen Qwen3-Coder 480B A35B Instruct qwen/qwen3-coder-480b-a35b-instruct $0.00 $0.00 Free
Nvidia
Nvidia Devstral-2-123B-Instruct-2512 mistralai/devstral-2-123b-instruct-2512 $0.00 $0.00 Free
Nvidia
Nvidia Mistral Large 3 675B Instruct 2512 mistralai/mistral-large-3-675b-instruct-2512 $0.00 $0.00 Free
Nvidia
Nvidia Ministral 3 14B Instruct 2512 mistralai/ministral-14b-instruct-2512 $0.00 $0.00 Free
Nvidia
Nvidia Mamba Codestral 7b V0.1 mistralai/mamba-codestral-7b-v0.1 $0.00 $0.00 Free
Nvidia
Nvidia Llama 3.2 11b Vision Instruct meta/llama-3.2-11b-vision-instruct $0.00 $0.00 Free
Meta
Meta llama-3-70b-instruct meta/llama3-70b-instruct $0.00 $0.00 Free
Meta
Meta llama-3.2-1b-instruct meta/llama-3.2-1b-instruct $0.00 $0.00 Free
Meta
Meta llama-4-scout-17b-16e-instruct meta/llama-4-scout-17b-16e-instruct $0.00 $0.00 Free
Meta
Meta llama-4-maverick-17b-128e-instruct meta/llama-4-maverick-17b-128e-instruct $0.00 $0.00 Free
Meta
Meta llama-3-8b-instruct meta/llama3-8b-instruct $0.00 $0.00 Free
DeepSeek
DeepSeek deepseek-v3.1-terminus deepseek-ai/deepseek-v3.1-terminus $0.00 $0.00 Free
DeepSeek
DeepSeek deepseek-v3.2 deepseek-ai/deepseek-v3.2 $0.00 $0.00 Free
Nvidia
Nvidia Nemotron Nano 12B 2 VL (free) nvidia/nemotron-nano-12b-v2-vl - -
SiliconFlow
SiliconFlow ByteDance-Seed/Seed-OSS-36B-Instruct bytedance/seed-oss-36b-instruct - -
Black Forest Labs
Black Forest Labs flux-1-kontext-dev black-forest-labs/FLUX.1-Kontext-dev - -
Mistral
Mistral mixtral-8x22b-instruct-v0.1 mistralai/mixtral-8x22b-instruct-v0.1 - -
Mistral
Mistral mixtral-8x7b-instruct-v0.1 mistralai/mixtral-8x7b-instruct-v0.1 - -
ibm Granite 3.3 8B Instruct ibm/granite-3.3-8b-instruct - -
Groq
Groq Llama Guard 4 12B meta/llama-guard-4-12b - -
Nvidia
Nvidia FLUX.1-dev black-forest-labs/FLUX.1-dev $0.00 $0.00 Free
Black Forest Labs
Black Forest Labs FLUX.1-schnell black-forest-labs/FLUX.1-schnell - -
Mistral
Mistral Mistral 7B Instruct v0.3 mistralai/mistral-7b-instruct-v0.3 - -
Azure
Azure Llama-3.2-90B-Vision-Instruct meta/llama-3.2-90b-vision-instruct - -
Microsoft Phi-4-multimodal-instruct microsoft/phi-4-multimodal-instruct - -
Microsoft phi-3-mini-128k-instruct microsoft/phi-3-mini-128k-instruct - -
google
google Gemma 2 9B google/gemma-2-9b-it - -
Microsoft phi-3-mini-4k-instruct microsoft/phi-3-mini-4k-instruct - -
Mistral
Mistral mistral-7b-instruct-v0.2 mistralai/mistral-7b-instruct-v0.2 - -
StepFun
StepFun Step-3.5-Flash stepfun-ai/step-3.5-flash - -
Z.ai
Z.ai GLM-5 z-ai/glm5 $0.00 $0.00 Free
Minimax
Minimax MiniMax M2.5 minimaxai/minimax-m2.5 - -