Nvidia
nvidia
Updated 17 minutes ago
NVIDIA NIM (NVIDIA Inference Microservices) is a platform that provides optimized AI model inference containers featuring industry-leading APIs for running AI models across NVIDIA's accelerated infrastructure. NIM supports models from major providers including Meta (Llama), Google (Gemma), Mistral, xAI (Grok), DeepSeek, Microsoft (Phi), Qwen, and NVIDIA's own Nemotron family. The platform offers standard APIs across multiple deployment options including cloud, on-premises, and local workstations, with microservices optimized for NVIDIA GPUs. NIM provides an OpenAI-compatible API endpoint at integrate.api.nvidia.com for easy integration, featuring over 180 models from various AI companies hosted on NVIDIA's inference infrastructure.
Browse 85 LLM models available from Nvidia. Compare prices and features.
Models (85)
| Organization | Model Name | Original Model | Input | Output | Free | |||
|---|---|---|---|---|---|---|---|---|
|
|
qwen | Qwen3.5-397B-A17B |
qwen/qwen3.5-397b-a17b
|
- | - | |||
|
|
Moonshot AI | Kimi K2.5 |
moonshotai/kimi-k2.5
|
$0.00 | $0.00 | Free |
|
|
|
|
Z.ai | GLM-4.7 |
z-ai/glm4.7
|
$0.00 | $0.00 | Free |
|
|
|
|
Minimax | MiniMax M2.1 |
minimaxai/minimax-m2.1
|
$0.00 | $0.00 | Free |
|
|
|
|
OpenAI | GPT OSS 120B |
openai/gpt-oss-120b
|
$0.00 | $0.00 | Free |
|
|
|
|
Minimax | MiniMax M2 |
minimaxai/minimax-m2
|
$0.00 | $0.00 | Free | ||
|
|
qwen | Qwen3-Next-80B-A3B-Thinking |
qwen/qwen3-next-80b-a3b-thinking
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Llama 3.1 Nemotron Ultra 253B v1 |
nvidia/llama-3.1-nemotron-ultra-253b-v1
|
$0.00 | $0.00 | Free | ||
|
|
Moonshot AI | Kimi K2-Instruct-0905 |
moonshotai/kimi-k2-instruct-0905
|
$0.00 | $0.00 | Free | ||
|
|
Moonshot AI | Kimi K2 Instruct |
moonshotai/kimi-k2-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Nemotron 3 Nano (30B A3B) |
nvidia/nemotron-3-nano-30b-a3b
|
$0.00 | $0.00 | Free | ||
|
|
DeepSeek | DeepSeek-V3.1 |
deepseek-ai/deepseek-v3.1
|
$0.00 | $0.00 | Free | ||
|
|
qwen | Qwen3-Next-80B-A3B-Instruct |
qwen/qwen3-next-80b-a3b-instruct
|
$0.00 | $0.00 | Free | ||
|
|
OpenAI | GPT OSS 20B |
openai/gpt-oss-20b
|
- | - | |||
|
|
Mistral | Magistral Small 2506 |
mistralai/magistral-small-2506
|
- | - | |||
|
|
Nvidia | Llama-3.3 Nemotron Super 49B v1 |
nvidia/llama-3.3-nemotron-super-49b-v1
|
$0.00 | $0.00 | Free | ||
|
|
qwen | QwQ-32B |
qwen/qwq-32b
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Nemotron Nano 9B v2 |
nvidia/nvidia-nemotron-nano-9b-v2
|
$0.00 | $0.00 | Free | ||
|
|
DeepSeek | DeepSeek R1 Distill Qwen 32B |
deepseek-ai/deepseek-r1-distill-qwen-32b
|
- | - | |||
|
|
DeepSeek | DeepSeek R1 Distill Qwen 14B |
deepseek-ai/deepseek-r1-distill-qwen-14b
|
- | - | |||
|
|
Nvidia | Llama 3.1 Nemotron Nano 8B V1 |
nvidia/llama-3.1-nemotron-nano-8b-v1
|
- | - | |||
|
|
Meta | Llama 3.1 405B Instruct |
meta/llama-3.1-405b-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Meta | Llama 3.3 70B Instruct |
meta/llama-3.3-70b-instruct
|
$0.00 | $0.00 | Free | ||
|
|
DeepSeek | DeepSeek R1 Distill Qwen 7B |
deepseek-ai/deepseek-r1-distill-qwen-7b
|
- | - | |||
|
|
DeepSeek | DeepSeek R1 Distill Llama 8B |
deepseek-ai/deepseek-r1-distill-llama-8b
|
- | - | |||
|
|
qwen | Qwen3 235B A22B |
qwen/qwen3-235b-a22b
|
$0.00 | $0.00 | Free |
|
|
|
|
Mistral | Mistral Small 3.1 24B Instruct |
mistralai/mistral-small-3.1-24b-instruct-2503
|
$0.00 | $0.00 | Free | ||
|
|
Gemma 3 27B |
google/gemma-3-27b-it
|
$0.00 | $0.00 | Free | |||
|
|
Meta | Llama 3.1 70B Instruct |
meta/llama-3.1-70b-instruct
|
$0.00 | $0.00 | Free | ||
|
|
qwen | Qwen2.5 7B Instruct |
qwen/qwen2.5-7b-instruct
|
- | - | |||
|
|
Meta | Llama 3.2 3B Instruct |
meta/llama-3.2-3b-instruct
|
- | - | |||
|
|
Meta | Llama 3.1 8B Instruct |
meta/llama-3.1-8b-instruct
|
- | - | |||
|
|
Microsoft | Phi-3.5-mini-instruct |
microsoft/phi-3.5-mini-instruct
|
- | - | |||
|
|
qwen | Qwen2 7B Instruct |
qwen/qwen2-7b-instruct
|
- | - | |||
|
|
Gemma 3n E2B Instructed |
google/gemma-3n-e2b-it
|
$0.00 | $0.00 | Free | |||
|
|
Gemma 3n E4B Instructed |
google/gemma-3n-e4b-it
|
$0.00 | $0.00 | Free | |||
|
|
Gemma 3 1B |
google/gemma-3-1b-it
|
$0.00 | $0.00 | Free | |||
|
|
Moonshot AI | Kimi K2 Thinking |
moonshotai/kimi-k2-thinking
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Cosmos Nemotron 34B |
nvidia/cosmos-nemotron-34b
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Parakeet TDT 0.6B v2 |
nvidia/parakeet-tdt-0.6b-v2
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | NeMo Retriever OCR v1 |
nvidia/nemoretriever-ocr-v1
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Llama 3.3 Nemotron Super 49b V1.5 |
nvidia/llama-3.3-nemotron-super-49b-v1.5
|
$0.00 | $0.00 | Free | ||
|
|
gemma-2-2b-it |
google/gemma-2-2b-it
|
$0.00 | $0.00 | Free | |||
|
|
Gemma 2 27B |
google/gemma-2-27b-it
|
$0.00 | $0.00 | Free | |||
|
|
Nvidia | Phi 3 Medium 128k Instruct |
microsoft/phi-3-medium-128k-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Phi 3 Small 128k Instruct |
microsoft/phi-3-small-128k-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Microsoft | Phi-3.5-vision-instruct |
microsoft/phi-3.5-vision-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Microsoft | phi-3-small-8k-instruct |
microsoft/phi-3-small-8k-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Phi-4-Mini |
microsoft/phi-4-mini-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Microsoft | phi-3-medium-4k-instruct |
microsoft/phi-3-medium-4k-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Whisper Large v3 |
openai/whisper-large-v3
|
$0.00 | $0.00 | Free | ||
|
|
qwen | Qwen2.5-Coder 32B Instruct |
qwen/qwen2.5-coder-32b-instruct
|
$0.00 | $0.00 | Free | ||
|
|
qwen | Qwen2.5-Coder 7B Instruct |
qwen/qwen2.5-coder-7b-instruct
|
$0.00 | $0.00 | Free | ||
|
|
qwen | Qwen3-Coder 480B A35B Instruct |
qwen/qwen3-coder-480b-a35b-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Devstral-2-123B-Instruct-2512 |
mistralai/devstral-2-123b-instruct-2512
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Mistral Large 3 675B Instruct 2512 |
mistralai/mistral-large-3-675b-instruct-2512
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Ministral 3 14B Instruct 2512 |
mistralai/ministral-14b-instruct-2512
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Mamba Codestral 7b V0.1 |
mistralai/mamba-codestral-7b-v0.1
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Llama 3.2 11b Vision Instruct |
meta/llama-3.2-11b-vision-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Meta | llama-3-70b-instruct |
meta/llama3-70b-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Meta | llama-3.2-1b-instruct |
meta/llama-3.2-1b-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Meta | llama-4-scout-17b-16e-instruct |
meta/llama-4-scout-17b-16e-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Meta | llama-4-maverick-17b-128e-instruct |
meta/llama-4-maverick-17b-128e-instruct
|
$0.00 | $0.00 | Free | ||
|
|
Meta | llama-3-8b-instruct |
meta/llama3-8b-instruct
|
$0.00 | $0.00 | Free | ||
|
|
DeepSeek | deepseek-v3.1-terminus |
deepseek-ai/deepseek-v3.1-terminus
|
$0.00 | $0.00 | Free | ||
|
|
DeepSeek | deepseek-v3.2 |
deepseek-ai/deepseek-v3.2
|
$0.00 | $0.00 | Free | ||
|
|
Nvidia | Nemotron Nano 12B 2 VL (free) |
nvidia/nemotron-nano-12b-v2-vl
|
- | - | |||
|
|
SiliconFlow | ByteDance-Seed/Seed-OSS-36B-Instruct |
bytedance/seed-oss-36b-instruct
|
- | - | |||
|
|
Black Forest Labs | flux-1-kontext-dev |
black-forest-labs/FLUX.1-Kontext-dev
|
- | - | |||
|
|
Mistral | mixtral-8x22b-instruct-v0.1 |
mistralai/mixtral-8x22b-instruct-v0.1
|
- | - | |||
|
|
Mistral | mixtral-8x7b-instruct-v0.1 |
mistralai/mixtral-8x7b-instruct-v0.1
|
- | - | |||
|
|
ibm | Granite 3.3 8B Instruct |
ibm/granite-3.3-8b-instruct
|
- | - | |||
|
|
Groq | Llama Guard 4 12B |
meta/llama-guard-4-12b
|
- | - | |||
|
|
Nvidia | FLUX.1-dev |
black-forest-labs/FLUX.1-dev
|
$0.00 | $0.00 | Free | ||
|
|
Black Forest Labs | FLUX.1-schnell |
black-forest-labs/FLUX.1-schnell
|
- | - | |||
|
|
Mistral | Mistral 7B Instruct v0.3 |
mistralai/mistral-7b-instruct-v0.3
|
- | - | |||
|
|
Azure | Llama-3.2-90B-Vision-Instruct |
meta/llama-3.2-90b-vision-instruct
|
- | - | |||
|
|
Microsoft | Phi-4-multimodal-instruct |
microsoft/phi-4-multimodal-instruct
|
- | - | |||
|
|
Microsoft | phi-3-mini-128k-instruct |
microsoft/phi-3-mini-128k-instruct
|
- | - | |||
|
|
Gemma 2 9B |
google/gemma-2-9b-it
|
- | - | ||||
|
|
Microsoft | phi-3-mini-4k-instruct |
microsoft/phi-3-mini-4k-instruct
|
- | - | |||
|
|
Mistral | mistral-7b-instruct-v0.2 |
mistralai/mistral-7b-instruct-v0.2
|
- | - | |||
|
|
StepFun | Step-3.5-Flash |
stepfun-ai/step-3.5-flash
|
- | - |
|
||
|
|
Z.ai | GLM-5 |
z-ai/glm5
|
$0.00 | $0.00 | Free |
|
|
|
|
Minimax | MiniMax M2.5 |
minimaxai/minimax-m2.5
|
- | - |
|