Model Information
| Slug | nemotron-nano-12b-v2-vl |
|---|---|
| LLM.txt | View |
| Release Date | October 28, 2025 |
Organization
| Name | Nvidia |
|---|---|
| Website | https://www.nvidia.com/en-us/ai/ |
Model Description
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s memory-efficient sequence modeling for significantly higher throughput and lower latency.
The model supports inputs of text and multi-image documents, producing natural-language outputs. It is trained on high-quality NVIDIA-curated synthetic datasets optimized for optical-character recognition, chart reasoning, and multimodal comprehension.
Nemotron Nano 2 VL achieves leading results on OCRBench v2 and scores ≈ 74 average across MMMU, MathVista, AI2D, OCRBench, OCR-Reasoning, ChartQA, DocVQA, and Video-MME—surpassing prior open VL baselines. With Efficient Video Sampling (EVS), it handles long-form videos while reducing inference cost.
Open-weights, training data, and fine-tuning recipes are released under a permissive NVIDIA open license, with deployment supported across NeMo, NIM, and major inference runtimes.
The model supports inputs of text and multi-image documents, producing natural-language outputs. It is trained on high-quality NVIDIA-curated synthetic datasets optimized for optical-character recognition, chart reasoning, and multimodal comprehension.
Nemotron Nano 2 VL achieves leading results on OCRBench v2 and scores ≈ 74 average across MMMU, MathVista, AI2D, OCRBench, OCR-Reasoning, ChartQA, DocVQA, and Video-MME—surpassing prior open VL baselines. With Efficient Video Sampling (EVS), it handles long-form videos while reducing inference cost.
Open-weights, training data, and fine-tuning recipes are released under a permissive NVIDIA open license, with deployment supported across NeMo, NIM, and major inference runtimes.
Available at 16 Providers
| Provider | Type | Model Name | Original Model | Input ($/1M) | Output ($/1M) | Free | Actions | |
|---|---|---|---|---|---|---|---|---|
|
|
Kilo Code |
Code
|
NVIDIA: Nemotron Nano 12B 2 VL (free)
|
nvidia/nemotron-nano-12b-v2-vl:free
|
$0.00 | $0.00 | ||
|
|
OpenRouter |
Chat
Code
|
Nemotron Nano 12B 2 VL (free)
|
nvidia/nemotron-nano-12b-v2-vl:free
|
$0.00 | $0.00 | ||
|
|
Routeway |
NVIDIA: Nemotron Nano 12B 2 VL
|
nemotron-nano-12b-v2-vl
|
$0.02 | $0.06 | |||
|
|
OpenRouter |
Chat
Code
|
Nemotron Nano 12B 2 VL
|
nvidia/nemotron-nano-12b-v2-vl
|
$0.20 | $0.60 | ||
|
|
Vercel AI Gateway |
Nvidia Nemotron Nano 12B V2 VL
|
nemotron-nano-12b-v2-vl
|
$0.20 | $0.60 | |||
|
|
Kilo Code |
Code
|
NVIDIA: Nemotron Nano 12B 2 VL
|
nvidia/nemotron-nano-12b-v2-vl
|
$0.20 | $0.60 | ||
|
|
Chats-LLM |
Chat
|
NVIDIA: Nemotron Nano 12B 2 VL
|
nemotron-nano-12b-v2-vl
|
$0.20 | $0.60 | ||
|
|
WaveSpeed AI |
Chat
Code
|
nemotron-nano-12b-v2-vl
|
nvidia/nemotron-nano-12b-v2-vl
|
$0.22 | $0.66 | ||
|
|
AIMLAPI |
Nemotron Nano 12B V2 VL
|
nvidia/nemotron-nano-12b-v2-vl
|
$0.27 | $0.82 | |||
|
|
ValorGPT |
Nemotron Nano 12B 2 VL
|
nvidia-nemotron-nano-12b-v2-vl
|
- | - | |||
|
|
ValorGPT |
Nemotron Nano 12B 2 VL
|
nvidia-nemotron-nano-12b-v2-vl-free
|
- | - | |||
|
|
Yupp |
Chat
|
Nemotron Nano 12B 2 VL (OpenRouter)
|
nvidia/nemotron-nano-12b-v2-vl
|
- | - | ||
|
|
LangDB |
nemotron-nano-12b-v2-vl
|
nemotron-nano-12b-v2-vl
|
- | - | |||
|
|
Nvidia |
nemotron-nano-12b-v2-vl
|
nvidia/nemotron-nano-12b-v2-vl
|
- | - | |||
|
|
Writingmate |
Chat
Code
|
NVIDIA: Nemotron Nano 12B 2 VL
|
nvidia/nemotron-nano-12b-v2-vl
|
- | - | ||
|
|
Blackbox AI |
Code
|
blackboxai/nvidia/nemotron-nano-12b-v2-vl:free
|
blackboxai/nvidia/nemotron-nano-12b-v2-vl:free
|
- | - |