NVIDIA: Nemotron Nano 9B V2

OpenAI • text • function-calling • json-mode

Provider IDnvidia/nemotron-nano-9b-v2

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Quick Summary

Best For:

High-volume, low-latency tasks where cost efficiency is paramount

Pricing:

$0.00/1M input tokens, $0.00/1M output tokens

Context Window:

131,072 tokens

Key Differentiator:

Cost-optimized for high-volume usage

Specifications

Context Window

131,072 tokens

Streaming

Yes

JSON Mode

Yes

Vision

Tier

Affordable

Capabilities

text

function-calling

json-mode

NVIDIA: Nemotron Nano 9B V2

Best For:

Pricing:

Context Window:

Key Differentiator:

Social

Legal