NVIDIA: Nemotron 3 Ultra

● Excellent

by Nvidia Intelligent Agentic Open Weight Rank #30 of 75

8.5
/ 10

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Choose a model to compare against NVIDIA: Nemotron 3 Ultra

Specifications

Specifications for NVIDIA: Nemotron 3 Ultra
AttributeValue
Lab Nvidia
Tags Intelligent Agentic Open Weight
Overall Score 8.5/10
Release Date 2026-06
Context Window 1,000,000 tokens
Input Price / 1M $0.50
Output Price / 1M $2.50
Input Modalities Text
Output Modalities Text

Strengths

  • Open frontier-reasoning model with 55B active parameters (550B total MoE)
  • Huge 1M-token hybrid Transformer-Mamba context window
  • Designed for long-running agent orchestration and complex deep research
  • Efficient pricing at competitive open-model rates

Weaknesses

  • NVIDIA is newer to LLMs — less ecosystem maturity vs established labs
  • Performance on pure benchmark scores not yet independently validated

Best For

  • Multi-step agent orchestration and coding pipelines
  • Deep research with very long documents
  • Enterprise agentic AI at open-model pricing

Sources & Further Reading

Related Models

Scores are aggregated from public benchmarks (MMLU, HumanEval, MATH, GSM8K, LMSYS) and normalized to a 1–10 scale. Methodology →