NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 vs NVIDIA: Nemotron 3 Super

Side-by-side comparison. Updated 2026-07-01.

Summary

Compare NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 and NVIDIA: Nemotron 3 Super across pricing, context window, modalities, strengths, weaknesses, and best-fit workloads.

Use the details below to weigh cost, context, modalities, and workload fit.

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

by Nvidia

Input pricing

$0.40 /1M input tokens

Full review →

NVIDIA: Nemotron 3 Super

by Nvidia

Input pricing

$0.09 /1M input tokens

Full review →

At a Glance

Side-by-side comparison of NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 and NVIDIA: Nemotron 3 Super
Attribute	NVIDIA: Llama 3.3 Nemotron Super 49B V1.5	NVIDIA: Nemotron 3 Super
Provider	Nvidia	Nvidia
Tags	Agentic Intelligent	Intelligent Agentic Open Weight Long Context
Context	131,072 tokens	1,000,000 tokens
Input price	$0.40/1M	$0.09/1M
Output price	$0.40/1M	$0.45/1M
Input modalities	Text	Text
Output modalities	Text	Text

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 Strengths

49B parameters derived from Llama 3.3 70B with post-training
Optimized for agentic workflows: RAG, tool calling, code
128K context window for complex multi-step tasks
Competitive pricing at $0.40/$0.40 per 1M tokens
Strong across math, code, science, and instruction following

NVIDIA: Nemotron 3 Super Strengths

120B MoE with only 12B active params for high efficiency
1M token context window for very long tasks
Open-weight hybrid Mamba architecture
Strong multi-agent application performance

← All comparisons