Reddit r/LocalLLaMA

Nemotron 3 Ultra. 550 billion parameters, 55B active. 1 million context

Signal: 45
Hype: 35
In three lines: NVIDIA releases Nemotron 3 Ultra, a 550B-parameter model with 55B active parameters and a 1M-token context window. Its Mixture-of-Experts architecture is designed for efficient inference.
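For a rough sense of what those figures imply, here is a minimal back-of-the-envelope sketch. Only the 550B and 55B counts come from the post; the precisions listed (bf16, fp8, 4-bit) and the helper names are assumptions for illustration, since the post does not say how the weights are stored.

```python
# Parameter counts taken from the summary above.
TOTAL_PARAMS = 550e9
ACTIVE_PARAMS = 55e9

# Hypothetical storage precisions in bytes per parameter (assumed, not from the post).
BYTES_PER_PARAM = {"bf16": 2.0, "fp8": 1.0, "4-bit": 0.5}


def active_fraction(total: float, active: float) -> float:
    """Share of the parameters counted as active."""
    return active / total


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Memory for the weights alone, in decimal gigabytes (no KV cache or activations)."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    print(f"active fraction: {active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS):.0%}")
    for name, nbytes in BYTES_PER_PARAM.items():
        print(f"{name}: ~{weight_memory_gb(TOTAL_PARAMS, nbytes):,.0f} GB of weights")
```

The active count mainly reduces compute per token. In a typical Mixture-of-Experts setup all 550B weights still have to be stored somewhere, so the memory estimate uses the total, not the active, count.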
Open source, Infrastructure

Summary generated by Claude — human-verified