NVIDIA Unveils Nemotron 3 to Power the Next Wave of Multi-Agent AI Systems

NVIDIA announced the Nemotron 3 family of open models, datasets and libraries, aimed at accelerating the development of transparent, efficient multi-agent AI systems across industries.

The Nemotron 3 lineup includes Nano, Super and Ultra models built on a hybrid latent mixture-of-experts (MoE) architecture.

According to NVIDIA, the design helps lower inference costs, reduce context drift and improve coordination between multiple AI agents working together on complex tasks.

“Open innovation is the foundation of AI progress,” NVIDIA founder and CEO Jensen Huang said. “With Nemotron, we’re transforming advanced AI into an open platform that gives developers the transparency and efficiency they need to build agentic systems at scale.”

Nemotron 3 Nano, available immediately, is a 30-billion-parameter model that activates up to 3 billion parameters per token. It is optimised for low-cost inference use cases such as summarisation, software debugging and AI assistants. NVIDIA said the model delivers up to four times higher token throughput than Nemotron 2 Nano and cuts reasoning token generation by up to 60%.
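To make the "30 billion total, 3 billion active" figure concrete, the sketch below shows generic top-k expert routing, the mechanism by which a mixture-of-experts model runs only a subset of its weights for each token. It is an illustration only, not NVIDIA's implementation: the function names `top_k_route` and `active_fraction`, the expert count and the value of `k` are assumptions, and the article does not describe how Nemotron 3 configures its router.

```python
import math
import random


def top_k_route(scores, k):
    """Pick the k highest-scoring experts and softmax-normalise their scores.

    Generic top-k gating; the expert count and k used below are
    illustrative assumptions, not Nemotron 3's real configuration.
    """
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    peak = max(scores[i] for i in top)  # subtract the max for numerical stability
    exps = [math.exp(scores[i] - peak) for i in top]
    total = sum(exps)
    return top, [e / total for e in exps]


def active_fraction(total_params, active_params):
    """Share of the model's parameters that are active at once."""
    return active_params / total_params


# Figures quoted in the article for Nemotron 3 Nano.
TOTAL_PARAMS = 30e9
ACTIVE_PARAMS = 3e9
print(f"active share: {active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS):.0%}")  # active share: 10%

# Hypothetical router scores for one token over 8 experts, keeping 2.
rng = random.Random(0)
scores = [rng.gauss(0.0, 1.0) for _ in range(8)]
experts, weights = top_k_route(scores, k=2)
print(experts, weights)
```

Only the selected experts run for a given token, which is why compute per token tracks the active parameter count rather than the full model size.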

The Nano model is available on Hugging Face, through inference providers including Baseten, Fireworks and Together AI, and as an NVIDIA NIM microservice. It will also be available on AWS via Amazon Bedrock.

Nemotron 3 Super, a roughly 100-billion-parameter model, is designed for low-latency multi-agent applications, while the 500-billion-parameter Nemotron 3 Ultra targets deep reasoning and long-horizon planning. Both models use NVIDIA’s NVFP4 training format on Blackwell GPUs and are expected in the first half of 2026.

NVIDIA said the models support hybrid workflows that balance open and proprietary systems, align with sovereign AI initiatives, and are already being adopted by enterprises and partners including Accenture, ServiceNow, Siemens and Perplexity.

“NVIDIA and ServiceNow have been shaping the future of AI for years, and the best is yet to come,” said Bill McDermott, chairman and CEO of ServiceNow. “Today, we’re taking a major step forward in empowering leaders across all industries to fast-track their agentic AI strategy. ServiceNow’s intelligent workflow automation combined with NVIDIA Nemotron 3 will continue to define the standard with unmatched efficiency, speed and accuracy.”

“Perplexity is built on the idea that human curiosity will be amplified by accurate AI built into exceptional tools, like AI assistants,” said Aravind Srinivas, CEO of Perplexity. “With our agent router, we can direct workloads to the best fine-tuned open models, like Nemotron 3 Ultra, or leverage leading proprietary models when tasks benefit from their unique capabilities, ensuring our AI assistants operate with exceptional speed, efficiency and scale.”
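The routing idea Srinivas describes can be pictured with a minimal sketch like the one below. It is purely hypothetical: the article gives no detail on how Perplexity's agent router decides, so the task categories, the `Route` type, the `route` function and the placeholder model names are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    model: str
    reason: str


# Hypothetical task categories assumed to suit a fine-tuned open model.
# The article does not describe Perplexity's actual routing rules.
OPEN_MODEL_TASKS = frozenset({"summarisation", "debugging", "planning"})


def route(task_type: str,
          open_model: str = "open-model",
          proprietary_model: str = "proprietary-model") -> Route:
    """Send a task to the open model when it is in the tuned set, else fall back."""
    if task_type in OPEN_MODEL_TASKS:
        return Route(open_model, "task is covered by the fine-tuned open model")
    return Route(proprietary_model, "task falls outside the open model's tuned set")


print(route("summarisation").model)     # open-model
print(route("image-generation").model)  # proprietary-model
```

A production router would more likely weigh cost, latency and measured quality per task than rely on a fixed lookup; the static set here only keeps the example short.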