What is Nemotron 3 Super?

The Nemotron-3 Super stands out as a groundbreaking addition to NVIDIA's Nemotron 3 series of open models, designed specifically to support advanced agentic AI systems capable of reasoning, planning, and executing complex multi-step workflows in challenging settings. It incorporates a distinctive hybrid Mamba-Transformer Mixture-of-Experts architecture that combines the streamlined capabilities of Mamba layers with the contextual richness offered by transformer attention mechanisms, enabling it to effectively handle long sequences and complicated reasoning tasks with notable precision and efficiency. By activating only a selected subset of its parameters for each token, this design greatly improves computational efficiency while ensuring strong reasoning skills, making it particularly suitable for scalable inference in demanding situations. With an impressive configuration of around 120 billion parameters, of which approximately 12 billion are engaged during inference, the Nemotron-3 Super significantly enhances its capacity for managing multi-step reasoning and facilitating collaborative interactions among agents in broad contexts. This combination of features not only empowers it to address a wide array of challenges in the AI landscape but also positions it as a key player in the evolution of intelligent systems. Overall, the model exemplifies the potential for future innovations in AI technology.

Screenshots and Video

Company Facts

Company Name:
NVIDIA
Date Founded:
1993
Company Location:
United States
Company Website:
nvidia.com

Product Details

Deployment
SaaS
Training Options
Documentation Hub
Online Training
Webinars
Video Library
Support
Web-Based Support

Product Details

Target Company Sizes
Individual
1-10
11-50
51-200
201-500
501-1000
1001-5000
5001-10000
10001+
Target Organization Types
Mid Size Business
Small Business
Enterprise
Freelance
Nonprofit
Government
Startup
Supported Languages
English

Nemotron 3 Super Categories and Features