March 18, 2025, 4:52 p.m.

NVIDIA Launches Llama Nemotron Models to Advance AI Reasoning

Brief news summary

NVIDIA has unveiled the Llama Nemotron model family, designed to give developers and businesses building autonomous agents stronger AI reasoning. NVIDIA says post-training improves the models' accuracy by up to 20% and that inference runs 5x faster than on other leading open models, which makes complex reasoning tasks more manageable and lowers operating costs. Companies including Microsoft, SAP, and Accenture are working with NVIDIA to use the models to improve operational efficiency across industries. The family comes in three versions (Nano, Super, and Ultra), so it can be deployed on anything from edge devices to multi-GPU servers. The models are backed by the NVIDIA AI Enterprise platform, which includes the AI-Q Blueprint for knowledge integration and NIM microservices for deployment. They are accessible via build.nvidia.com and Hugging Face.

**NVIDIA Unveils Llama Nemotron Models for Enhanced AI Reasoning**

During GTC, NVIDIA introduced its open Llama Nemotron family of models, built to give developers and businesses advanced AI reasoning capabilities. The models can be used to create AI agents that work independently or collaboratively on intricate tasks.

The family is based on Llama models and gains its reasoning ability from additional post-training. According to NVIDIA, this improves accuracy on multistep math, coding, and decision-making by up to 20% and raises inference speed by 5x compared to other leading models. As a result, enterprises can expect better decision-making capabilities and reduced operational costs.

Major industry collaborators, including Accenture, Microsoft, SAP, and Deloitte, are integrating the models into their platforms. For instance, Microsoft is embedding Llama Nemotron models in its Azure AI Foundry, while SAP is using them to improve AI solutions for its Joule copilot.

The Llama Nemotron models come in three sizes (Nano, Super, and Ultra), each tailored to specific deployment requirements. All are accessible as NVIDIA NIM™ microservices: the Nano model is optimized for PCs and edge devices, the Super model for a single GPU, and the Ultra model for multi-GPU servers.

NVIDIA plans to openly share the tools and datasets used for model training, so enterprises can build their own tailored reasoning models. Deployment is streamlined through the NVIDIA AI Enterprise suite, which includes tools such as the AI-Q Blueprint and customizable data platforms. The Llama Nemotron Nano and Super models are currently available as APIs, and enterprises can run them in production with NVIDIA AI Enterprise. The AI-Q Blueprint is expected in April, and developers can access the NVIDIA AgentIQ toolkit now on GitHub.

NVIDIA continues to position itself as a leader in accelerated computing, emphasizing its commitment to innovation as market conditions and technological landscapes evolve.

