Meta to Launch Advanced Llama 4 AI Model with Over 100,000 Nvidia H100 GPUs

Meta CEO Mark Zuckerberg announced significant advancements in generative AI training during an earnings call on Wednesday. He revealed that the development of the Llama 4 model is currently underway, utilizing a cluster of GPUs that surpasses anything else previously reported, specifically boasting more than 100, 000 Nvidia H100 chips. The initial launch of Llama 4 is anticipated early next year, with smaller models likely available first. This increase in AI training scale is believed to be crucial for building more advanced AI models. While Meta seems to have a lead, other major players in the AI field are also pursuing large compute clusters. In March, it was noted that Meta and Nvidia had previously worked with around 25, 000 H100s for Llama 3, whereas Elon Musk claimed his xAI venture had established a similar setup with 100, 000 H100s. Zuckerberg hinted at Llama 4's potential for new modalities, stronger reasoning, and quicker performance, though he didn’t elaborate on specific capabilities. Unlike proprietary models from OpenAI and Google, Meta’s Llama models can be downloaded for free, which has appealed to startups and researchers wanting more control over their AI systems. However, the open-source label comes with restrictions on commercial use, and Meta does not disclose training details. Managing such an extensive array of chips for Llama 4 poses engineering and energy challenges.
It’s estimated that a cluster of this size would require 150 megawatts of power, while the largest U. S. supercomputer, El Capitan, uses just 30 megawatts. Meta plans to spend up to $40 billion this year on infrastructure, reflecting a 42% increase from 2023. Despite rising operating costs of about 9%, Meta's overall sales from advertising have surged over 22%, leading to higher margins and profits even as significant funds are allocated to the Llama projects. In contrast, OpenAI, the leading developer of cutting-edge AI, continues to incur challenges, currently training the larger GPT-5 model while managing its operation as a nonprofit endeavor. Zuckerberg defended Meta's open-source strategy, asserting its cost-effectiveness and trustworthiness compared to proprietary systems. He believes Llama 4 will enhance a variety of features across Meta’s platforms, including the popular Meta AI chatbot, which attracts over 500 million users monthly. The company foresees monetization opportunities through advertising in this feature, suggesting a path to subsidize Llama for broader access.
Brief news summary
On Wednesday, Meta CEO Mark Zuckerberg unveiled significant developments in the Llama 4 generative AI model, powered by over 100,000 Nvidia H100 GPUs. A partial version is expected to launch early next year, starting with smaller variants. This extensive training aims to boost advanced AI technologies, with Zuckerberg highlighting Llama 4's potential for enhanced reasoning skills, though specific features remain under wraps. Unlike competitors, Meta intends to provide Llama models for free download with some commercial restrictions, catering to startups and researchers. Operating such a massive GPU cluster, however, poses engineering and energy challenges, consuming around 150 megawatts. Nevertheless, Meta has seen a 22% rise in sales and improved profit margins linked to its AI efforts. As rivals like OpenAI introduce models such as GPT-5, Zuckerberg seeks to position Llama as a leading open-source AI tool for developers seeking customization. With over 500 million monthly users, Meta AI is strategically positioned for significant growth and new advertising avenues.
AI-powered Lead Generation in Social Media
and Search Engines
Let AI take control and automatically generate leads for you!

I'm your Content Manager, ready to handle your first test assignment
Learn how AI can help your business.
Let’s talk!

Google's AI Tool Generates Convincing Deepfakes, …
Google recently launched Veo 3, an advanced AI video generation tool capable of producing hyper-realistic deepfake videos.

Blockchain: Bold Vision, Overhyped Dream
I recently discussed Pakistan’s emerging role in the crypto space with Raza Rumi on Naya Daur TV.

Broadcom Releases New Networking Chip to Support …
Broadcom has unveiled its newest networking chip, the Tomahawk 6, created to address the growing demands of artificial intelligence (AI) infrastructure.

Tether Launches Omnichain Gold Token ‘XAUt0’ On T…
Tether has teamed up with the TON Foundation to introduce XAUt0, an omnichain version of its gold-backed stablecoin XAUt, aiming to expand digital gold access across multiple blockchains.

AI-Powered Drug Discovery: A Game Changer in Phar…
Artificial intelligence (AI) is transforming the pharmaceutical industry by greatly improving the drug discovery process.

La tokenización inmobiliaria llega a Arabia Saudí
Rafal Real Estate, una empresa destacada en el sector inmobiliario, ha firmado un acuerdo pionero con la empresa estadounidense droppRWA para implementar la tokenización de activos inmobiliarios en Arabia Saudí.

AI in Education: Personalized Learning Experience…
Artificial intelligence (AI) is rapidly reshaping education by offering highly personalized learning experiences tailored to each student's unique needs.