lang icon English
Oct. 26, 2024, 6:22 p.m.
1234

Sustainable AI Deployment: The Shift to Smaller Models

In the past two years, large frontier AI models, boasting billions to trillions of parameters, have gained significant traction, integrating into various applications and business tools almost daily. However, the rapid expansion of these resource-intensive models raises concerns about their sustainability. This has prompted major tech companies, including Microsoft, Google, and AWS, to explore nuclear power to support the massive data center infrastructure needed for future AI growth. While cutting-edge models from OpenAI, NVIDIA, Google, and Anthropic demonstrate unprecedented capabilities, they often exceed requirements for many applications, likened to using a Formula 1 car for daily commuting. Thus, there's a growing interest in smaller models, typically containing hundreds of millions to less than 10 billion parameters, that are efficient and cost-effective. NVIDIA recently introduced its NIM (NVIDIA inference Microservice) technology, which streamlines AI deployment and allows for industry-specific applications, such as drug discovery and code generation, while minimizing computational demands.

The collaboration between NVIDIA and Accenture exemplifies how specialized microservices and expertise can facilitate faster AI adoption in enterprises. Similarly, IBM unveiled its Granite 3. 0 models, competitive with Llama and Mistral, highlighting the effectiveness of small language models in various applications. IBM’s open-source solutions can be implemented on multiple cloud platforms, including its own watsonx, demonstrating a strong focus on enterprise needs. The future of enterprise AI looks toward a blend of models and adaptable infrastructure that prioritizes outcome-based projects, driving advancements in areas like automation and digital labor. The notion that a single, large-scale model can address all enterprise needs is misguided; many defined use cases would benefit more from smaller, specialized models that offer better energy efficiency and cost management, while simplifying data governance. Research into sophisticated AI will continue, but for many business applications, smaller foundation models represent a more sustainable approach. This not only allows for broader and more efficient AI deployment but also significantly reduces operational costs, making it an attractive choice for enterprises seeking to harness the potential of generative and agentic AI solutions.



Brief news summary

Large-scale generative AI models, equipped with billions of parameters, are increasingly adopted in various industries. However, their substantial resource demands have prompted major tech firms like Microsoft, Google, and AWS to consider nuclear energy for powering data centers. While models from OpenAI, NVIDIA, Google, and Anthropic exhibit impressive capabilities, many applications don't require such advanced technologies, similar to using a sports car for everyday driving. This situation highlights the necessity for smaller, energy-efficient models tailored for specific tasks. Recent innovations, such as NVIDIA's NIM software and IBM's Granite 3.0, offer optimized inference engines that consume less energy than larger models like Llama. The industry is shifting towards adaptable infrastructures focusing on targeted AI solutions, fostering the development of customized models that improve energy and cost efficiency. Although high-level AI research remains crucial, smaller models often provide more practical solutions for businesses, facilitating sustainable and effective AI integration. This shift enables organizations to customize generative and agentic AI technologies according to their specific needs.

Watch video about

Sustainable AI Deployment: The Shift to Smaller Models

Try our premium solution and start getting clients — at no cost to you

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

Oct. 22, 2025, 6:26 a.m.

Built For What's Next: Tinuiti Introduces AI SEO,…

Tinuiti, the largest independent full-funnel marketing agency in the U.S., announced the launch of its expanded AI SEO offering, reflecting the evolving nature of search and AI-powered discovery.

Oct. 22, 2025, 6:22 a.m.

AI in Video Games: Creating More Immersive Experi…

The video game industry is undergoing a transformative shift as developers increasingly incorporate artificial intelligence (AI) into crafting game worlds and character behaviors.

Oct. 22, 2025, 6:21 a.m.

AI Use in American Newspapers is Widespread, Unev…

A recent extensive study analyzing 186,000 articles from the online editions of 1,500 American newspapers has found that approximately 9 percent of newly published articles are either partially or entirely generated by artificial intelligence (AI).

Oct. 22, 2025, 6:21 a.m.

OpenAI CEO Sam Altman Announces ChatGPT Will Allo…

Today marks National Cheese Curd Day, a celebration especially beloved by our readers in Wisconsin, a state renowned for its cheese production and rich dairy heritage.

Oct. 22, 2025, 6:20 a.m.

How AI is Redefining Hotel Marketing and Why It’s…

In recent years, artificial intelligence (AI) has evolved from a buzzword into a business essential, especially in the hotel industry.

Oct. 22, 2025, 6:18 a.m.

Tesla Earnings Are Almost Here. Pay Attention to …

Tesla is set to report its third-quarter earnings after the market closes on Wednesday.

Oct. 21, 2025, 2:32 p.m.

AI in Social Media, A $5.95 Billion Opportunity b…

The artificial intelligence (AI) market within the social media sector is experiencing remarkable growth, with forecasts predicting an increase from a market value of 1.68 billion US dollars in 2023 to an estimated 5.95 billion US dollars by 2028.

All news

AI team for your Business

Automate Marketing, Sales, SMM & SEO

and get clients on autopilot — from social media and search engines. No ads needed

and get clients today