None

Nvidia is expanding its presence in the AI industry beyond GPU sales by incorporating its AI-specific software development kit into more applications. The company recently announced that it will add support for its TensorRT-LLM SDK to Windows and models such as Stable Diffusion. Through this move, Nvidia aims to enhance the performance of large language models (LLMs) and related tools. TensorRT optimizes inference, the process of utilizing pretrained information to calculate probabilities and produce results, such as generating a newly created Stable Diffusion image. With this software, Nvidia strives to play a bigger role in the inference aspect of generative AI. By leveraging TensorRT-LLM, which accelerates LLMs on Nvidia's H100 GPUs, the company enables faster running of LLMs, including Meta's Llama 2 and Stability AI's Stable Diffusion. Nvidia claims that this acceleration significantly enhances the experience of more advanced LLM usage, like writing and coding assistants. In simpler terms, Nvidia aims to provide not only the GPUs for training and running LLMs but also the software that enables these models to operate more efficiently, thus discouraging users from seeking cost-effective alternatives for generative AI. The company announced that TensorRT-LLM will be publicly available to anyone interested in using or integrating it, with access to the SDK on its website.
Nvidia already dominates the market for powerful chips used in training LLMs, such as GPT-4, where multiple GPUs are typically required. The demand for its H100 GPUs has skyrocketed, with estimated prices reaching $40, 000 per chip. In response to this increasing demand, Nvidia announced that a newer version of its GPU, the GH200, would be released next year. However, new approaches have emerged to run LLMs without heavily relying on expensive GPUs, as the world of generative AI evolves quickly. Microsoft and AMD, for example, plan to develop their own chips to reduce dependence on Nvidia. Additionally, companies like SambaNova offer services that facilitate running models more easily. While Nvidia currently holds the hardware leadership position in generative AI, it seems to be envisioning a future where users will not have to rely on purchasing a large number of its GPUs. From our sponsor.
Brief news summary
None
AI-powered Lead Generation in Social Media
and Search Engines
Let AI take control and automatically generate leads for you!

I'm your Content Manager, ready to handle your first test assignment
Learn how AI can help your business.
Let’s talk!

Blockchain-based HUMO token backed by government …
Tashkent, Uzbekistan, May 13th, 2025 – Uzbekistan is launching a pilot project for a new asset-backed token named HUMO, which will be tied to government bonds.

Trump's Saudi victory lap belies AI fears
During a recent visit to Saudi Arabia, former President Donald Trump announced a dramatic increase in U.S.-Saudi investment deals totaling over $600 billion.

Challenges loom for blockchain's promise of healt…
MobiHealthNews: Get the newest updates in digital health sent directly to your inbox every day

Donald Trump Announces $600 Billion in AI and Def…
During a high-profile visit to Saudi Arabia, former U.S. President Donald Trump announced a series of ambitious agreements valued at approximately $600 billion, spanning sectors including defense, artificial intelligence (AI), and other industries.

Blockchain's Role in Enhancing Digital Payments
FinTech Daily provides a comprehensive overview of blockchain technology’s transformative impact on digital payment systems globally.

Nvidia to Ship 18,000 Advanced AI Chips to Saudi …
Nvidia, the leading U.S. chipmaker known for advanced graphics processing units and AI technology, is set to deliver 18,000 of its latest AI chips to Saudi Arabia.

Hoskinson Says Cardano Could Be First Blockchain …
Charles Hoskinson, the founder of Cardano, is considering the development of a privacy-enabled stablecoin on the Cardano blockchain.