lang icon English
Auto-Filling SEO Website as a Gift

Launch Your AI-Powered Business and get clients!

No advertising investment needed—just results. AI finds, negotiates, and closes deals automatically

Oct. 17, 2023, 11:34 a.m.
172

None

Nvidia is expanding its presence in the AI industry beyond GPU sales by incorporating its AI-specific software development kit into more applications. The company recently announced that it will add support for its TensorRT-LLM SDK to Windows and models such as Stable Diffusion. Through this move, Nvidia aims to enhance the performance of large language models (LLMs) and related tools. TensorRT optimizes inference, the process of utilizing pretrained information to calculate probabilities and produce results, such as generating a newly created Stable Diffusion image. With this software, Nvidia strives to play a bigger role in the inference aspect of generative AI. By leveraging TensorRT-LLM, which accelerates LLMs on Nvidia's H100 GPUs, the company enables faster running of LLMs, including Meta's Llama 2 and Stability AI's Stable Diffusion. Nvidia claims that this acceleration significantly enhances the experience of more advanced LLM usage, like writing and coding assistants. In simpler terms, Nvidia aims to provide not only the GPUs for training and running LLMs but also the software that enables these models to operate more efficiently, thus discouraging users from seeking cost-effective alternatives for generative AI. The company announced that TensorRT-LLM will be publicly available to anyone interested in using or integrating it, with access to the SDK on its website.

Nvidia already dominates the market for powerful chips used in training LLMs, such as GPT-4, where multiple GPUs are typically required. The demand for its H100 GPUs has skyrocketed, with estimated prices reaching $40, 000 per chip. In response to this increasing demand, Nvidia announced that a newer version of its GPU, the GH200, would be released next year. However, new approaches have emerged to run LLMs without heavily relying on expensive GPUs, as the world of generative AI evolves quickly. Microsoft and AMD, for example, plan to develop their own chips to reduce dependence on Nvidia. Additionally, companies like SambaNova offer services that facilitate running models more easily. While Nvidia currently holds the hardware leadership position in generative AI, it seems to be envisioning a future where users will not have to rely on purchasing a large number of its GPUs. From our sponsor.



Brief news summary

None
Business on autopilot

AI-powered Lead Generation in Social Media
and Search Engines

Let AI take control and automatically generate leads for you!

I'm your Content Manager, ready to handle your first test assignment

Language

Content Maker

Our unique Content Maker allows you to create an SEO article, social media posts, and a video based on the information presented in the article

news image

Last news

The Best for your Business

Learn how AI can help your business.
Let’s talk!

May 14, 2025, 1:31 a.m.

Blockchain-based HUMO token backed by government …

Tashkent, Uzbekistan, May 13th, 2025 – Uzbekistan is launching a pilot project for a new asset-backed token named HUMO, which will be tied to government bonds.

May 14, 2025, 1:15 a.m.

Trump's Saudi victory lap belies AI fears

During a recent visit to Saudi Arabia, former President Donald Trump announced a dramatic increase in U.S.-Saudi investment deals totaling over $600 billion.

May 14, 2025, 12:08 a.m.

Challenges loom for blockchain's promise of healt…

MobiHealthNews: Get the newest updates in digital health sent directly to your inbox every day

May 13, 2025, 11:40 p.m.

Donald Trump Announces $600 Billion in AI and Def…

During a high-profile visit to Saudi Arabia, former U.S. President Donald Trump announced a series of ambitious agreements valued at approximately $600 billion, spanning sectors including defense, artificial intelligence (AI), and other industries.

May 13, 2025, 10:50 p.m.

Blockchain's Role in Enhancing Digital Payments

FinTech Daily provides a comprehensive overview of blockchain technology’s transformative impact on digital payment systems globally.

May 13, 2025, 10:15 p.m.

Nvidia to Ship 18,000 Advanced AI Chips to Saudi …

Nvidia, the leading U.S. chipmaker known for advanced graphics processing units and AI technology, is set to deliver 18,000 of its latest AI chips to Saudi Arabia.

May 13, 2025, 9:28 p.m.

Hoskinson Says Cardano Could Be First Blockchain …

Charles Hoskinson, the founder of Cardano, is considering the development of a privacy-enabled stablecoin on the Cardano blockchain.

All news