lang icon En
Feb. 26, 2025, 10:54 p.m.
2404

Inception's Revolutionary AI Model: A Blend of Diffusion and Language Technology

Brief news summary

Inception, a startup launched by Stanford professor Stefano Ermon in Palo Alto, has unveiled an innovative diffusion-based large language model (DLM). This model integrates the strengths of conventional large language models (LLMs) with the rapid processing capabilities of diffusion models, known for their prowess in generating multimedia content like images, videos, and audio. Ermon explains that traditional LLMs generate text sequentially, leading to slower outputs, while diffusion models leverage extensive data representations to facilitate parallel processing. This significantly accelerates text production, a breakthrough achieved through comprehensive research by Ermon and his student. The development has attracted interest from Fortune 100 companies eager to enhance AI performance by reducing latency and optimizing GPU utilization. Inception offers an API and various deployment solutions, claiming that their DLMs can deliver results up to ten times faster than current LLMs while lowering operational costs. With a strong emphasis on efficiency, Inception seeks to establish itself as a leading player in the dynamic AI landscape.

Inception, a newly founded company in Palo Alto, initiated by Stanford computer science professor Stefano Ermon, claims to have created a groundbreaking AI model utilizing “diffusion” technology. This innovative model is referred to as a diffusion-based large language model, or “DLM” for short. Currently, the generative AI models garnering the most attention can be categorized into two main types: large language models (LLMs) and diffusion models. LLMs, which are designed on transformer architecture, specialize in text generation. In contrast, diffusion models, the technology behind AI platforms like Midjourney and OpenAI’s Sora, primarily focus on generating images, video, and audio. According to Inception, its model combines the capabilities of conventional LLMs—such as code generation and question-answering—with significantly enhanced speed and lower computing costs. Ermon shared with TechCrunch that he has long explored the application of diffusion models to text generation in his research lab at Stanford. His work emerged from the observation that traditional LLMs operate at a slower pace compared to diffusion technologies. With LLMs, Ermon explained, “you cannot generate the second word until you’ve produced the first one, and the third word can’t be generated until the first two are complete. ” Seeking an approach to apply diffusion mechanisms to text generation, Ermon noted that, unlike LLMs that operate sequentially, diffusion models begin with a rough approximation of the output (for example, an image) and refine the data comprehensively in one go. Ermon theorized that generating and modifying substantial text blocks in parallel could be feasible using diffusion models.

After several years of research, he and one of his students achieved a significant breakthrough, which they documented in a research paper published last year. Recognizing the potential of this advancement, Ermon established Inception last summer, bringing on board former students Aditya Grover, a professor at UCLA, and Volodymyr Kuleshov from Cornell University to co-lead the venture. While Ermon opted not to disclose specific funding details for Inception, TechCrunch has learned that the Mayfield Fund is among its investors. Inception has already secured contracts with various clients, including unnamed Fortune 100 companies, by addressing their pressing requirements for lower AI latency and enhanced speed, according to Ermon. “Our models can leverage GPUs significantly more efficiently, ” Ermon asserted, referring to the graphics processing units typically employed to run production models. “I believe this is transformative and will alter how language models are developed. ” The company provides an API alongside options for on-premises and edge device deployments, model fine-tuning support, and a range of ready-to-use DLMs tailored for various applications. Inception claims that its DLMs can operate up to 10 times faster than traditional LLMs while incurring costs that are also 10 times lower. A company representative informed TechCrunch, “Our ‘small’ coding model equals the performance of [OpenAI’s] GPT-4o mini yet operates at more than 10 times the speed. Our ‘mini’ model surpasses small open-source alternatives like [Meta’s] Llama 3. 1 8B, achieving over 1, 000 tokens per second. ”


Watch video about

Inception's Revolutionary AI Model: A Blend of Diffusion and Language Technology

Try our premium solution and start getting clients — at no cost to you

Content creator image

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

March 9, 2026, 10:26 a.m.

British AI datacentre firm Nscale raises $2bn as …

Nscale, a UK company crucial to the government’s AI ambitions, has secured $2bn (£1.5bn) in a funding round and appointed former Meta executives Sheryl Sandberg and Nick Clegg to its board of directors.

March 9, 2026, 10:22 a.m.

AI-Driven Video Content Moderation: Balancing Aut…

In recent years, social media platforms have dramatically evolved their approach to content moderation, especially for video material, due to the surge in user-generated videos.

March 9, 2026, 10:19 a.m.

AI Overviews Reduce Clicks by 58%

Recent insights from Anicca reveal a significant impact of AI-generated content on user engagement with organic search results.

March 9, 2026, 10:15 a.m.

Vista Social Leads the Way as the First SMM Tool …

Vista Social, a leading social media marketing platform, has launched a groundbreaking integration with Canva's AI Text to Image generator, marking a major advancement in digital content creation.

March 9, 2026, 10:15 a.m.

Highspot Launches New Agentic AI to Help Sales Te…

Highspot has unveiled its Winter Product Release for 2026, introducing a revolutionary feature called Deal Intelligence powered by an agentic AI named Deal Agent.

March 9, 2026, 10:12 a.m.

Meta Integrates Manus AI into Ads Manager

Meta has significantly advanced its advertising platform by integrating Manus AI directly into Ads Manager, aiming to streamline and enhance campaign management through AI within a single interface.

March 9, 2026, 6:20 a.m.

Nvidia-backed Nscale valued at $14.6 billion in f…

In this article: March 9 (Reuters) - The British company Nscale, an artificial intelligence group supported by Nvidia, announced on Monday that it has been valued at $14

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

AI Company welcome image

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today