lang icon En
Nov. 21, 2025, 1:20 p.m.
2708

Decart AI's LSD v2 Breakthrough Enables Real-Time, Low-Latency AI Video Generation

Brief news summary

Over the past year, AI video diffusion models such as OpenAI’s Sora 2 and Google’s Veo 3 have advanced visual realism but face challenges with latency and limited video length due to sequential frame generation. Decart AI’s LSD v2 overcomes these issues using a causal, auto-regressive architecture that enables instant, continuous video creation without duration limits. Key innovations like improved diffusion forcing and history augmentation prevent error accumulation, allowing infinite, high-quality videos that seamlessly adapt to user input. To achieve the subsecond latency required for live interaction, Decart optimized Nvidia Hopper GPUs using techniques including a “mega kernel,” architecture-aware pruning, and shortcut distillation, enabling fast denoising in compact models. This breakthrough supports dynamic applications such as live streaming, gaming, education, and design, providing real-time content modification with minimal delay. LSD v2 marks a significant advancement in real-time, unlimited AI video generation, transforming interactive storytelling and creative expression.

Over the past year, AI-generated video diffusion models have made remarkable advances in visual realism, demonstrated by models like OpenAI’s Sora 2, Google’s Veo 3, and Runway Gen-4. AI video generation is reaching a pivotal stage, with the latest models able to create stunning, lifelike clips. However, these models’ architectures limit their use for real-time interactive applications, as they generate video frames sequentially via complex, computationally demanding steps. Processing each chunk before moving to the next causes latency, preventing live AI video streaming. Most AI practitioners focus on generating clips for later viewing, with live, instant AI video transformation still considered years away. Decart’s team challenged this architectural barrier and developed LSD v2, a model that demonstrates minimal latency is achievable through novel approaches applicable to various AI models. They optimized infrastructure to maximize GPU utilization and accelerated the denoising process critical for preventing error buildup. LSD v2 uses a causal, auto-regressive architecture to generate video instantly and continuously, without output duration limits. Key innovations include: 1. **Infinite Generation via Causal, Auto-regressive Models** To enable streaming output, video models must operate “causally, ” producing each frame based only on preceding frames, reducing computational load. This approach ensures continuity, but over time suffers from error accumulation—small inaccuracies like a misplaced shadow become increasingly distorted, limiting most models to short clips. To counter this, Decart enhanced “diffusion forcing” to denoise frames as they’re generated and introduced “history augmentation, ” training models to recognize and correct corrupted outputs. The causal feedback loop considers prior generated frames, the current input, and user prompts, enabling the model to identify and fix artifacts and output high-quality content indefinitely. This allows continuous real-time editing and transformation based on user input. 2. **Achieving Subsecond Latency Through GPU Optimization** Real-time interactive AI video requires generating each frame within 40 milliseconds to avoid visible lag.

However, causal AI models’ computational intensity clashes with modern GPUs’ design, which favors large batch processing over low latency. Decart addressed this by deeply optimizing Nvidia’s Hopper GPU kernels. Instead of numerous small kernels causing frequent stops, starts, and data movement—which wastes time and leaves much GPU capacity idle—they created a single “mega kernel” to run all model computations in one continuous pass. This approach dramatically improves GPU utilization and speeds processing by an order of magnitude, analogous to how Henry Ford’s assembly line revolutionized manufacturing by streamlining sequential workflows. 3. **Pruning and Shortcut Distillation for Efficiency** Neural networks tend to be over-parameterized, containing many parameters unnecessary for generating desired outputs. Decart applied “architecture-aware pruning” to remove redundant parameters, reducing computational workload and tailoring models closely to hardware architecture. Additionally, they developed “shortcut distillation, ” fine-tuning smaller, lightweight models to match the denoising speed of larger, more power-hungry models. Using these shortcut models reduces the steps needed to generate coherent frames, compounding incremental time savings and accelerating overall output generation. These breakthroughs collectively enable subsecond latency video generation, a crucial milestone that opens AI video to interactive use cases previously impossible. Users can continuously edit content on-the-fly, adapting videos live based on prompts or audience input. This capability offers exciting prospects for live-streaming influencers and Twitch streamers who can dynamically modify content as they broadcast. Beyond entertainment, this technology holds promise for live video games, enabling AI-generated sequences adapting in real-time to player choices—such as branching narratives shaped by user decisions. It also impacts extended reality, immersive education, and large-scale event marketing. Furthermore, AI-generated videos serve as neural rendering engines for professionals like architects and interior designers, enabling rapid prototyping of styles and themes via prompts before finalizing designs. Most remarkably, removing latency while enabling infinite video generation empowers creators to explore longform content interactively. They can adjust scenes, lighting, camera angles, and character expressions in real time as the video unfolds, transforming storytelling into a dynamic, user-driven experience. Kfir Aberman, founding member of Decart AI and head of its San Francisco office, leads efforts in transforming real-time generative video research into products. His work focuses on building interactive, personalized AI systems that blend research excellence with creative user experiences.


Watch video about

Decart AI's LSD v2 Breakthrough Enables Real-Time, Low-Latency AI Video Generation

Try our premium solution and start getting clients — at no cost to you

Content creator image

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

April 3, 2026, 10:25 a.m.

Oracle's AI Cloud Services: Transforming Enterpri…

Oracle Corporation has announced a major expansion of its cloud service portfolio by incorporating advanced artificial intelligence (AI) capabilities into its platform.

April 3, 2026, 10:23 a.m.

Docket Launches AI Seller to Reinvent Digital Buy…

Docket, a leading AI revenue platform designed for modern enterprises, has introduced its newest innovation: the AI Seller agent.

April 3, 2026, 10:18 a.m.

Amazon and Partners Transform Retail with AI and …

Next-Gen Retail Driven by AI and Cloud Technology In today’s rapidly changing retail environment, industry leaders are increasingly adopting cutting-edge technologies such as artificial intelligence (AI) and cloud computing to transform their operations, customer engagement, and supply chain management

April 3, 2026, 10:16 a.m.

PK SEO Announces Launch of Google AI Mode in Aust…

PK SEO, a leading digital marketing firm specializing in search engine optimization, has announced the launch of Google AI Mode—a groundbreaking approach that leverages advanced artificial intelligence to revolutionize SEO strategies in Australia.

April 3, 2026, 6:19 a.m.

Microsoft's Azure AI Introduces New Tools for Ent…

Microsoft’s Azure AI has introduced a comprehensive suite of advanced tools designed to enhance enterprise solutions by automating complex business processes and improving decision-making through sophisticated AI integration.

April 3, 2026, 6:16 a.m.

Top KingWin Secures $4.8 Million AI Robot Sales C…

Top KingWin Ltd, a leading AI robotics company, has made a major advance by securing a $4.8 million sales contract through its Colorado-based subsidiary, Top KingWin Hi Tech Inc.

April 3, 2026, 6:15 a.m.

Google AI Overviews: What's Changing for SEO & SE…

Google has officially broadened access to its AI Overviews feature, formerly known as the Search Generative Experience (SGE), extending availability beyond its experimental Labs phase.

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

AI Company welcome image

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today