lang icon En
Sept. 15, 2024, 12:46 a.m.
3291

ElasticDiffusion: Enhancing Image Generation with AI at Rice University

Brief news summary

Generative artificial intelligence, especially diffusion models, often faces challenges in producing consistent and detailed images, particularly with maintaining fine features like facial symmetry in non-square formats. Researchers at Rice University have developed a novel approach called ElasticDiffusion, as presented by doctoral student Moayed Haji Ali at the IEEE 2024 Conference on Computer Vision and Pattern Recognition in Seattle. Unlike earlier models such as Stable Diffusion and DALL-E, which perform well with square images but struggle with distortion in other aspect ratios, ElasticDiffusion enhances image generation by distinguishing local pixel details from global shapes. This advancement minimizes errors in non-square images while ensuring visual coherence, without the need for additional training. Currently, ElasticDiffusion operates at a speed that is 6-9 times slower than conventional models; however, the researchers are optimizing its performance to align with existing methods, enabling its use across various aspect ratios.

Generative artificial intelligence (AI), including models like Stable Diffusion, Midjourney, and DALL-E, often struggles with producing consistent images, especially when it comes to details like facial symmetry and appropriate finger representation. These models generally generate square images, leading to issues when tasked with creating images in different aspect ratios, resulting in anomalies such as extra fingers or distorted shapes. To address these problems, computer scientists at Rice University have developed ElasticDiffusion, a novel method leveraging pre-trained diffusion models. Moayed Haji Ali, a doctoral student at Rice, presented this method at the IEEE 2024 Conference on Computer Vision and Pattern Recognition in Seattle. Haji Ali explained that traditional diffusion models can only generate images at a specific resolution, which is a consequence of overfitting, where an AI model performs well on familiar data but struggles with variations.

ElasticDiffusion improves the approach by separating local and global information during image generation, rather than combining them. This separation helps avoid visual imperfections arising from repetitive data when adapting to non-square images. Haji Ali noted that the process involves initially obtaining a global score encapsulating the image’s overall structure, followed by filling in pixel-level details in sections. This method enables the generation of clearer images across various aspect ratios without necessitating additional model training. While ElasticDiffusion offers enhanced consistency and adaptability in image generation, it comes with a trade-off: it currently requires 6-9 times longer to create images compared to conventional diffusion models. Haji Ali aims to optimize the method to achieve equivalent inference times while retaining the ability to generate high-quality images regardless of aspect ratio.


Watch video about

ElasticDiffusion: Enhancing Image Generation with AI at Rice University

Try our premium solution and start getting clients — at no cost to you

Content creator image

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

March 19, 2026, 2:22 p.m.

Google's AI Mode Cites Itself More Often, With Mo…

Recent data highlights a notable shift in how Google's AI-powered search functions reference information, with a significant rise in self-referencing citations to its own organic search results.

March 19, 2026, 2:21 p.m.

RocketReach Expands Signal Driven AI Prospecting …

RocketReach, a leading sales and recruiting intelligence platform, has announced a strategic partnership with Autobound to enhance its AI-driven prospecting capabilities.

March 19, 2026, 2:19 p.m.

Stability AI Releases Stable Video 3D, an AI Mode…

Stability AI, a leading artificial intelligence company, has introduced Stable Video 3D (SV3D), an innovative AI model that transforms single 2D images into dynamic orbital 3D videos, representing a major breakthrough in video generation technology.

March 19, 2026, 2:17 p.m.

China's Alibaba Targets $100B in AI and Cloud Rev…

China's technology giant Alibaba Group has announced a bold goal to generate over $100 billion in revenue from its artificial intelligence (AI) and cloud businesses within the next five years.

March 19, 2026, 2:13 p.m.

Imaginuity Launches AI Mail

Imaginuity, a leader in innovative marketing solutions, has launched AI Mail, an advanced artificial intelligence-driven direct mail marketing platform.

March 19, 2026, 2:12 p.m.

Meta's New AI-Powered Chatbots: Impressive Featur…

Meta Platforms has revealed a new suite of advanced artificial intelligence systems, which CEO Mark Zuckerberg describes as "the most intelligent AI assistant that you can freely use." These AI agents have been seamlessly integrated into some of Meta's most popular social media platforms, including Facebook, Instagram, and WhatsApp, with the goal of enhancing user interactions through smarter, more intuitive digital assistance.

March 19, 2026, 10:19 a.m.

SJinn Launches Revolutionary Video Agent, an AI C…

SJinn AI has officially introduced its innovative Video Agent, a revolutionary solution set to transform how advertisements and story videos are produced.

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

AI Company welcome image

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today