March 20, 2025, 4:13 a.m.

Pruna AI Launches Open-Source Optimization Framework for AI Models

Pruna AI, a European startup focused on developing compression algorithms for AI models, is launching its optimization framework as open source this Thursday. The company has designed a framework that applies several efficiency techniques, including caching, pruning, quantization, and distillation, to a given AI model.

“Our framework standardizes the process of saving and loading compressed models, combines these compression techniques, and evaluates the performance of your compressed model post-optimization,” said John Rachwan, co-founder and CTO of Pruna AI, in an interview with TechCrunch.

Specifically, Pruna AI’s framework can assess whether significant quality loss occurs after a model is compressed and measure the performance gains achieved.

“To use a metaphor, we are akin to Hugging Face in terms of standardizing transformers and diffusers: establishing how to call them, save, and load them, etc. We are doing the same for efficiency methods,” he noted.

Major AI laboratories are already using various compression techniques. For example, OpenAI has used distillation to develop faster iterations of its core models. This approach likely contributed to creating GPT-4 Turbo, a speedier version of GPT-4. The Flux.1-schnell image generation model is another example: it is a distilled variant of the Flux.1 model from Black Forest Labs.

Distillation extracts knowledge from a larger AI model through a "teacher-student" setup. Developers send requests to a teacher model and record its outputs. These responses may then be compared to a dataset for accuracy, and they guide the training of a student model so that it emulates the teacher's behavior.

“For large companies, they typically develop these solutions in-house. In the open-source community, you often find tools focusing on single methods, like one quantization technique for large language models or one caching approach for diffusion models,” Rachwan explained. “However, there’s a lack of comprehensive tools that integrate and simplify all these methods. This is the key benefit that Pruna offers.”

While Pruna AI supports any model type, from large language models to diffusion models, speech-to-text systems, and computer vision applications, the company is currently placing greater emphasis on image and video generation models. Among Pruna AI’s current clients are Scenario and PhotoRoom.

Besides the open-source version, Pruna AI offers an enterprise solution with advanced optimization capabilities, including an optimization agent. “The most thrilling feature we’ll soon release is a compression agent,” Rachwan revealed. “You simply provide your model and specify, ‘I need more speed without sacrificing accuracy by more than 2%.’ The agent then performs its magic, determining the best combination and presenting it to you without any additional work required from the developer.”

Pruna AI charges by the hour for its professional version. “It’s comparable to renting a GPU on AWS or other cloud services,” Rachwan added.

If your model is a critical element of your AI infrastructure, optimizing it can lead to substantial savings on inference costs. For instance, Pruna AI has used its compression framework to make a Llama model eight times smaller with minimal loss in quality. The company hopes clients will see the framework as an investment that pays for itself.

Recently, Pruna AI completed a seed funding round, raising $6.5 million. Notable investors include EQT Ventures, Daphni, Motier Ventures, and Kima Ventures.
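The request Rachwan describes for the compression agent ("more speed without sacrificing accuracy by more than 2%") amounts to a constrained selection: among candidate compression recipes, keep those within the accuracy budget and take the fastest. The sketch below illustrates that idea only. It is not Pruna AI's API or algorithm; the `Candidate` class, the function name, and all numbers are hypothetical, and it assumes the 2% is a relative drop from the baseline accuracy.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Candidate:
    """One hypothetical compression recipe with its measured results."""
    name: str
    speedup: float   # inference speed relative to the baseline (1.0 = unchanged)
    accuracy: float  # score on a held-out evaluation set, in [0, 1]


def pick_fastest_within_budget(
    baseline_accuracy: float,
    candidates: list[Candidate],
    max_relative_drop: float = 0.02,
) -> Optional[Candidate]:
    """Return the fastest candidate whose accuracy stays within the budget.

    Assumption: the budget is relative, so a 2% budget on a 0.90 baseline
    allows accuracy down to 0.90 * 0.98 = 0.882.
    """
    floor = baseline_accuracy * (1.0 - max_relative_drop)
    eligible = [c for c in candidates if c.accuracy >= floor]
    # default=None covers the case where no recipe meets the budget
    return max(eligible, key=lambda c: c.speedup, default=None)


# Made-up measurements, for illustration only
candidates = [
    Candidate("quantization only", speedup=1.8, accuracy=0.895),
    Candidate("pruning + quantization", speedup=2.6, accuracy=0.885),
    Candidate("aggressive pruning", speedup=3.4, accuracy=0.840),
]

best = pick_fastest_within_budget(0.900, candidates)
print(best.name if best else "no candidate meets the budget")
```

With these made-up numbers the aggressive pruning recipe is the fastest but falls below the accuracy floor, so the combined recipe is selected. A real agent would also have to generate and benchmark the candidates, which is the expensive part and is not shown here.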



Brief news summary

Pruna AI, a European startup specializing in AI model compression, has launched an open-source optimization framework that makes AI models more efficient through methods such as caching, pruning, quantization, and distillation. Co-founder and CTO John Rachwan said the framework standardizes how compressed models are saved and loaded, combines compression techniques, and evaluates both the quality loss and the performance gain after compression. He compared this role to what Hugging Face does for transformers and diffusers. The framework supports many model types, with a current emphasis on image and video generation, and the company's clients include Scenario and PhotoRoom. Pruna AI also offers an enterprise version with more advanced optimization tools and plans to release a "compression agent" that picks a combination of techniques to meet a stated speed and accuracy target. The professional version is billed by the hour, and the company says it has made a Llama model eight times smaller with minimal loss. The startup recently raised $6.5 million in seed funding from investors including EQT Ventures, Daphni, Motier Ventures, and Kima Ventures.