May 6, 2025, 11:47 a.m.

AI Hallucinations Worsen Despite Advances in Reasoning Models - Industry Challenges Explained

Artificial intelligence models have long faced the challenge of hallucinations, an industry euphemism for the false information that large language models frequently present as fact. Judging by the direction taken by the latest "reasoning" models, which companies like Google and OpenAI build to "think" through problems before responding, the issue is worsening rather than improving. As reported by The New York Times, as AI models grow more powerful, their tendency to hallucinate increases, not decreases.

This is an inconvenient reality as more users flock to AI chatbots like OpenAI's ChatGPT, applying them to a growing variety of tasks. When chatbots generate doubtful or incorrect statements, users risk embarrassment or worse. Even more troubling, AI firms are struggling to identify why chatbots are producing more errors than before, a perplexing situation that underscores that even the creators of AI do not fully understand how the technology works.

This pattern challenges the widespread belief within the industry that scaling AI models will inherently make them more reliable and capable. The stakes are high: companies continue to invest tens of billions of dollars in AI infrastructure for increasingly large and powerful "reasoning" models.

Some experts believe hallucinations may be intrinsic to the technology, making the problem nearly impossible to eliminate entirely. “Despite our best efforts, they will always hallucinate,” Amr Awadallah, CEO of AI startup Vectara, told The New York Times. “That will never go away.”

The problem is so pervasive that entire companies now specialize in helping businesses manage and mitigate hallucinations. “Not dealing with these errors properly basically eliminates the value of AI systems,” Pratik Verma, cofounder of Okahu, a consultancy that helps businesses use AI more effectively, told the NYT.

This comes after OpenAI's latest reasoning models, o3 and o4-mini, released late last month, were found to hallucinate more frequently than earlier versions. On OpenAI’s internal accuracy benchmark, the o4-mini model hallucinated 48 percent of the time. The o3 model had a hallucination rate of 33 percent, roughly double that of the company’s previous reasoning models. As The New York Times notes, competitors such as Google and DeepSeek face the same problems, indicating an industry-wide challenge.

Experts warn that as AI models grow larger, the improvements each new model brings over the last are diminishing. With firms rapidly exhausting available training data, many are resorting to synthetic (AI-generated) data to train models, which could have disastrous effects.

In summary, despite ongoing efforts, hallucinations are more widespread than ever, and the technology is currently showing no signs of improvement.

For more on AI hallucinations, see: “You Can’t Lick a Badger Twice”: Google's AI Is Making Up Explanations for Nonexistent Folksy Sayings.



Brief news summary

Artificial intelligence models increasingly generate false information, known as “hallucinations,” and present it as fact, despite improvements in their reasoning abilities. The issue affects widely used AI tools like OpenAI’s ChatGPT and exposes users to embarrassment or worse. According to The New York Times, more powerful models tend to hallucinate more, challenging the belief that scaling models makes them more reliable. Some experts consider hallucinations intrinsic to the technology, even as investment in AI infrastructure grows. Entire companies now specialize in helping businesses manage and mitigate hallucinations, on the view that leaving them unaddressed eliminates the value of AI systems. On OpenAI’s internal accuracy benchmark, o4-mini hallucinated 48 percent of the time and o3 33 percent, and competitors such as Google and DeepSeek face similar problems, pointing to an industry-wide challenge. Diminishing gains from larger models and growing reliance on synthetic training data may make matters worse. In summary, AI hallucinations remain a significant, escalating problem with no clear solution yet.
