lang icon En
March 14, 2025, 10:20 a.m.
1868

Study Reveals Accuracy Issues with AI News Search Tools

Brief news summary

A recent study from the Tow Center for Digital Journalism, covered by the Columbia Journalism Review, reveals critical accuracy issues with generative AI tools used for news searches. Researchers examined eight AI systems and found that over 60% of news queries returned incorrect information, despite the fact that about 25% of Americans prefer these AI tools over traditional search engines. The error rates varied significantly, with Perplexity at 37%, ChatGPT Search at 67%, and Grok 3 alarming at 94%. Analyzing 1,600 queries, the study showed that AI tools frequently provided plausible but false information, even with limited training. Notably, some premium services, like Grok 3, performed worse than free alternatives. Other concerns included non-compliance with Robot Exclusion Protocols, unauthorized access to paywalled content, and reliance on syndicated sources, resulting in broken links. These issues present major challenges for publishers utilizing free AI tools. Although OpenAI and Microsoft acknowledged the findings, they did not specifically respond to the highlighted problems. For further information, refer to the Columbia Journalism Review’s website.

A recent study by the Columbia Journalism Review's Tow Center for Digital Journalism reveals significant accuracy problems with generative AI models used for news searches. Testing eight AI-driven search tools, researchers found that over 60% of queries yielded incorrect information about news sources. About 25% of Americans currently use these AI models instead of traditional search engines, raising concerns about their reliability. The error rates varied among different tools. For instance, Perplexity made mistakes in 37% of queries, while ChatGPT Search saw a 67% error rate (134 out of 200 queries). Grok 3 had the highest at 94%. To conduct the tests, researchers provided direct excerpts from real news articles and asked the AI tools to identify corresponding details like headlines, publishers, dates, and URLs, totaling 1, 600 queries. A concerning trend noted was that instead of declining to answer when unsure, the models often offered plausible-sounding but erroneous responses, a pattern consistent across all tools tested. Premium versions of these AI tools, like Perplexity Pro ($20/month) and Grok 3's premium service ($40/month), sometimes performed worse, as they frequently provided incorrect answers despite correctly address a higher number of prompts. Their tendency to offer uncertain responses contributed to higher overall error rates. The study also raised issues regarding publishers' control over their content.

Some AI tools ignored the Robot Exclusion Protocols meant to prevent unauthorized access to certain content. For instance, Perplexity’s free version identified excerpts from paywalled National Geographic articles, even though access was explicitly disallowed. Furthermore, when AI tools did cite sources, they often linked to syndicated content on sites like Yahoo News instead of original publishers. A significant problem arose with URL fabrication—over half of the citations from Google's Gemini and Grok 3 led to broken or non-existent pages, with Grok 3 having 154 out of 200 citations result in error pages. This situation places publishers in a difficult position: blocking AI crawlers could eliminate attribution, while allowing access facilitates content reuse without benefiting the original sites. Mark Howard, COO of Time magazine, expressed concerns regarding transparency and control, but also suggested potential for improvement, stating that current AI tools will evolve positively. Howard pointedly criticized users who expect complete accuracy from free AI services, suggesting that skepticism is necessary. OpenAI and Microsoft acknowledged the study's findings but did not directly respond to the issues raised. OpenAI emphasized its commitment to supporting publishers, while Microsoft claimed compliance with Robot Exclusion Protocols. This report builds on earlier findings from November 2024, which similarly highlighted accuracy issues with ChatGPT's handling of news content. For further details, the full report is available on the Columbia Journalism Review's website.


Watch video about

Study Reveals Accuracy Issues with AI News Search Tools

Try our premium solution and start getting clients — at no cost to you

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

Dec. 10, 2025, 1:25 p.m.

Kodec AI Research Reveals 'Rogue Sales Rep' Probl…

Newark, DE, Dec.

Dec. 10, 2025, 1:23 p.m.

Databricks CEO Ali Ghodsi says his company will b…

In August, Ghodsi told the Wall Street Journal that he believed Databricks, which is reportedly negotiating to raise funding at a $134 billion valuation, had "a shot to be a trillion-dollar company." At Fortune’s Brainstorm AI conference in San Francisco on Tuesday, he detailed how this could occur, outlining a “trifecta” of growth areas set to fuel the company’s next phase of expansion.

Dec. 10, 2025, 1:17 p.m.

James Shears Joins ThinkAnalytics as Senior VP, A…

James Shears has assumed the role of senior vice president of advertising at ThinkAnalytics, where he leads the global strategy and commercial expansion of the company’s AI-powered advertising solutions.

Dec. 10, 2025, 1:13 p.m.

SEO Is Dead: Welcome To GEO And Generative AI Sea…

The search engine landscape is undergoing a transformative shift, signaling the end of traditional search as we know it.

Dec. 10, 2025, 1:13 p.m.

Radnor High School investigating reports of ‘inap…

Officials at Radnor High School have announced that an investigation is underway following reports of an AI-generated video allegedly depicting students engaging in inappropriate behavior circulating within the school.

Dec. 10, 2025, 9:49 a.m.

Microsoft Slashes AI Sales Growth Targets Amid Cu…

Microsoft has recently revised its sales growth targets for its AI agent products after many sales personnel struggled to meet their quotas during the fiscal year ending in June, as reported by The Information.

Dec. 10, 2025, 9:28 a.m.

AI Seepage in Products and Marketing: Impact on C…

AI-generated content is increasingly appearing in product descriptions and marketing campaigns, a trend explored by Pangram.

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today