lang icon En
Jan. 19, 2025, 4:20 p.m.
2484

Epoch AI Scrutinized for Disclosing OpenAI Funding after FrontierMath Release

Brief news summary

Epoch AI, a nonprofit dedicated to setting math benchmarks for AI, faced backlash after revealing OpenAI funding on December 20. This financial support is intended for creating FrontierMath, a tool designed to assess AI mathematical skills, particularly in relation to the upcoming o3 model. Concerns arose regarding potential bias and transparency, as many contributors were reportedly unaware of this backing. Critics, including the contractor "Meemi" from LessWrong, voiced skepticism about the benchmarks' impartiality, citing OpenAI’s prior access to testing materials. In defense of the initiative, Tamay Besiroglu, Epoch AI's associate director, acknowledged the transparency issues but staunchly supported FrontierMath’s integrity. He explained that legal constraints had affected timely disclosures and emphasized improved communication with contributors. Besiroglu also pointed out an informal agreement that bars OpenAI from utilizing benchmark data for training purposes. Ellot Glazer, Epoch AI's chief mathematician, accepted that FrontierMath's results had not been independently validated by OpenAI but expressed optimism about their reliability.

A nonprofit organization working on math benchmarks for AI has recently come under scrutiny for not disclosing its financial backing from OpenAI until now, prompting accusations of impropriety within the AI community. Epoch AI, a nonprofit primarily supported by Open Philanthropy— a research and grant-making foundation— announced on December 20 that OpenAI funded the development of FrontierMath. This benchmarking test features expert-level problems to evaluate an AI's mathematical capabilities and was utilized by OpenAI to demonstrate its upcoming flagship AI, o3. In a post on the forum LessWrong, a contractor for Epoch AI using the username "Meemi" claimed that many contributors to the FrontierMath benchmark were unaware of OpenAI's involvement until it was publicly revealed. "The communication regarding this has been non-transparent, ” Meemi stated. “In my opinion, Epoch AI should have disclosed OpenAI’s funding, and contributors ought to have clear information about the potential implications of their work before deciding to participate in a benchmark. " Some users on social media expressed concerns that the lack of transparency could damage FrontierMath's standing as an impartial benchmark. Alongside funding FrontierMath, OpenAI had access to numerous problems and solutions within the benchmark—a detail Epoch AI did not share before December 20, the day o3 was announced. In response to Meemi's comments, Tamay Besiroglu, the associate director of Epoch AI and one of its co-founders, maintained that the integrity of FrontierMath was unaffected but acknowledged that Epoch AI "erred" in failing to be more forthright. "We were bound by restrictions on disclosing the partnership until around the o3 launch, and in hindsight, we should have insisted on being more transparent with benchmark contributors as soon as feasible, " Besiroglu wrote.

"Our mathematicians deserved to know who might have access to their contributions. Even with contractual limitations on our disclosures, we should have prioritized transparency with our contributors in our agreement with OpenAI. " Besiroglu clarified that, while OpenAI has access to FrontierMath, there is a "verbal agreement" preventing it from using the problem set to train its AI—essentially avoiding "teaching to the test. " Additionally, Epoch AI maintains a "separate holdout set" to ensure independent verification of FrontierMath benchmark results, Besiroglu explained. "OpenAI has …fully supported our choice to retain a separate, unseen holdout set, " he added. However, the situation was complicated when Epoch AI's lead mathematician, Ellot Glazer, noted in a Reddit post that Epoch AI has not yet been able to independently verify OpenAI's FrontierMath results for o3. "In my view, [OpenAI's] score is genuine (i. e. , they did not train on the dataset), and they have no motivation to misrepresent their internal benchmark performances, " Glazer remarked. "However, we cannot provide confirmation until our independent evaluation concludes. "


Watch video about

Epoch AI Scrutinized for Disclosing OpenAI Funding after FrontierMath Release

Try our premium solution and start getting clients — at no cost to you

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

Jan. 20, 2026, 1:36 p.m.

The 18 social media trends to shape your 2026 str…

Creating a social media marketing trends report for 2026 revealed the complexity and fragmentation of current trends, which no longer follow linear or predictable patterns.

Jan. 20, 2026, 1:28 p.m.

Bluefish AI Raises $20M in Series A Funding to En…

Bluefish AI, a New York-based marketing technology firm specializing in AI-driven search engine optimization (SEO) tools, has secured $20 million in Series A funding to accelerate growth and enhance its innovative SEO platform.

Jan. 20, 2026, 1:24 p.m.

AI Company Announces Breakthrough in Natural Lang…

LanguageTech AI, a leader in AI-driven language solutions, has announced a major breakthrough in language processing technology.

Jan. 20, 2026, 1:19 p.m.

Olelo Intelligence: $1 Million Angel Round Closed…

Olelo Intelligence, a Honolulu-based startup creating an AI sales coaching platform specifically for high-volume automotive repair shops, has secured $1 million in angel funding to improve its product and expand deployments across North America.

Jan. 20, 2026, 1:17 p.m.

AI Video Conferencing Tools Improve Remote Work C…

The rise of remote work has greatly accelerated the adoption of AI-powered video conferencing platforms.

Jan. 20, 2026, 1:15 p.m.

Perplexity AI Interview Explains How AI Search Wo…

I recently spoke with Jesse Dwyer of Perplexity about SEO and AI search, focusing on what SEOs should prioritize when optimizing for AI search.

Jan. 20, 2026, 9:33 a.m.

Olelo Intelligence Raises $1M Led by Hawaiʻi Ange…

HONOLULU, Jan.

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today