lang icon En
Jan. 19, 2025, 4:20 p.m.
2238

Epoch AI Scrutinized for Disclosing OpenAI Funding after FrontierMath Release

Brief news summary

Epoch AI, a nonprofit dedicated to setting math benchmarks for AI, faced backlash after revealing OpenAI funding on December 20. This financial support is intended for creating FrontierMath, a tool designed to assess AI mathematical skills, particularly in relation to the upcoming o3 model. Concerns arose regarding potential bias and transparency, as many contributors were reportedly unaware of this backing. Critics, including the contractor "Meemi" from LessWrong, voiced skepticism about the benchmarks' impartiality, citing OpenAI’s prior access to testing materials. In defense of the initiative, Tamay Besiroglu, Epoch AI's associate director, acknowledged the transparency issues but staunchly supported FrontierMath’s integrity. He explained that legal constraints had affected timely disclosures and emphasized improved communication with contributors. Besiroglu also pointed out an informal agreement that bars OpenAI from utilizing benchmark data for training purposes. Ellot Glazer, Epoch AI's chief mathematician, accepted that FrontierMath's results had not been independently validated by OpenAI but expressed optimism about their reliability.

A nonprofit organization working on math benchmarks for AI has recently come under scrutiny for not disclosing its financial backing from OpenAI until now, prompting accusations of impropriety within the AI community. Epoch AI, a nonprofit primarily supported by Open Philanthropy— a research and grant-making foundation— announced on December 20 that OpenAI funded the development of FrontierMath. This benchmarking test features expert-level problems to evaluate an AI's mathematical capabilities and was utilized by OpenAI to demonstrate its upcoming flagship AI, o3. In a post on the forum LessWrong, a contractor for Epoch AI using the username "Meemi" claimed that many contributors to the FrontierMath benchmark were unaware of OpenAI's involvement until it was publicly revealed. "The communication regarding this has been non-transparent, ” Meemi stated. “In my opinion, Epoch AI should have disclosed OpenAI’s funding, and contributors ought to have clear information about the potential implications of their work before deciding to participate in a benchmark. " Some users on social media expressed concerns that the lack of transparency could damage FrontierMath's standing as an impartial benchmark. Alongside funding FrontierMath, OpenAI had access to numerous problems and solutions within the benchmark—a detail Epoch AI did not share before December 20, the day o3 was announced. In response to Meemi's comments, Tamay Besiroglu, the associate director of Epoch AI and one of its co-founders, maintained that the integrity of FrontierMath was unaffected but acknowledged that Epoch AI "erred" in failing to be more forthright. "We were bound by restrictions on disclosing the partnership until around the o3 launch, and in hindsight, we should have insisted on being more transparent with benchmark contributors as soon as feasible, " Besiroglu wrote.

"Our mathematicians deserved to know who might have access to their contributions. Even with contractual limitations on our disclosures, we should have prioritized transparency with our contributors in our agreement with OpenAI. " Besiroglu clarified that, while OpenAI has access to FrontierMath, there is a "verbal agreement" preventing it from using the problem set to train its AI—essentially avoiding "teaching to the test. " Additionally, Epoch AI maintains a "separate holdout set" to ensure independent verification of FrontierMath benchmark results, Besiroglu explained. "OpenAI has …fully supported our choice to retain a separate, unseen holdout set, " he added. However, the situation was complicated when Epoch AI's lead mathematician, Ellot Glazer, noted in a Reddit post that Epoch AI has not yet been able to independently verify OpenAI's FrontierMath results for o3. "In my view, [OpenAI's] score is genuine (i. e. , they did not train on the dataset), and they have no motivation to misrepresent their internal benchmark performances, " Glazer remarked. "However, we cannot provide confirmation until our independent evaluation concludes. "


Watch video about

Epoch AI Scrutinized for Disclosing OpenAI Funding after FrontierMath Release

Try our premium solution and start getting clients — at no cost to you

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

Dec. 12, 2025, 1:42 p.m.

Disney Sends Cease-and-Desist to Google Over AI C…

The Walt Disney Company has initiated a significant legal action against Google by issuing a cease-and-desist letter, accusing the tech giant of infringing on Disney’s copyrighted content during the training and development of generative artificial intelligence (AI) models without providing compensation.

Dec. 12, 2025, 1:35 p.m.

AI and the Future of Search Engine Optimization

As artificial intelligence (AI) advances and increasingly integrates into digital marketing, its influence on search engine optimization (SEO) is becoming significant.

Dec. 12, 2025, 1:33 p.m.

Artificial Intelligence: MiniMax and Zhipu AI Pla…

MiniMax and Zhipu AI, two leading artificial intelligence companies, are reportedly preparing to go public on the Hong Kong Stock Exchange as early as January next year.

Dec. 12, 2025, 1:31 p.m.

OpenAI Appoints Slack CEO Denise Dresser as Chief…

Denise Dresser, CEO of Slack, is set to leave her position to become Chief Revenue Officer at OpenAI, the company behind ChatGPT.

Dec. 12, 2025, 1:30 p.m.

AI Video Synthesis Techniques Improve Film Produc…

The film industry is experiencing a major transformation as studios increasingly incorporate artificial intelligence (AI) video synthesis techniques to improve post-production workflows.

Dec. 12, 2025, 1:24 p.m.

19 best social media AI tools to transform your s…

AI is revolutionizing social media marketing by offering tools that simplify and enhance audience engagement.

Dec. 12, 2025, 9:42 a.m.

AI Influencers on Social Media: Opportunities and…

The emergence of AI-generated influencers on social media signifies a major shift in the digital environment, sparking widespread debates about the authenticity of online interactions and the ethical concerns tied to these virtual personas.

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today