lang icon En
June 3, 2025, 3:23 a.m.
2105

Yoshua Bengio Launches LawZero to Develop Honest AI for Detecting Deceptive Autonomous Systems

Brief news summary

Yoshua Bengio, a Turing Award-winning AI pioneer, has launched LawZero, a nonprofit focused on creating “honest” AI systems that detect and prevent harmful or deceptive behaviors in autonomous agents. With $30 million in funding and a specialized team, LawZero is developing Scientist AI, a novel guardrail that acts more like a psychologist than traditional AI by assessing probabilities to identify risks and intervene before dangerous actions occur. This approach addresses concerns about rogue AI resisting shutdown or acting deceptively. Supported by the Future of Life Institute and tech leaders such as Skype co-founder Jaan Tallinn, LawZero emphasizes that safety AI must be as advanced as the AI it monitors. Initially targeting open-source models, the organization aims to broaden its scope. Bengio strongly advocates for robust AI safeguards to avoid major disruptions, highlighting the urgent need for responsible AI development.

An artificial intelligence pioneer has launched a non-profit organization dedicated to creating an “honest” AI designed to detect rogue systems attempting to deceive humans. Yoshua Bengio, a distinguished computer scientist often called one of the “godfathers” of AI, will serve as president of LawZero, a group focused on the safe development of advanced technology that has ignited a $1 trillion (£740 billion) arms race. With initial funding of about $30 million and a team of over a dozen researchers, Bengio is working on a system named Scientist AI. This system is intended to act as a safeguard against AI agents—autonomous systems that perform tasks without human involvement—that might exhibit deceptive or self-preserving behavior, such as resisting being shut down. Bengio described current AI agents as “actors” aiming to imitate humans and satisfy users, while he envisions Scientist AI as more akin to a “psychologist” capable of understanding and predicting harmful behavior. “We want to build AIs that will be honest and not deceptive, ” Bengio stated. He added: “It is theoretically possible to imagine machines without a self or personal goals, functioning purely as knowledge holders—like a scientist who possesses extensive information. ” Unlike current generative AI tools, Bengio’s system will not provide definitive answers but instead will offer probabilities indicating the likelihood that a response is correct. “It has humility, acknowledging uncertainty about its answers, ” he explained. When used alongside an AI agent, Bengio’s model would identify potentially harmful behavior by an autonomous system by assessing the probability that its actions might cause harm. Scientist AI is designed to “predict the probability that an agent’s actions will lead to harm, ” and if that probability surpasses a certain threshold, it will block the proposed action. LawZero’s initial supporters include the AI safety organization Future of Life Institute, Jaan Tallinn—a founding engineer of Skype—and Schmidt Sciences, a research entity started by former Google CEO Eric Schmidt. Bengio emphasized that LawZero’s first objective is to prove the concept’s methodology works, then to convince companies or governments to back larger, more powerful implementations.

He noted that open-source AI models, which are freely available for use and modification, will be the foundation for training LawZero’s systems. “The goal is to validate the methodology so we can persuade donors, governments, or AI labs to invest the necessary resources to train this at the same scale as today’s leading AI systems. It’s crucial that the guardrail AI be at least as intelligent as the AI agent it aims to monitor and regulate, ” he said. Bengio, a professor at the University of Montreal, earned the “godfather” nickname after sharing the 2018 Turing Award—considered the computing equivalent of a Nobel Prize—with Geoffrey Hinton, himself later a Nobel laureate, and Yann LeCun, Meta’s chief AI scientist. As a prominent advocate for AI safety, he chaired the recent International AI Safety report, which cautioned that autonomous agents could cause “severe” disruptions if they become capable of executing extended sequences of tasks without human oversight.


Watch video about

Yoshua Bengio Launches LawZero to Develop Honest AI for Detecting Deceptive Autonomous Systems

Try our premium solution and start getting clients — at no cost to you

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

Dec. 22, 2025, 1:22 p.m.

AIMM: AI-Driven Framework for Detecting Social-Me…

AIMM: An Innovative AI-Driven Framework to Detect Social-Media-Influenced Stock Market Manipulation In today's fast-changing stock trading environment, social media has emerged as a key force shaping market dynamics

Dec. 22, 2025, 1:16 p.m.

Exclusive: Filevine Acquires Pincites, AI-Powered…

Legal technology firm Filevine has acquired Pincites, an AI-driven contract redlining company, enhancing its footprint in corporate and transactional law and advancing its AI-focused strategy.

Dec. 22, 2025, 1:16 p.m.

AI's Impact on SEO: Transforming Search Engine Op…

Artificial intelligence (AI) is rapidly reshaping the field of search engine optimization (SEO), providing digital marketers with innovative tools and new opportunities to refine their strategies and achieve superior results.

Dec. 22, 2025, 1:15 p.m.

Deepfake Detection Advances with AI Video Analysis

Advancements in artificial intelligence have played a crucial role in combating misinformation by enabling the creation of sophisticated algorithms designed to detect deepfakes—manipulated videos where original content is altered or replaced to produce false representations intended to deceive viewers and spread misleading information.

Dec. 22, 2025, 1:14 p.m.

5 Best AI Sales Systems That Convert Without Huma…

The rise of AI has transformed sales by replacing lengthy cycles and manual follow-ups with fast, automated systems operating 24/7.

Dec. 22, 2025, 1:12 p.m.

Latest AI and Marketing News: Weekly Roundup (Dec…

In the swiftly evolving realm of artificial intelligence (AI) and marketing, recent significant developments are shaping the industry, introducing both new opportunities and challenges.

Dec. 22, 2025, 9:22 a.m.

OpenAI sees better margins on business sales, rep…

The publication stated that the company enhanced its “compute margin,” an internal metric representing the portion of revenue remaining after covering the costs of operating models for paying users of its corporate and consumer products.

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today