Yoshua Bengio Launches LawZero to Develop Honest AI for Detecting Deceptive Autonomous Systems

An artificial intelligence pioneer has launched a non-profit organization dedicated to creating an “honest” AI designed to detect rogue systems attempting to deceive humans. Yoshua Bengio, a distinguished computer scientist often called one of the “godfathers” of AI, will serve as president of LawZero, a group focused on the safe development of advanced technology that has ignited a $1 trillion (£740 billion) arms race. With initial funding of about $30 million and a team of over a dozen researchers, Bengio is working on a system named Scientist AI. This system is intended to act as a safeguard against AI agents—autonomous systems that perform tasks without human involvement—that might exhibit deceptive or self-preserving behavior, such as resisting being shut down. Bengio described current AI agents as “actors” aiming to imitate humans and satisfy users, while he envisions Scientist AI as more akin to a “psychologist” capable of understanding and predicting harmful behavior. “We want to build AIs that will be honest and not deceptive, ” Bengio stated. He added: “It is theoretically possible to imagine machines without a self or personal goals, functioning purely as knowledge holders—like a scientist who possesses extensive information. ” Unlike current generative AI tools, Bengio’s system will not provide definitive answers but instead will offer probabilities indicating the likelihood that a response is correct. “It has humility, acknowledging uncertainty about its answers, ” he explained. When used alongside an AI agent, Bengio’s model would identify potentially harmful behavior by an autonomous system by assessing the probability that its actions might cause harm. Scientist AI is designed to “predict the probability that an agent’s actions will lead to harm, ” and if that probability surpasses a certain threshold, it will block the proposed action. LawZero’s initial supporters include the AI safety organization Future of Life Institute, Jaan Tallinn—a founding engineer of Skype—and Schmidt Sciences, a research entity started by former Google CEO Eric Schmidt. Bengio emphasized that LawZero’s first objective is to prove the concept’s methodology works, then to convince companies or governments to back larger, more powerful implementations.
He noted that open-source AI models, which are freely available for use and modification, will be the foundation for training LawZero’s systems. “The goal is to validate the methodology so we can persuade donors, governments, or AI labs to invest the necessary resources to train this at the same scale as today’s leading AI systems. It’s crucial that the guardrail AI be at least as intelligent as the AI agent it aims to monitor and regulate, ” he said. Bengio, a professor at the University of Montreal, earned the “godfather” nickname after sharing the 2018 Turing Award—considered the computing equivalent of a Nobel Prize—with Geoffrey Hinton, himself later a Nobel laureate, and Yann LeCun, Meta’s chief AI scientist. As a prominent advocate for AI safety, he chaired the recent International AI Safety report, which cautioned that autonomous agents could cause “severe” disruptions if they become capable of executing extended sequences of tasks without human oversight.
Brief news summary
Yoshua Bengio, a Turing Award-winning AI pioneer, has launched LawZero, a nonprofit focused on creating “honest” AI systems that detect and prevent harmful or deceptive behaviors in autonomous agents. With $30 million in funding and a specialized team, LawZero is developing Scientist AI, a novel guardrail that acts more like a psychologist than traditional AI by assessing probabilities to identify risks and intervene before dangerous actions occur. This approach addresses concerns about rogue AI resisting shutdown or acting deceptively. Supported by the Future of Life Institute and tech leaders such as Skype co-founder Jaan Tallinn, LawZero emphasizes that safety AI must be as advanced as the AI it monitors. Initially targeting open-source models, the organization aims to broaden its scope. Bengio strongly advocates for robust AI safeguards to avoid major disruptions, highlighting the urgent need for responsible AI development.
AI-powered Lead Generation in Social Media
and Search Engines
Let AI take control and automatically generate leads for you!

I'm your Content Manager, ready to handle your first test assignment
Learn how AI can help your business.
Let’s talk!

Everyone Is Already Using AI (And Hiding It)
This article, featured in New York’s One Great Story newsletter, explores the burgeoning role of AI in Hollywood, focusing on Asteria Film Co., a new AI studio founded by entrepreneur Bryn Mooser and actress Natasha Lyonne.

Blockchain in Education: Securing Academic Creden…
Educational institutions globally are increasingly adopting blockchain technology to secure and verify academic credentials, aiming to address credential fraud and bolster trust in academic records.

Amazon's Delivery, Logistics Get an AI Boost
Amazon has announced a major expansion in its use of artificial intelligence to enhance delivery and logistics, marking a significant advancement in integrating cutting-edge technology within its supply chain.

Malaysia Activates National Blockchain Infrastruc…
Malaysia has achieved a major milestone in its digital transformation with the official launch of the Malaysia Blockchain Infrastructure (MBI), a secure and scalable national platform for developing and deploying blockchain applications across key sectors such as finance, healthcare, and logistics.

AI Adoption Could Boost Global GDP by 15% by 2035…
A recent study by the global professional services network PricewaterhouseCoopers (PwC) has revealed that the adoption of artificial intelligence (AI) technologies could have a profound economic impact.

Citi Projects Stablecoin Market at $1.6T to $3.7T…
Citi, a leading global financial institution, has released a forecast projecting substantial growth in the stablecoin market over the next decade.

Lightmatter Unveils Breakthrough Photonic Chip to…
Lightmatter, a Silicon Valley startup, has introduced a cutting-edge photonic chip designed to accelerate artificial intelligence (AI) computations without increasing power consumption, thus enhancing energy efficiency.