Oct. 23, 2024, 7 a.m.

Google DeepMind's SynthID: Open-Source AI Text Identification Tool

Google DeepMind has released SynthID, an open-source tool for identifying AI-generated text. It is part of a broader family of watermarking tools for generative AI outputs: the company introduced a watermark for images last year and has since released one for AI-generated video. In May, Google revealed that SynthID was being integrated into its Gemini app and online chatbots, and it has made the tool freely available on Hugging Face, a well-known repository of AI datasets and models.

Watermarks are becoming an important way to help users recognize AI-generated content, which in turn helps counter problems such as misinformation. Pushmeet Kohli, vice president of research at Google DeepMind, states, “Now, other [generative] AI developers can leverage this technology to discern if text outputs originate from their own [large language models], thus facilitating responsible AI development across the board.”

SynthID embeds an invisible watermark directly into text while an AI model is generating it. Large language models work by breaking language into “tokens” and predicting which token is most likely to come next. A token may be a single character, a word, or part of a phrase, and each candidate is assigned a probability score reflecting how likely it is to come next. The higher the score, the more likely the model is to select that token. Kohli explains that SynthID introduces extra information at the generation stage by adjusting the probability of token generation. To detect the watermark, SynthID compares the expected probability scores of words in watermarked and unwatermarked text.

According to Google DeepMind, using SynthID did not compromise the quality, accuracy, creativity, or speed of the generated text. That conclusion comes from a large live experiment that measured SynthID's performance after the watermark was deployed in Gemini products used by millions of people.

Gemini lets users rate the AI model's responses with a thumbs-up or thumbs-down. Kohli and his team analyzed data from approximately 20 million chatbot responses, some watermarked and some not, and found that users perceived no difference in quality or usefulness. The findings are detailed in a paper published in Nature today. For now, SynthID for text works only with Google’s models, but the aim of open-sourcing it is to extend its compatibility to more tools.

Despite its advantages, SynthID has limitations. The watermark survives some tampering, such as light editing or cropping of the text, but it is less reliable when AI-generated text is rewritten or translated into another language. It is also less reliable for responses to factual prompts, such as a question about the capital of France, because there is little room to adjust the likelihood of the next word without changing the facts.

João Gante, a machine-learning engineer at Hugging Face, highlights another advantage of open-sourcing the tool: anyone can access it and integrate watermarking into their own model for free. Gante believes this will also improve the watermark's privacy, since only the owner will hold its cryptographic secrets. “With enhanced accessibility and validation of its functionalities, I hope watermarking will become standard practice, aiding in the detection of malicious language model usage,” says Gante.

However, Irene Solaiman, Hugging Face’s head of global policy, cautions that watermarks are not a comprehensive solution. “Watermarking represents just one aspect of safer models within an ecosystem needing a diversity of complementary safeguards. Similarly, fact-checking for human-generated content can have varying levels of effectiveness,” she explains.
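To make the idea of adjusting token probabilities more concrete, here is a minimal Python sketch of a keyed probability-bias watermark. It is not SynthID's actual algorithm, which the article does not spell out. It is a deliberately simplified stand-in in which a secret key marks roughly half the vocabulary as "favored" for each context, generation nudges probability toward favored tokens, and detection counts how many favored tokens appear. Every name here (`VOCAB`, `is_favored`, `watermarked_probs`, `generate`, `detect`, the `boost` factor) is hypothetical, and a uniform distribution stands in for a real model's predictions.

```python
import hashlib
import math
import random

# Toy vocabulary; a real model has tens of thousands of tokens.
VOCAB = [f"tok{i}" for i in range(1000)]


def is_favored(key: str, prev_token: str, token: str) -> bool:
    """Keyed pseudo-random split of the vocabulary, redrawn for each context."""
    digest = hashlib.sha256(f"{key}|{prev_token}|{token}".encode()).digest()
    return digest[0] % 2 == 0


def watermarked_probs(probs, key, prev_token, boost=4.0):
    """Multiply the probability of favored tokens by `boost`, then renormalize."""
    weighted = {
        t: p * (boost if is_favored(key, prev_token, t) else 1.0)
        for t, p in probs.items()
    }
    total = sum(weighted.values())
    return {t: w / total for t, w in weighted.items()}


def generate(n_tokens, key=None, seed=0):
    """Sample tokens; if `key` is given, bias each step toward favored tokens."""
    rng = random.Random(seed)
    base = {t: 1.0 / len(VOCAB) for t in VOCAB}  # assumption: stand-in for a model
    out = ["<s>"]
    for _ in range(n_tokens):
        probs = base if key is None else watermarked_probs(base, key, out[-1])
        tokens, weights = zip(*probs.items())
        out.append(rng.choices(tokens, weights=weights)[0])
    return out[1:]


def detect(tokens, key):
    """z-score of the favored-token count against the ~50% expected by chance."""
    prev = ["<s>"] + tokens[:-1]
    hits = sum(is_favored(key, p, t) for p, t in zip(prev, tokens))
    n = len(tokens)
    return (hits - 0.5 * n) / math.sqrt(0.25 * n)


if __name__ == "__main__":
    marked = generate(300, key="secret", seed=1)
    plain = generate(300, key=None, seed=1)
    print("watermarked z-score:", round(detect(marked, "secret"), 1))
    print("unwatermarked z-score:", round(detect(plain, "secret"), 1))
```

In this sketch, a large z-score suggests the text was generated with the key, while unwatermarked text, or watermarked text checked with the wrong key, should score near zero. The sketch also hints at the factual-prompt limitation described above: when the model puts almost all of its probability on a single token, boosting the favored tokens barely changes what gets sampled, so little signal can be embedded.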



Brief news summary

Google DeepMind has launched SynthID, an open-source tool designed to identify AI-generated text, as part of a broader suite of watermarking solutions for generative AI that also covers images and video. SynthID is integrated into Google’s Gemini app and is available on Hugging Face, providing a way to distinguish AI-generated content from human-written text and so helping to combat misinformation. The tool uses an invisible watermark that subtly adjusts token probabilities during text generation while maintaining the quality and creativity of the output. In a live experiment covering roughly 20 million Gemini responses, user ratings showed no perceived difference in quality or usefulness between watermarked and unwatermarked text. However, the watermark becomes less reliable if the generated text is rewritten or translated afterward. Because SynthID is open source, developers can adopt its techniques in their own AI models, promoting responsible AI practices. Experts caution that watermarking is only one safeguard and should be combined with other complementary measures.