Oct. 18, 2023, 3:49 p.m.

A lawsuit was recently filed in a New York federal court against tech companies for using scraped web content to train their AI models. This practice has enabled companies such as OpenAI and Google to develop groundbreaking chatbots like ChatGPT, sparking a competitive race to sell AI tools. The plaintiffs, who include well-known figures such as Mike Huckabee, Tsh Oxenreider, and Lysa TerKeurst, argue that while using books as part of a data set is not inherently problematic, using pirated or stolen books fails to compensate authors and publishers for their creative work.

The lawsuit targets Meta, Microsoft, and financial data provider Bloomberg L.P., all of which have trained their own "large language models" using web data. Specifically, it focuses on the inclusion of an infamous collection of pirated books known as "Books3" in a freely accessible compilation of data sources known as "the Pile," created by the nonprofit organization EleutherAI to give smaller companies broader access to data for AI training. EleutherAI is also named as a defendant.

As a proposed class action, the suit seeks damages and an injunction against the companies' continued use of the plaintiffs' works. Microsoft declined to comment, and representatives from Meta, Bloomberg, and EleutherAI did not respond to requests for comment. Large language models are typically trained on billions of sentences sourced from the internet, including news articles, Wikipedia, and social media comments. Though OpenAI, Google, and Microsoft do not publicly disclose the specifics of their data sources, critics of AI have long suspected that collections of pirated books are included. The debate over whether tech companies can freely take data from the internet, without payment or permission, to train their potentially profitable AI models is intensifying. Comedians, writers, and artists have filed numerous lawsuits against these tech giants. Tech executives argue that taking data from the public web falls under the concept of "fair use" in copyright law, which allows exemptions for works substantially different from their source material, but the dispute remains unresolved.

