Amazon reportedly investigating Perplexity AI after accusations it scrapes websites without consent - Engadget
Amazon Web Services is currently conducting an investigation to determine if Perplexity AI is in violation of its rules, as reported by Wired. Specifically, the cloud division of the company is looking into allegations that Perplexity AI is utilizing a crawler hosted on its servers that disregards the Robots Exclusion Protocol. This protocol, a web standard, involves developers placing a robots. txt file on a domain to instruct bots on whether they can or cannot access a particular page. While adherence to these instructions is voluntary, reputable companies have generally respected them since their implementation in the 1990s. In a previous article, Wired discovered a virtual machine hosted on an Amazon Web Services server with the IP address 44. 221. 181. 252, which was bypassing the robots. txt instructions on its website. This machine is said to have visited various Condé Nast properties multiple times over the last three months to scrape their content. Other publications such as The Guardian, Forbes, and The New York Times also reported multiple visits from the same machine. Wired conducted an experiment where they inputted headlines or brief descriptions of their articles into Perplexity's chatbot to verify if the company was scraping their content.
The chatbot's responses closely paraphrased the articles "with minimal attribution. " A recent Reuters report suggests that Perplexity is not the only AI company bypassing robots. txt files to gather content for training large language models. However, Wired only provided Amazon with information regarding Perplexity AI's crawler. Amazon Web Services stated, "AWS’s terms of service prohibit abusive and illegal activities, and our customers are responsible for complying with those terms. " They further mentioned that they regularly receive reports of alleged abuse and investigate them accordingly. Perplexity's spokesperson, Sara Platnick, responded to Amazon's inquiries, asserting that their crawlers abide by the Robots Exclusion Protocol and do not violate AWS Terms of Service. Platnick also mentioned that Amazon's scrutiny of Wired's media inquiry follows their standard protocol for investigating potential resource abuse reports. However, Platnick admitted to Wired that PerplexityBot will disregard robots. txt when users include a specific URL in their chatbot inquiry. Aravind Srinivas, the CEO of Perplexity, previously denied claims that his company disregarded the Robots Exclusion Protocol and then lied about it. Srinivas did admit that Perplexity utilizes third-party web crawlers in addition to its own, where the bot identified by Wired is one of them. Update, June 28, 2024, 2:20 PM ET: This post has been updated to include Perplexity's statement to Engadget. Update, June 28, 2024, 8:27 PM ET: This post has been updated to include a statement from Amazon Web Services.
Create a post
based on this news in the Content Maker
![](https://cdn.mos.cms.futurecdn.net/m8DdaRtTMmfHtcS3rs2y4E-1200-80.png)
Which AI chatbot is best at search — I compared C…
I rely on Google and Tom's Guide for all my online searches and deals for electronics
![](https://www.ft.com/__origami/service/image/v2/images/raw/https%3A%2F%2Fcms-image-bucket-production-ap-northeast-1-a7d2.s3.ap-northeast-1.amazonaws.com%2Fimages%2F7%2F9%2F0%2F7%2F47907097-1-eng-GB%2FSK+Hynix.jpg?width=1260&height=630&fit=cover&gravity=faces&source=nar-cms)
SK Group aims to secure $56bn by 2026 to invest i…
SK Group, the second-largest conglomerate in South Korea, has set a target of generating 80 trillion won ($56 billion) by 2026
![](https://images.indianexpress.com/2024/07/meta-ai-whatsapp-main.jpg)
Meta AI is fun, accessible, and free. Maybe it’s …
Last week, I noticed a blue ring icon on WhatsApp, signaling the rollout of Meta's new AI chatbot across various apps
![](https://www.pymnts.com/wp-content/uploads/2024/06/Amazon-Adept.jpg)
Amazon Recruits Execs From Adept for AGI Effort -…
According to a recent report from Bloomberg News, Amazon has reportedly recruited executives from Adept AI for its artificial general intelligence (AGI) project
![](https://static.euronews.com/articles/stories/08/51/22/74/1200x675_cmsv2_eaed935c-a0bc-554a-93d4-afdc3790ba23-8512274.jpg)
Could AI save Europe's rare and endangered langua…
The No Language Left Behind (NLLB) project, led by Meta, aims to make Facebook and Instagram posts more accessible in 200 lesser-spoken languages worldwide
AI images suggest a new era of surrealism - The W…
The rise of technology has brought forth a surge of strange and unreal images that evoke surrealism
![](https://cdn.mos.cms.futurecdn.net/XTSiA3EiD6gwX8XXVB62gg-1200-80.jpg)
SK hynix plans $74.6 billion investment to streng…
Memory supplier SK hynix, with a 35% market share in the DRAM market, has announced plans to invest $74