Perplexity AI: the answer engine with a lot of question marks

June 28, 2024

In the coming weeks, Reddit will start blocking most automated bots from accessing its public data. You’ll need to make a licensing deal, like Google and OpenAI have done, to use Reddit content for model training and other commercial purposes.

While this has technically been Reddit’s policy already, the company is now enforcing it by updating its robots.txt file, a core part of the web that dictates how web crawlers are allowed to access a site. “It’s a signal to those who don’t have an agreement with us that they shouldn’t be accessing Reddit data,” the company’s chief legal officer, Ben Lee, tells me. “It’s also a signal to bad actors that the word ‘allow’ in robots.txt doesn’t mean, and has never meant, that they can use the data however they want.”
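To make the mechanism concrete, here is a minimal sketch of how a well-behaved crawler might consult robots.txt rules before fetching a page, using Python's standard `urllib.robotparser`. The rules, the bot names (`ExampleLicensedBot`, `SomeOtherBot`), and the `example.com` URL are all hypothetical and are not Reddit's actual file or partners. Compliance is voluntary: the file only signals intent, which matches Lee's description of it as "a signal."

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration only; this is NOT Reddit's real robots.txt.
ROBOTS_TXT = """\
User-agent: ExampleLicensedBot
Allow: /

User-agent: *
Disallow: /
"""


def can_fetch(user_agent: str, url: str) -> bool:
    """Return True if the hypothetical rules above permit user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


# A bot named in its own rule group is allowed; every other bot falls
# through to the wildcard group and is disallowed.
licensed = can_fetch("ExampleLicensedBot", "https://example.com/r/some_page")
unlicensed = can_fetch("SomeOtherBot", "https://example.com/r/some_page")
print(licensed, unlicensed)
```

Nothing in this check stops a crawler that simply skips it, which is why the policy still depends on licensing agreements and enforcement rather than on the file alone.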
