Blocking AI bots from Microsoft, others has been “pain in the a**”: Reddit CEO | Huffman says companies must pay to scrape Reddit data even though Reddit itself relies on free, user-generated content

ForgottenFlux@lemmy.world · edit-2 2 months ago

Blocking AI bots from Microsoft, others has been “pain in the a**”: Reddit CEO | Huffman says companies must pay to scrape Reddit data even though Reddit itself relies on free, user-generated content

morgunkorn@discuss.tchncs.de · edit-2 2 months ago

Honestly, any platforms hosting user-generated content who use the legal argument that they only provide hosting and aren’t responsible for what their user post shouldn’t also be able to sell the same data and claim owning any of it.

Otherwise, take away their legal immunity. Nazis or pedophiles post something awful? You get in front of the judge.

edit: typo

Justin@lemmy.jlh.name · 2 months ago

Exactly this. You can claim that their scraping is abusing your servers, but the moment you claim copyright for the content of the site, then you give up your Section 230 rights.

fuckwit_mcbumcrumble@lemmy.dbzer0.com · 2 months ago

You’d also probably lose a whole lot more processing power trying to stop the crawlers vs just letting them have API access with some sort of limit to queries.

rbits@lemm.ee · 2 months ago

I don’t think they actually block malicious bots, the change they’ve made is just to the robots.txt, they don’t have to do anything.

tb_@lemmy.world · 2 months ago

Robots.txt does literally nothing. It’s a piece of courtesy that’s easily ignored if you don’t care.