renzev@lemmy.world to Lemmy Shitpost@lemmy.worldEnglish · 2 months agoThe internet kind of sucks right nowlemmy.worldimagemessage-square146linkfedilinkarrow-up1957arrow-down115
arrow-up1942arrow-down1imageThe internet kind of sucks right nowlemmy.worldrenzev@lemmy.world to Lemmy Shitpost@lemmy.worldEnglish · 2 months agomessage-square146linkfedilink
minus-squareRooty@lemmy.worldlinkfedilinkarrow-up27arrow-down3·edit-22 months agoIDGAF about LLM bots scraping public forums, they are public and available to anyone. I do mind them scraping shadow libraries, and training on copywritten material, which they should not do
minus-squareTja@programming.devlinkfedilinkarrow-up14·2 months agoPublic and copyrighted are not mutually exclusive.
minus-squareacosmichippo@lemmy.worldlinkfedilinkEnglisharrow-up2·edit-22 months agoalso “public for actual people who support my forum business model” is not the same as “public for AI scrapers who detract from my business model.”
minus-squareWawe@lemmy.worldlinkfedilinkarrow-up11·2 months agoLLM bots are scraping so much that increases costs of maintaing forums and sometimes even ddosin them for example Codeberg.
IDGAF about LLM bots scraping public forums, they are public and available to anyone. I do mind them scraping shadow libraries, and training on copywritten material, which they should not do
Public and copyrighted are not mutually exclusive.
also “public for actual people who support my forum business model” is not the same as “public for AI scrapers who detract from my business model.”
Public is public, tho.
LLM bots are scraping so much that increases costs of maintaing forums and sometimes even ddosin them for example Codeberg.