renzev@lemmy.world to Lemmy Shitpost@lemmy.worldEnglish · 13 days agoThe internet kind of sucks right nowlemmy.worldimagemessage-square144linkfedilinkarrow-up1954arrow-down115
arrow-up1939arrow-down1imageThe internet kind of sucks right nowlemmy.worldrenzev@lemmy.world to Lemmy Shitpost@lemmy.worldEnglish · 13 days agomessage-square144linkfedilink
minus-squareRooty@lemmy.worldlinkfedilinkarrow-up27arrow-down3·edit-212 days agoIDGAF about LLM bots scraping public forums, they are public and available to anyone. I do mind them scraping shadow libraries, and training on copywritten material, which they should not do
minus-squareTja@programming.devlinkfedilinkarrow-up14·12 days agoPublic and copyrighted are not mutually exclusive.
minus-squareacosmichippo@lemmy.worldlinkfedilinkEnglisharrow-up2·edit-211 days agoalso “public for actual people who support my forum business model” is not the same as “public for AI scrapers who detract from my business model.”
minus-squareWawe@lemmy.worldlinkfedilinkarrow-up11·12 days agoLLM bots are scraping so much that increases costs of maintaing forums and sometimes even ddosin them for example Codeberg.
IDGAF about LLM bots scraping public forums, they are public and available to anyone. I do mind them scraping shadow libraries, and training on copywritten material, which they should not do
Public and copyrighted are not mutually exclusive.
also “public for actual people who support my forum business model” is not the same as “public for AI scrapers who detract from my business model.”
Public is public, tho.
LLM bots are scraping so much that increases costs of maintaing forums and sometimes even ddosin them for example Codeberg.