AI companies are violating a basic social contract of the web and ignoring robots.txt
www.theverge.com
Andy Reid@lemmy.world to Technology@lemmy.world · English · 9 months ago · cross-posted to: technology@beehaw.org
Ultraviolet@lemmy.world · 9 months ago (edited)
Better yet, point the crawler to a massive text file of almost but not quite grammatically correct garbage to poison the model: something it will recognize as language and internalize, but that will severely degrade the quality of its output.
odelik@lemmy.today · 9 months ago
Maybe one of the lorem ipsum generators could help.
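For anyone curious what a generator along those lines might look like, here is a minimal sketch. The word lists, the templates, and the function names `garbage_sentence` and `garbage_text` are all made up for illustration and do not come from the thread. It produces sentences that are well-formed in shape but meaningless in content, which is only one reading of "almost but not quite grammatically correct", and whether text like this would actually degrade a model is untested.

```python
import random

# Hypothetical vocabulary, invented for this sketch only.
NOUNS = ["crawler", "garden", "protocol", "teacup", "server", "river"]
VERBS = ["indexes", "waters", "negotiates", "borrows", "compiles", "follows"]
ADJECTIVES = ["quiet", "recursive", "purple", "stale", "generous"]

# Each template is grammatical in shape; the random word choices
# are what turn the result into nonsense.
TEMPLATES = [
    "The {adj} {noun} {verb} the {noun2}.",
    "Every {noun} {verb} a {adj} {noun2} quietly.",
    "When the {noun} {verb}, the {adj} {noun2} {verb2}.",
]


def garbage_sentence(rng):
    """Fill one randomly chosen template with randomly chosen words."""
    return rng.choice(TEMPLATES).format(
        adj=rng.choice(ADJECTIVES),
        noun=rng.choice(NOUNS),
        noun2=rng.choice(NOUNS),
        verb=rng.choice(VERBS),
        verb2=rng.choice(VERBS),  # ignored by templates that lack {verb2}
    )


def garbage_text(n_sentences, seed=None):
    """Join n_sentences nonsense sentences; a fixed seed makes it repeatable."""
    rng = random.Random(seed)
    return " ".join(garbage_sentence(rng) for _ in range(n_sentences))


if __name__ == "__main__":
    print(garbage_text(5, seed=42))
```

Calling `garbage_text` with a large sentence count and writing the result to a file would give the "massive text file" part; a real lorem ipsum generator would be a drop-in alternative for the word source.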