Five@slrpnk.net to Reddit@lemmy.worldEnglish · 6 months agoReddit Will License Its Data to Train LLMs, So We Made a Firefox Extension That Lets You Replace Your Comments With Any (Non-Copyrighted) Text - The Ludditetheluddite.orgexternal-linkmessage-square42fedilinkarrow-up1294arrow-down18cross-posted to: reddit@lemmy.worldbyereddit@lemmy.worldtechnology@lemmy.world
arrow-up1286arrow-down1external-linkReddit Will License Its Data to Train LLMs, So We Made a Firefox Extension That Lets You Replace Your Comments With Any (Non-Copyrighted) Text - The Ludditetheluddite.orgFive@slrpnk.net to Reddit@lemmy.worldEnglish · 6 months agomessage-square42fedilinkcross-posted to: reddit@lemmy.worldbyereddit@lemmy.worldtechnology@lemmy.world
minus-squareabbadon420@lemm.eelinkfedilinkarrow-up4·6 months agoWhere can I find those archive dumps? The usual (unmentionable) torrent sites or is there a specific place for archive dumps?
minus-squareFaceDeer@fedia.iolinkfedilinkarrow-up4arrow-down2·edit-26 months agoThe place I know about off the top of my head is academictorrents.com where you can find lots of large data sets useful for academic research. The torrent files themselves are small, so I’m sure they can be found in other places too.
Where can I find those archive dumps? The usual (unmentionable) torrent sites or is there a specific place for archive dumps?
The place I know about off the top of my head is academictorrents.com where you can find lots of large data sets useful for academic research. The torrent files themselves are small, so I’m sure they can be found in other places too.