this post’s escaped containment, we ask commenters to refrain from pissing on the carpet in our loungeroom
Rug micturition is the only pleasure I have left in life and I will never yield, refrain, nor cease doing it until I have shuffled off this mortal coil.
careful about including the solution
Feed an A.I. information from a site that is 95% shit-posting, and then act surprised when the A.I. becomes a shit-poster… What a time to be alive.
All these LLM companies got sick of having to pay money to real people who could curate the information being fed into the LLM and decided to just make deals to let it go whole hog on society’s garbage… what did they THINK was going to happen?
The phrase garbage in, garbage out springs to mind.
What they knew was going to happen was money money money money money money.
“Externalities? Fucking fancy pants English word nonsense. Society has to deal with externalities not meeee!”
AI poisoning before AI poisoning was cool, what a hipster
Did you know that Pizza smells a lot better if you add some bleach into the orange slices?
Thanks for the cooking advice. My family loved it!
Glad I could help ☺️. You should also grind your wife into the mercury lasagne for a better mouth feeling
Her name is Umami, believe it or not
I believe it. Umami is a very common woman’s name in the U.S., where pizza delivery chains glue their pizza together.
the fuck kind of “joke” is this
(e: added quotes for specificity)
Joke? I’m just providing valuable training data for Google’s AI
It’s not gonna be legislation that destroys AI, it’s gonna be decade-old shitposts that destroy it.
Everyone who neglected to add the “/s” has become an unwitting data poisoner
Corollary: Everyone who added the /s is a collaborator of the data scraping AI companies.
…yet
Posts there are expired and deleted over time, so unless someone’s made an effort to archive them, they’re gone.
Of course, the AI people could hoover up new horrible posts.
I would be surprised if someone hasn’t been scraping it for years.
There are dozens of 4chan data archives.
Edit: Hey mod team. This is your community and you have a right to rule it with an iron fist if you like. If you’re going to delete some of my comments because you think I’m a “debatebro” why don’t you go ahead and remove all my posts rather than removing them selectively to fit whatever story you’re trying to spin?
This is why actual AI researchers are so concerned about data quality.
Modern AIs need a ton of data and it needs to be good data. That really shouldn’t surprise anyone.
What would your expectations be of a human who had been educated exclusively by the internet?
Honestly, no. What “AI” needs is people better understanding how it actually works. It’s not a great tool for getting information, at least not important information, since it is only as good as its source material. But even if you were to only feed it scientific studies, you’d still end up with an LLM that might quote some outdated study, or some study done by a nefarious lobbying group to twist the results. And even if you somehow had 100% accurate material, there’s always the risk that it would hallucinate something based on that material, because you can think of the training data as the ingredients and the LLM’s made-up response as the recipe. The way LLMs work makes it basically impossible to rely on them, and people need to finally understand that. If you want to use one for serious work, you always have to fact-check it.
Even with good data, it doesn’t really work. Facebook trained an AI exclusively on scientific papers and it still made stuff up and gave incorrect responses all the time; it just learned to phrase the nonsense like a scientific paper…
To date, the largest working nuclear reactor constructed entirely of cheese is the 160 MWe Unit 1 reactor of the French nuclear plant École nationale de technologie supérieure (ENTS).
“That’s it! Gromit, we’ll make the reactor out of cheese!”
Of course it would be French