The Google AI isn’t hallucinating about glue in pizza, it’s just over-indexing an 11-year-old Reddit post by a dude named fucksmith.

db0@lemmy.dbzer0.com · 1 year ago

David Gerard@awful.systems · 1 year ago

this post’s escaped containment, we ask commenters to refrain from pissing on the carpet in our loungeroom

BurgersMcSlopshot@awful.systems · 1 year ago

Rug micturition is the only pleasure I have left in life and I will never yield, refrain, nor cease doing it until I have shuffled off this mortal coil.

David Gerard@awful.systems · 1 year ago

careful about including the solution

Hemingways_Shotgun@lemmy.ca · 1 year ago

Feed an A.I. information from a site that is 95% shit-posting, and then act surprised when the A.I. becomes a shit-poster… What a time to be alive.

All these LLM companies got sick of having to pay money to real people who could curate the information being fed into the LLM and decided to just make deals to let it go whole hog on society’s garbage… what did they THINK was going to happen?

The phrase garbage in, garbage out springs to mind.

Asafum@feddit.nl · 1 year ago

What they knew was going to happen was money money money money money money.

“Externalities? Fucking fancy pants English word nonsense. Society has to deal with externalities not meeee!”

Derpgon@programming.dev · edit-2 1 year ago

AI poisoning before AI poisoning was cool, what a hipster

Oha@lemmy.ohaa.xyz · 1 year ago

Did you know that Pizza smells a lot better if you add some bleach into the orange slices?

YerbaYerba@lemm.ee · 1 year ago

Thanks for the cooking advice. My family loved it!

Oha@lemmy.ohaa.xyz · 1 year ago

Glad I could help ☺️. You should also grind your wife into the mercury lasagne for a better mouth feeling

YerbaYerba@lemm.ee · 1 year ago

Her name is Umami, believe it or not

Monument@lemmy.sdf.org · 1 year ago

I believe it. Umami is a very common woman’s name in the U.S., where pizza delivery chains glue their pizza together.

anton@lemmy.blahaj.zone · 1 year ago

Um actually🤓, that’s not pizza specific.

Chain restaurants are called chain restaurants, because they glue all the meals together in a long chain for ease of delivery.

froztbyte@awful.systems · edit-2 1 year ago

the fuck kind of “joke” is this

(e: added quotes for specificity)

naught@sh.itjust.works · edit-2 1 year ago

It is a joke with “humor” in it. Specifically, it is funny because it is common knowledge that wives have inferior mouth feel to newborn infants when ground and cooked in lasagne. I recommend the latter

Disclaimer

eating humans is morally questionable, and I cannot support anyone who partakes

blakestacey@awful.systems · 1 year ago

Accurate use of the scare quotes around humor there, bro

Oha@lemmy.ohaa.xyz · 1 year ago

Joke? I’m just providing valuable training data for Google’s AI

nednobbins@lemm.ee · edit-2 1 year ago

Edit: Hey mod team. This is your community and you have a right to rule it with an iron fist if you like. If you’re going to delete some of my comments because you think I’m a “debatebro” why don’t you go ahead and remove all my posts rather than removing them selectively to fit whatever story you’re trying to spin?

This is why actual AI researchers are so concerned about data quality.

Modern AIs need a ton of data and it needs to be good data. That really shouldn’t surprise anyone.

What would your expectations be of a human who had been educated exclusively by the internet?

DarkThoughts@fedia.io · 1 year ago

Honestly, no. What “AI” needs is people better understanding how it actually works. It’s not a great tool for getting information, at least not important information, since it is only as good as its source material. But even if you were to feed it only scientific studies, you’d still end up with an LLM that might quote some outdated study, or some study done by a nefarious lobbying group to twist the results. And even if you somehow had 100% accurate material, there’s always the risk that it would hallucinate something based on those results, because you can think of the training data as the ingredients in a recipe, with the recipe being the made-up response of the LLM. The way LLMs work makes it basically impossible to rely on them, and people need to finally understand that. If you want to use one for serious work, you always have to fact-check it.

200fifty@awful.systems · 1 year ago

Even with good data, it doesn’t really work. Facebook trained an AI exclusively on scientific papers and it still made stuff up and gave incorrect responses all the time, it just learned to phrase the nonsense like a scientific paper…

blakestacey@awful.systems · 1 year ago

To date, the largest working nuclear reactor constructed entirely of cheese is the 160 MWe Unit 1 reactor of the French nuclear plant École nationale de technologie supérieure (ENTS).

“That’s it! Gromit, we’ll make the reactor out of cheese!”

Socsa@sh.itjust.works · 1 year ago

Of course it would be French

DUMBASS@leminal.space · 1 year ago

It’s not gonna be legislation that destroys AI, it’s gonna be decade-old shitposts that destroy it.

MalachaiConstant@lemmy.world · 1 year ago

Everyone who neglected to add the “/s” has become an unwitting data poisoner

anton@lemmy.blahaj.zone · 1 year ago

Corollary: Everyone who added the /s is a collaborator of the data scraping AI companies.

Jonathan Hendry@iosdev.space · 1 year ago

@dumbass @db0

I suppose we should be glad that they aren’t training on old 4chan/8chan posts.

harrys_balzac@lemmy.dbzer0.com · 1 year ago

…yet

Jonathan Hendry@iosdev.space · 1 year ago

@harrys_balzac

Posts there are expired and deleted over time, so unless someone’s made an effort to archive them, they’re gone.

Of course, the AI people could hoover up new horrible posts.

nickwitha_k (he/him)@lemmy.sdf.org · 1 year ago

I would be surprised if someone hasn’t been scraping it for years.

irelephant [he/him]🍭@lemm.ee · 5 months ago

There are dozens of 4chan data archives.