Sex offender banned from using AI tools in landmark UK case

girlfreddy@lemmy.ca · 7 months ago

shneancy@lemmy.world · 7 months ago

Metadata is trivially easy to strip from a picture. You don’t even need to bother using tools for it: just take a screenshot and delete the original.

Allero@lemmy.today · edit-2 7 months ago

It can be baked into the pixels or, even better, submitted to an identification system similar to the one Apple uses to detect CSAM, but serving as an “alright” ID (and kept only in the police’s hands, not on the device or something).

Bob Robertson IX @discuss.tchncs.de · 7 months ago

But even then, if every pixel gets marked as ‘created by AI’, it would still be trivial to take real CSAM and run it through an image-to-image generator with denoising turned down to 0.05 and suddenly you have real CSAM that has been marked as ‘legal’ since it is technically AI generated.

Also, keep in mind that there are several open source projects out there where anyone who knows what they are doing could just strip out any protections that might be put in place.

Allero@lemmy.today · 7 months ago

An Apple-like ID system solves the latter by technical means.

As for image-to-image, feeding the model recognised CSAM should not be possible to begin with.

hikaru755@feddit.de · 7 months ago

Yeah but the point is you can’t easily add it to any picture you want (if it’s implemented well), thus providing a way to prove that the pictures were created using AI and no harm has been done to children in their creation. It would be a valid solution to the “easy to hide actual CSAM between AI generated pictures” problem.

Syn_Attck@lemmy.today · edit-2 7 months ago

you can’t easily add it to any picture you want (if it’s implemented well)

Edit for the downvoters: StackExchange - How do I add exif data to an image?

Going to need you to elaborate on this. EXIF data is just bytes in a file, like any of the other bytes in the file. It can be changed, and is often changed without the user’s consent. Are you proposing we create a new type of hardware, something akin to Secure Enclave, and then mass-produce it and add it to every consumer CPU to ensure that specific types of EXIF data aren’t tampered with?

jkrtn@lemmy.ml · 7 months ago

I disagree that it should be allowed, but I think their proposal would be something like attaching an identifier to the model, the random seed, the “temperature,” and any other relevant parameters that allow exact reproduction of the image without having access to anything but the model. Then you can prove it came from the model.
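To make the "exact reproduction" idea concrete, here is a minimal sketch. Everything in it is hypothetical: `GenerationRecord`, `toy_generate` and `matches_record` are made-up names, and the "model" is just a seeded random byte generator standing in for a real image model. It also assumes generation is bit-for-bit deterministic, which real GPU pipelines do not always guarantee.

```python
import hashlib
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class GenerationRecord:
    """Hypothetical record of everything needed to regenerate an image."""
    model_id: str
    seed: int
    temperature: float


def toy_generate(record: GenerationRecord, n_bytes: int = 64) -> bytes:
    """Stand-in for a deterministic image model (NOT a real generator).

    The same record always yields the same bytes, which is the only
    property this sketch relies on.
    """
    rng = random.Random(f"{record.model_id}|{record.seed}|{record.temperature}")
    return bytes(rng.randrange(256) for _ in range(n_bytes))


def matches_record(image_bytes: bytes, record: GenerationRecord) -> bool:
    """Regenerate from the record and compare hashes with the given image."""
    regenerated = toy_generate(record)
    return hashlib.sha256(image_bytes).digest() == hashlib.sha256(regenerated).digest()


record = GenerationRecord(model_id="toy-model-v1", seed=1234, temperature=0.7)
image = toy_generate(record)
print(matches_record(image, record))  # the claimed record reproduces the image
```

The check only passes when the claimed model and parameters regenerate the exact same bytes, so a wrong seed or an edited image fails.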

Here’s a thought experiment, though: what would prevent someone from taking a real image and a model, then working with them until they can reproduce a very close approximation of the real image from text and parameter input? These models aren’t like a hash function; they can be viewed in reverse to some extent. Backpropagation is how they are trained.

hikaru755@feddit.de · 7 months ago

I was thinking of an approach based on cryptographic signatures. If all images that come from a certain AI model are signed with the private key of a digital certificate, you can tamper with metadata all you want, but you’re not gonna be able to produce the correct signature to add to an image unless you have access to that private key. This technology has been around for ages, is used in every web browser, and would be pretty simple to implement.
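A rough sketch of the sign-then-verify flow, with one big caveat: to keep it dependency-free it uses HMAC from the Python standard library as a stand-in, which is a shared-secret scheme and not the certificate-based signature described above. A real system would use an asymmetric signature (Ed25519, for example) so that verifiers only need the public key. The function names and the key are made up for illustration.

```python
import hashlib
import hmac


def sign_image(image_bytes: bytes, secret_key: bytes) -> bytes:
    """Compute an authentication tag over the image bytes.

    HMAC-SHA256 is a stand-in here; it is NOT an asymmetric signature.
    """
    return hmac.new(secret_key, image_bytes, hashlib.sha256).digest()


def verify_image(image_bytes: bytes, tag: bytes, secret_key: bytes) -> bool:
    """Recompute the tag and compare it in constant time."""
    expected = sign_image(image_bytes, secret_key)
    return hmac.compare_digest(expected, tag)


# Hypothetical key held only by the model operator.
key = b"example-key-held-by-the-model-operator"
image = b"raw pixel bytes of a generated image"

tag = sign_image(image, key)
print(verify_image(image, tag, key))            # untouched image verifies
print(verify_image(image + b"\x00", tag, key))  # any changed byte fails
```

The tag is computed over the image bytes themselves, so rewriting metadata stored elsewhere doesn't help, and without the key you can't produce a valid tag for a different image.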

The only weak point with this approach would be that it relies on the private key not being publicly accessible, which makes this a lot harder or maybe even impossible to implement for open source models that anyone can run on their own hardware. But then again, at least for what we’re talking about here, the goal wouldn’t need to be a system covering every model, just one that makes at least a couple models safe to use for this specific purpose.

I guess the more practical question is whether this would be helpful for any other use case. Because if not, I highly doubt it’s gonna be implemented. Nobody is gonna want the PR nightmare of building a feature with no other purpose than to help pedophiles generate stuff to get off to “safely”, no matter how well intentioned.