ImageNet contains naturally occurring Apple NeuralHash collisions
endisneigh 2021-08-19 17:06:02 +0000 UTC [ - ]
PhotoDNA has existed for over a decade, doing the same thing, with no instances that I have heard of.
If some corrupt government wants to get you, they don’t need this. They can just unilaterally say you’ve done something bad without evidence and imprison you. It happens all the time. It’s even happened in America. Just look up DNA exonerations - people have had DNA at the scene that literally proves their innocence and they’re still locked up.
bawolff 2021-08-19 17:08:46 +0000 UTC [ - ]
minitoar 2021-08-19 17:08:04 +0000 UTC [ - ]
endisneigh 2021-08-19 17:09:00 +0000 UTC [ - ]
alfalfasprout 2021-08-19 16:59:21 +0000 UTC [ - ]
criticaltinker 2021-08-19 17:03:32 +0000 UTC [ - ]
> Perhaps the most concerning part of the whole scheme is the database itself. Since the original images are (understandably) not available for inspection, it's not obvious how we can trust that a rogue actor (like a foreign government) couldn't add non-CSAM hashes to the list to root out human rights advocates or political rivals. Apple has tried to mitigate this by requiring two countries to agree to add a file to the list, but the process for this seems opaque and ripe for abuse.
cat199 2021-08-19 17:05:51 +0000 UTC [ - ]
version_five 2021-08-19 17:08:37 +0000 UTC [ - ]
matsemann 2021-08-19 17:03:37 +0000 UTC [ - ]
Someone got more details on that? How does the birthday paradox come into play here?
JimBlackwood 2021-08-19 17:04:16 +0000 UTC [ - ]
criticaltinker 2021-08-19 17:00:41 +0000 UTC [ - ]
> This is a false-positive rate of 2 in 2 trillion image pairs (1,431,168^2). Assuming the NCMEC database has more than 20,000 images, this represents a slightly higher rate than Apple had previously reported. But, assuming there are less than a million images in the dataset, it's probably in the right ballpark.
It's great to see the ingenuity and attention this whole debacle is receiving from the community. Maybe it will lead to advances in perceptual hashing (and also advances in consumer awareness of tech related privacy issues).
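For anyone who wants to check the arithmetic in the quoted figure, here is a rough sketch. It assumes, as the quote does, that the pair count is simply the image count squared, and that 2 collisions were found; the helper name `expected_false_matches` and the per-image scaling by database size are my own simplifications, not anything Apple or the article specifies.

```python
# Rough check of the quoted "2 in 2 trillion image pairs" figure.
# Assumption: pair count = N^2, as written in the quote (this counts
# ordered pairs and self-pairs; unordered distinct pairs would be about half).

N_IMAGES = 1_431_168   # image count from the quote
COLLISIONS = 2         # collisions reported in the quote

pairs = N_IMAGES ** 2
unordered_pairs = N_IMAGES * (N_IMAGES - 1) // 2

# Empirical per-pair false-positive rate implied by the quote.
rate = COLLISIONS / pairs


def expected_false_matches(db_size: int, per_pair_rate: float) -> float:
    """Hypothetical helper: expected false matches for ONE innocent image
    compared against a database of db_size hashes, assuming every
    comparison is independent with the same per-pair rate."""
    return db_size * per_pair_rate


if __name__ == "__main__":
    print(f"pairs (N^2):        {pairs:,}")
    print(f"unordered pairs:    {unordered_pairs:,}")
    print(f"per-pair rate:      {rate:.3e}")
    # Database sizes taken from the bounds mentioned in the quote.
    for db_size in (20_000, 1_000_000):
        per_image = expected_false_matches(db_size, rate)
        print(f"db={db_size:>9,}: expected false matches per image = {per_image:.3e}")
```

The per-pair rate comes out a bit under one in a trillion, and the per-image number scales linearly with however many hashes are actually in the database, which is why the quote hedges on the database size.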
bawolff 2021-08-19 17:06:49 +0000 UTC [ - ]