Tuesday, January 23, 2018

[csmdfofy] Sample only 1% of users

If you are collecting personal information to improve your service or software, collect it from a uniform random sample of only 1% of your users.  Aggregate statistics computed from a uniform sample remain valid, just with somewhat wider error bars, but the collected Big Data is unlikely to be misusable in a targeted attack against a particular person: any particular person has a 99% chance of having escaped your mass surveillance.
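One way the sampling decision might be made is sketched below in Python.  This is only an illustration under assumptions that are not part of the idea itself: users have a stable string ID, the provider holds a secret key, and the function name `is_sampled` and the constant `SAMPLE_RATE` are made up for this example.  Keying the hash keeps outsiders from predicting who is in the sample, and hashing the ID (rather than rolling dice on every event) keeps the same 99% of users out of the data for good.

```python
import hashlib
import hmac

SAMPLE_RATE = 0.01  # hypothetical constant: collect from 1% of users


def is_sampled(user_id: str, secret_key: bytes, rate: float = SAMPLE_RATE) -> bool:
    """Decide, deterministically per user, whether to collect data from them.

    The user ID is passed through a keyed hash (HMAC-SHA256), and the first
    8 bytes of the digest are read as an integer in [0, 2**64).  The user is
    in the sample when that integer falls in the lowest `rate` fraction of
    the range, so about `rate` of all users are selected.
    """
    digest = hmac.new(secret_key, user_id.encode("utf-8"), hashlib.sha256).digest()
    bucket = int.from_bytes(digest[:8], "big")
    return bucket < rate * 2**64


if __name__ == "__main__":
    key = b"example-secret-key"  # placeholder; a real key would be kept private
    users = [f"user-{i}" for i in range(100_000)]
    sampled = [u for u in users if is_sampled(u, key)]
    print(f"collecting from {len(sampled)} of {len(users)} users")
```

Data from users for whom `is_sampled` returns False would simply never be recorded.  A per-install random draw stored on the client would work as well; the hash approach is just one choice that needs no extra stored state.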

Services and software providers can advertise the percentage that they sample and compete with each other for the lowest percentage.  They can get certifications from trusted third parties that they are only collecting 1%.

