From MPI-SWS, Germany, paper and slides
1. Problem
To protect user privacy in distributed systems from leaking by statistical queries.
2. Challenges
The most direct solutions are
to anonymize + add noise to user data.
[-] utility, de-anonymize
differential privacy. add noise to answer of queries.
[-] scale, churn tolerance, malicious client
3. Solution
PDDP: Practical Distributed Differential Privacy.
- Binary answer in bucket. The query result should not be distorted by the client arbitrarily.
- Blind noise addition. The Malicious should not be trusted. Private data are controlled by its user only.
3.1 Assumption
Clients and analysts are potentially malicious. Proxy is HbC (honest but curious) and should not have access to noise-free result.
3.2 Work Flow
Query Initialization(Analyst -> Proxy)
Query Forwarding (Proxy -> Client)
Client Respond (Client -> Proxy)
answers are encrypted with the analyst’s public key.
Differential Private Noise Addition.
collaborative coin generation with a GM cryptosystem. Unbiased proxy flip encrypted coins from clients randomly and thus transform them into unbiased ones. Coins serve as DP noises.
Noisy Answers to Analyst ( Proxy -> Analyst)
3.3 Implementation and Deployment
600+ Client = Firefox add-on + SQLite
Proxy = Tomcat web service + MySQL
Analyst = Java program
4. Conclusion
The authors achieve scalable, churn-tolerant user privacy against malicious analyst and clients by
- making a trade off between utility and privacy. (differential privacy)
- introduce distributed system to traditionally centralized differential privacy environment. (distributed)