Rdp for subsampled gaussian mechanism under the sampling without replacement scheme #67

yuxiangw · 2019-08-14T23:59:24Z

Function compute_rdp_sample_without_replacement implements Theorem 9 of Wang, Balle, Kasiviswanathan. "Subsampled Renyi Differential Privacy and Analytical Moments Accountant." AISTATS'2019.
Added a test script.
Added a reference to why compute_rdp is correctly calculating the RDP for subsampled Gaussian mechanism under the Poisson sampling scheme (independently include a data point with probability q). At least for integer order...

…eplacement. Also, adding references for the existing Poisson sampling scheme calculations.

…dp_for_subsampled_gaussian automerged

googlebot · 2019-08-14T23:59:30Z

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it!) and we'll verify it.

What to do if you already signed the CLA

Individual signers

It's possible we don't have your GitHub username or you're using a different email address on your commit. Check your existing CLA data and verify that your email is set on your git commits.

Corporate signers

Your company has a Point of Contact who decides which employees are authorized to participate. Ask your POC to be added to the group of authorized contributors. If you don't know who your Point of Contact is, direct the Google project maintainer to go/cla#troubleshoot (Public version).
The email used to register you as an authorized contributor must be the email used for the Git commit. Check your existing CLA data and verify that your email is set on your git commits.
The email used to register you as an authorized contributor must also be attached to your GitHub account.

ℹ️ Googlers: Go here for more info.

yuxiangw · 2019-08-15T01:30:16Z

@googlebot I signed it!

…

On Aug 14, 2019, at 4:59 PM, googlebot ***@***.***> wrote: @googlebot I signed it!

googlebot · 2019-08-15T01:30:20Z

CLAs look good, thanks!

ℹ️ Googlers: Go here for more info.

googlebot · 2019-08-15T01:30:21Z

CLAs look good, thanks!

ℹ️ Googlers: Go here for more info.

kairouzp · 2019-08-20T15:46:18Z

privacy/analysis/rdp_accountant.py

+  Accountant." AISTATS'2019.
+
+  A strengthened version -- Theorem 27 -- applies subsampled-Gaussian mechanism. An implementation
+  is available at https://github.com/yuxiangw/autodp


Would you be able to also include an implementation of the strengthened version? Initially, I thought we would only need the result in Theorem 27 because TensorFlow Privacy only uses the Gaussian Mechanism. But now that you have an implementation of Theorem 9, it'd be useful to keep ti for the future (when we start including other mechanisms).

The implemented version is for the Gaussian mechanism. Although it is easy to modify it and make it more modular to cover other mechanisms. The strengthened version (Theorem 27) is somewhat tricky to implement... and the improvement over Theorem 9 is only in the third term of the expansion onwards. I think it makes sense to first merge this PR and then start another branch to investigate how to add Theorem 27 in the most light-weight manner.

kairouzp · 2019-08-20T16:07:22Z

privacy/analysis/rdp_accountant.py

+
+  for i in range(2, alpha+1):
+    if i == 2:
+      log_coef_i = math.log(special.binom(alpha, i)) + i * math.log(q)


Do you think the use of numpy's log1p would lead to more stable calculations? Any other tricks to stabilize the calculations?

I was following the implementation from compute_rdp. For small alpha, this should OK. For large alpha, maybe doing everything in the log-space might be better. It really depends on how special.binom is implemented in scipy.

kairouzp · 2019-08-20T16:10:19Z

privacy/analysis/test_rdp_calculations.py

+
+#  A simple test script to demonstrate the calculated RDP
+
+q= 0.01


nit: q = 0.01

kairouzp · 2019-08-20T16:26:26Z

privacy/analysis/test_rdp_calculations.py

+results2 = compute_rdp_sample_without_replacement(q, noise_multiplier, steps, orders)
+
+
+import matplotlib.pyplot as plt


I believe pylint will complain about this. Perhaps best to do it at the beginning.

npapernot · 2019-12-18T11:37:06Z

@kairouzp what is the status on this?

ChrisWaites · 2020-06-06T07:26:48Z

+1 🤔

yuxiangw added 2 commits August 14, 2019 16:44

adding RDP for subsampled Gaussian mechanism under sampling without r…

2f6de67

…eplacement. Also, adding references for the existing Poisson sampling scheme calculations.

Merge branch 'master' of https://github.com/tensorflow/privacy into r…

45162ab

…dp_for_subsampled_gaussian automerged

googlebot added the cla: no cla: no label Aug 14, 2019

googlebot added cla: yes cla: yes and removed cla: no cla: no labels Aug 15, 2019

kairouzp reviewed Aug 20, 2019

View reviewed changes

fix spacings and other things pylint will complain

f506c0a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Rdp for subsampled gaussian mechanism under the sampling without replacement scheme #67

Rdp for subsampled gaussian mechanism under the sampling without replacement scheme #67

Uh oh!

yuxiangw commented Aug 14, 2019

Uh oh!

googlebot commented Aug 14, 2019

Uh oh!

yuxiangw commented Aug 15, 2019 via email

Uh oh!

googlebot commented Aug 15, 2019

Uh oh!

googlebot commented Aug 15, 2019

Uh oh!

kairouzp Aug 20, 2019

Uh oh!

yuxiangw Aug 22, 2019

Uh oh!

kairouzp Aug 20, 2019

Uh oh!

yuxiangw Aug 21, 2019

Uh oh!

kairouzp Aug 20, 2019

Uh oh!

yuxiangw Aug 22, 2019

Uh oh!

kairouzp Aug 20, 2019

Uh oh!

yuxiangw Aug 22, 2019

Uh oh!

npapernot commented Dec 18, 2019

Uh oh!

ChrisWaites commented Jun 6, 2020

Uh oh!

Uh oh!


		# A simple test script to demonstrate the calculated RDP

		q= 0.01

		results2 = compute_rdp_sample_without_replacement(q, noise_multiplier, steps, orders)


		import matplotlib.pyplot as plt

Rdp for subsampled gaussian mechanism under the sampling without replacement scheme #67

Are you sure you want to change the base?

Rdp for subsampled gaussian mechanism under the sampling without replacement scheme #67

Uh oh!

Conversation

yuxiangw commented Aug 14, 2019

Uh oh!

googlebot commented Aug 14, 2019

What to do if you already signed the CLA

Individual signers

Corporate signers

Uh oh!

yuxiangw commented Aug 15, 2019 via email

Uh oh!

googlebot commented Aug 15, 2019

Uh oh!

googlebot commented Aug 15, 2019

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

npapernot commented Dec 18, 2019

Uh oh!

ChrisWaites commented Jun 6, 2020

Uh oh!

Uh oh!