Skip to content

Use proper Poisson sampling in documented examples and keras_api #84

@ryan112358

Description

@ryan112358

The JAX library core library includes APIs and Implementations of batch selection strategies, including standard Poisson sampling. However, some of our example binaries do not use this, and instead load in the data sequentially as fixed size batches, which is common practice without privacy. This is a problematic flaw with the implementations that invalidates the privacy guarantee and should be fixed.

An example that does correct sampling is given in dp_logistic_regression.py.

Task:

  • Update keras_api.py so that Poisson sampling is used instead of deterministic fixed-batch sampling. This may require imposing additional constraints on the input dataset format. In particular, the dataset needs to support efficient random access (in-memory arrays, or arrayrecord format).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions