Examples Guide

The examples/ directory contains runnable scripts and notebooks that demonstrate how to use the JAX Privacy building blocks. The examples fall into two broad categories:

End-to-end mechanism implementations — complete DP training pipelines that compose clipping, noise addition, batch selection, and accounting into a single program.
Isolated component demonstrations — scripts that exercise or benchmark an individual building block (e.g., noise generation, data loading, or privacy accounting) without running a full training loop.

A note on correctness

JAX Privacy provides the building blocks for writing correct DP implementations, but that does not mean every program composed from those blocks — including some of our own examples — uses them in the recommended way.

Several examples aim to be end-to-end correct reference implementations. These examples use proper Poisson sampling for batch selection (via CyclicPoissonSampling or the Keras API with poisson_sampling_in_fit=True) and treat the batch size as private information (normalizing by the expected batch size rather than the actual batch size). If any bugs are found in these files, we would consider those unexpected and serious. These examples are marked with ✅ below.

All other end-to-end examples may take shortcuts or make simplifications that could invalidate the formal privacy guarantee. These are known limitations and exist by design so the examples remain accessible for research and demonstration purposes. The most common issues are:

Not using Poisson sampling during training. Several examples use fixed-size batches via tf.data shuffling or do not opt into Poisson sampling in the Keras API, even though the privacy accounting assumes Poisson-sampled batches.
Treating the (possibly padded) batch size as public information, for example by dividing gradients by the actual batch size inside the JIT-compiled update step, when it should be treated as private.

We intend to improve more examples — particularly the Gemma fine-tuning notebooks — over time. Until then, treat the examples below accordingly.

End-to-end mechanism implementations

These examples compose multiple JAX Privacy building blocks into a full DP training pipeline.

`dp_logistic_regression.py` ✅

Trains a logistic regression model on synthetic data using DP-BandMF. Demonstrates the full pipeline: BandMFExecutionPlanConfig for calibration and plan construction, CyclicPoissonSampling for batch selection with proper padding, clipped gradient computation, correlated noise addition, and post-training privacy auditing with canary scores.

Correctness status: Aims to be a correct reference implementation.

`secure_noise_example.py` ✅

Trains a logistic regression model using DP-SGD with the discrete Gaussian mechanism backed by cryptographically secure CPU-side randomness (via randomgen.RDRAND). Uses CyclicPoissonSampling for Poisson-sampled batches, integer-grid rounding of clipped gradients, and RDP accounting.

Correctness status: Aims to be a correct reference implementation.

`dp_sgd_transformer.py` ✅

Trains a character-level Transformer decoder on the Tiny Shakespeare dataset using DP-SGD with BandMFExecutionPlanConfig. Demonstrates next-character prediction with per-sequence privacy. Uses CyclicPoissonSampling (via the execution plan) for batch selection and normalizes by the expected batch size.

Correctness status: Aims to be a correct reference implementation.

`keras_api_example.py` ✅

Trains a CNN on MNIST using the JAX Privacy Keras integration (keras_api.DPKerasConfig / make_private) with poisson_sampling_in_fit=True. Demonstrates how to enable DP-SGD with minimal code changes on top of a standard Keras training loop.

Correctness status: Aims to be a correct reference implementation.

`jax_api_example.py`

Trains a linear regression model on synthetic data using the JAX Privacy core API (clipped_grad, gaussian_privatizer, PLD-based noise calibration). Shows both DP and non-DP training paths for comparison.

Correctness status: Research / demonstration. Uses tf.data shuffling with fixed-size batches (drop_remainder=True) rather than Poisson sampling, and divides gradients by the fixed batch size.

`dp_sgd_flax_linen_mnist.ipynb` (notebook)

Step-by-step Colab tutorial that trains a Flax Linen CNN on MNIST with DP-SGD. Walks through hyper-parameter setup, PLD-based noise calibration, per-example gradient clipping via dp_sgd.grad_clipping, and comparing DP vs. non-DP accuracy.

Correctness status: Research / demonstration. Uses tf.data shuffling with fixed-size batches rather than Poisson sampling.

`dp_sgd_keras_gemma3_lora_finetuning_samsum.ipynb` (notebook)

Colab tutorial for DP-SGD LoRA fine-tuning of Gemma 3 (4B) on the SAMSum summarization dataset using the Keras API. Covers data preprocessing, mixed-precision training, enabling DP via DPKerasConfig, and ROUGE evaluation.

Correctness status: Research / demonstration. Does not enable poisson_sampling_in_fit in the Keras API configuration.

`dp_sgd_keras_gemma3_synthetic_data.ipynb` (notebook)

Colab tutorial for generating differentially private synthetic data by DP fine-tuning Gemma 3 (12B) with LoRA on IMDb reviews and then sampling from the tuned model. Includes MAUVE-based evaluation of synthetic data quality.

Correctness status: Research / demonstration. Does not enable poisson_sampling_in_fit in the Keras API configuration.

Isolated component demonstrations

These examples focus on a single building block and do not implement a full DP training loop.

`data_loading.py`

Demonstrates how to integrate jax_privacy.BatchSelectionStrategy with PyGrain for efficient data loading from on-disk datasets. Includes a reusable CustomBatchIterator class with checkpointing support, and benchmarks throughput comparing the custom iterator against a standard PyGrain pipeline.

`distributed_noise_generation.py`

Standalone tool for benchmarking distributed correlated noise generation with DP-BandMF. Visualizes array sharding across a TPU mesh and reports compilation time, per-step runtime, and memory usage. Useful for determining feasibility of a given model on a given TPU topology.

`dpmf_strategy_optimization.py`

Numerically optimizes DP Matrix Factorization (DP-MF) strategy matrices and evaluates the expected error of different parameterizations (dense, banded, Toeplitz, BLT, etc.). The reported errors correspond to unamplified DP-MF and can be used to compare strategy quality across different configurations.

`balls_in_bins_accounting.py`

Uses Monte Carlo accounting to calibrate the noise multiplier for DP-SGD under the “balls-in-bins” batch selection strategy. Demonstrates sample generation, noise multiplier sweeping, and verification-based calibration from the jax_privacy.experimental.monte_carlo module.

Examples Guide

A note on correctness

End-to-end mechanism implementations

dp_logistic_regression.py ✅

secure_noise_example.py ✅

dp_sgd_transformer.py ✅

keras_api_example.py ✅

jax_api_example.py

dp_sgd_flax_linen_mnist.ipynb (notebook)

dp_sgd_keras_gemma3_lora_finetuning_samsum.ipynb (notebook)

dp_sgd_keras_gemma3_synthetic_data.ipynb (notebook)

Isolated component demonstrations

data_loading.py

distributed_noise_generation.py

dpmf_strategy_optimization.py

balls_in_bins_accounting.py

`dp_logistic_regression.py` ✅

`secure_noise_example.py` ✅

`dp_sgd_transformer.py` ✅

`keras_api_example.py` ✅

`jax_api_example.py`

`dp_sgd_flax_linen_mnist.ipynb` (notebook)

`dp_sgd_keras_gemma3_lora_finetuning_samsum.ipynb` (notebook)

`dp_sgd_keras_gemma3_synthetic_data.ipynb` (notebook)

`data_loading.py`

`distributed_noise_generation.py`

`dpmf_strategy_optimization.py`

`balls_in_bins_accounting.py`