feat(baselines): add verifiable aggregation workflow baseline (Message API) by rwilliamspbg-ops · Pull Request #6892 · flwrlabs/flower

rwilliamspbg-ops · 2026-03-30T12:43:41Z

Description
This PR adds a new Flower baseline that demonstrates a reproducible verifiable aggregation workflow using the Message API.
The goal is to provide an isolated community contribution under baselines that does not change Flower core behavior by default, while still showing how to add optional verification hooks around server-side aggregation outputs.

Related issues/PRs
Fixes #6880.
Related: #6881.

Proposal
Explanation
The PR introduces a self-contained baseline named verifiableagg with the standard baseline structure and contributor tooling support.

Main changes:

Added a new baseline package with Message API apps:
Client-side local train and evaluate app
Server-side app orchestration
FedAvg-derived strategy with optional aggregation verification hooks
Added deterministic and reproducible baseline setup:
Synthetic deterministic per-client data generation
Seeded defaults in configuration
Configurable verification tolerance and verification toggle
Added benchmark and reporting utilities:
Script to run and summarize benchmark output
JSON report containing run config, round metrics, verification pass or fail, max absolute difference, and aggregate hash
Added full baseline documentation:
Environment setup
Run instructions
Expected results
Contributor check commands
Added baseline-local ignore rules:
Ignore runtime artifacts so generated files are not committed
Validation performed:

Baseline structure check passed:
./dev/test-baseline-structure.sh verifiableagg
Baseline quality checks passed:
./dev/test-baseline.sh verifiableagg
Includes isort, black, docformatter, ruff, mypy, pylint, and pytest
Checklist
Implement proposed change
Write tests
Update documentation
Address LLM-reviewer comments, if applicable (e.g., GitHub Copilot)
Make CI checks pass
Ping maintainers on Slack (channel #contributions)
Any other comments?
This is intentionally a focused first-phase contribution with an isolated baseline and reproducible workflow.
If preferred, follow-up PRs can extend this baseline with larger-scale benchmark variants or additional verification mechanisms.

Copilot

Pull request overview

Adds a new self-contained verifiableagg baseline under baselines/ demonstrating a reproducible (synthetic, seeded) verifiable aggregation workflow using Flower’s Message API, without changing Flower core behavior.

Changes:

Introduces a Message API ServerApp/ClientApp baseline that runs FedAvg with optional post-aggregation verification (replay + tolerance + hashing).
Adds deterministic synthetic per-client dataset generation and JSON reporting utilities (plus a benchmark helper script).
Adds baseline packaging/configuration (pyproject.toml), docs, and baseline-local ignore rules.

Reviewed changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 6 comments.

Show a summary per file

File	Description
baselines/verifiableagg/verifiableagg/utils.py	Adds `as_bool` helper for run_config parsing.
baselines/verifiableagg/verifiableagg/strategy.py	Implements `VerifiableFedAvg` with replay-based aggregation verification and hashing.
baselines/verifiableagg/verifiableagg/server_app.py	Orchestrates training, writes model + JSON report artifacts.
baselines/verifiableagg/verifiableagg/reporting.py	Adds JSON report writer utility.
baselines/verifiableagg/verifiableagg/model.py	Defines small MLP + train/eval loops for synthetic task.
baselines/verifiableagg/verifiableagg/dataset.py	Implements deterministic synthetic per-partition dataset + loaders.
baselines/verifiableagg/verifiableagg/client_app.py	Implements Message API client train/evaluate handlers.
baselines/verifiableagg/verifiableagg/init.py	Package marker/docstring.
baselines/verifiableagg/run_benchmark.sh	Convenience script to run baseline + summarize report.
baselines/verifiableagg/pyproject.toml	Baseline dependencies, tooling config, Flower app/federation config defaults.
baselines/verifiableagg/benchmark_report.py	CLI tool to tabulate verification rounds and exit non-zero on failures.
baselines/verifiableagg/README.md	Baseline documentation (setup, run, expected outputs).
baselines/verifiableagg/.gitignore	Ignores baseline runtime artifacts (e.g., `artifacts/`).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

baselines/verifiableagg/verifiableagg/strategy.py

baselines/verifiableagg/verifiableagg/server_app.py

baselines/verifiableagg/verifiableagg/reporting.py

baselines/verifiableagg/verifiableagg/client_app.py

baselines/verifiableagg/verifiableagg/server_app.py

rwilliamspbg-ops · 2026-03-30T12:54:02Z

@copilot apply changes based on the comments in this thread

Copilot

Pull request overview

Copilot reviewed 13 out of 13 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

baselines/verifiableagg/verifiableagg/strategy.py

baselines/verifiableagg/README.md

baselines/verifiableagg/run_benchmark.sh

baselines/verifiableagg/pyproject.toml

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 13 out of 13 changed files in this pull request and generated 1 comment.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

baselines/verifiableagg/verifiableagg/utils.py

rwilliamspbg-ops · 2026-03-31T14:27:15Z

@copilot apply changes based on the comments in this thread

Copilot

Pull request overview

Copilot reviewed 13 out of 13 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

baselines/verifiableagg/verifiableagg/server_app.py

baselines/verifiableagg/README.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

rwilliamspbg-ops · 2026-03-31T14:40:55Z

@copilot apply changes based on the comments in this thread

Copilot

Pull request overview

Copilot reviewed 13 out of 13 changed files in this pull request and generated no new comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 13 out of 13 changed files in this pull request and generated 5 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

baselines/verifiableagg/verifiableagg/utils.py

baselines/verifiableagg/README.md

baselines/verifiableagg/verifiableagg/server_app.py

baselines/verifiableagg/verifiableagg/strategy.py

rwilliamspbg-ops · 2026-04-04T13:31:31Z

@copilot apply changes based on the comments in this thread

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

baselines: add verifiable aggregation workflow baseline

e3f6b53

Copilot AI review requested due to automatic review settings March 30, 2026 12:43

rwilliamspbg-ops requested review from chongshenng, danieljanes, jafermarq and tanertopal as code owners March 30, 2026 12:43

Copilot started reviewing on behalf of rwilliamspbg-ops March 30, 2026 12:44 View session

Copilot AI reviewed Mar 30, 2026

View reviewed changes

rwilliamspbg-ops requested a review from Copilot March 30, 2026 12:59

Copilot started reviewing on behalf of rwilliamspbg-ops March 30, 2026 13:00 View session

Copilot AI reviewed Mar 30, 2026

View reviewed changes

Update baselines/verifiableagg/pyproject.toml

efd49c6

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

github-actions bot added the Contributor Used to determine what PRs (mainly) come from external contributors. label Mar 30, 2026

rwilliamspbg-ops and others added 3 commits March 30, 2026 15:37

Merge branch 'flwrlabs:main' into main

26b4e61

Update baselines/verifiableagg/run_benchmark.sh

a475b48

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

chore(baselines): polish verifiableagg determinism and robustness

2bff846

rwilliamspbg-ops requested a review from Copilot March 31, 2026 14:23

Copilot started reviewing on behalf of rwilliamspbg-ops March 31, 2026 14:23 View session

Copilot AI reviewed Mar 31, 2026

View reviewed changes

baselines/verifiableagg/verifiableagg/utils.py Show resolved Hide resolved

rwilliamspbg-ops requested a review from Copilot March 31, 2026 14:28

Copilot started reviewing on behalf of rwilliamspbg-ops March 31, 2026 14:29 View session

Copilot AI reviewed Mar 31, 2026

View reviewed changes

baselines/verifiableagg/verifiableagg/server_app.py Outdated Show resolved Hide resolved

baselines/verifiableagg/README.md Show resolved Hide resolved

baselines/verifiableagg/README.md Show resolved Hide resolved

Update baselines/verifiableagg/verifiableagg/server_app.py

2e7bd45

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

rwilliamspbg-ops requested a review from Copilot March 31, 2026 14:42

Copilot started reviewing on behalf of rwilliamspbg-ops March 31, 2026 14:42 View session

Copilot AI reviewed Mar 31, 2026

View reviewed changes

rwilliamspbg-ops requested a review from Copilot April 4, 2026 13:11

Copilot started reviewing on behalf of rwilliamspbg-ops April 4, 2026 13:12 View session

Copilot AI reviewed Apr 4, 2026

View reviewed changes

Update baselines/verifiableagg/verifiableagg/strategy.py

50b7b0b

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Conversation

rwilliamspbg-ops commented Mar 30, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rwilliamspbg-ops commented Mar 30, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

rwilliamspbg-ops commented Mar 31, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rwilliamspbg-ops commented Mar 31, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rwilliamspbg-ops commented Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants