
feat(datasets): Add flwr-datasets create CLI #6514

Merged: jafermarq merged 24 commits into main from add-flwr-dataset-cli on Feb 5, 2026



@yan-gao-GY (Member)

Proposal

Add a flwr_datasets_creator CLI for generating demo federated datasets.

Copilot AI review requested due to automatic review settings February 3, 2026 15:50
Copilot AI (Contributor) left a comment

Pull request overview

This PR adds a new CLI tool flwr-datasets-creator to facilitate federated dataset creation for demo purposes. The tool enables users to partition Hugging Face datasets using IID partitioning and save them to disk.

Changes:

  • Added a new CLI command flwr-datasets-creator registered in pyproject.toml
  • Implemented CLI functionality in flwr_datasets/cli/flwr_datasets_creator.py with argument parsing and dataset partitioning logic
  • Created CLI module with proper exports in flwr_datasets/cli/__init__.py
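The flow summarized above (parse arguments, partition IID, save to disk) can be sketched roughly as follows. This is a standard-library illustration only, not the code merged in this PR: the function names, the `--num-partitions` and `--out-dir` flags, the JSON output format, and the toy rows standing in for a Hugging Face dataset are all assumptions made for the example.

```python
import argparse
import json
import random
from pathlib import Path


def iid_partition(num_samples: int, num_partitions: int, seed: int = 42) -> list[list[int]]:
    """Shuffle sample indices and deal them into near-equal partitions."""
    if num_partitions < 1 or num_partitions > num_samples:
        raise ValueError("num_partitions must be between 1 and num_samples")
    indices = list(range(num_samples))
    random.Random(seed).shuffle(indices)
    # Striding over the shuffled list keeps partition sizes within one of each other.
    return [indices[i::num_partitions] for i in range(num_partitions)]


def save_partitions(rows: list[dict], partitions: list[list[int]], out_dir: Path) -> list[Path]:
    """Write each partition to its own JSON file and return the file paths."""
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for pid, idxs in enumerate(partitions):
        path = out_dir / f"partition_{pid}.json"
        path.write_text(json.dumps([rows[i] for i in idxs]))
        paths.append(path)
    return paths


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical flags, chosen for illustration only.
    parser = argparse.ArgumentParser(description="Create demo IID partitions")
    parser.add_argument("--num-partitions", type=int, default=10)
    parser.add_argument("--out-dir", type=Path, default=Path("./federated_dataset"))
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    # Toy rows stand in for a Hugging Face dataset split.
    rows = [{"id": i, "label": i % 2} for i in range(100)]
    parts = iid_partition(len(rows), args.num_partitions)
    for p in save_partitions(rows, parts, args.out_dir):
        print(p)
```

Seeding the shuffle keeps the partitions reproducible across runs, which tends to matter for demos where several clients must agree on who holds which samples.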

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 9 comments.

| File | Description |
| --- | --- |
| datasets/pyproject.toml | Registers the new flwr-datasets-creator CLI script entry point |
| datasets/flwr_datasets/cli/flwr_datasets_creator.py | Implements the main CLI logic for creating and saving federated dataset partitions |
| datasets/flwr_datasets/cli/__init__.py | Exports the flwr_datasets_creator function from the CLI module |


@github-actions bot added the "Contributor" label on Feb 3, 2026
@yan-gao-GY changed the title from "feat(datasets): Add flwr_datasets_creator CLI" to "feat(datasets): Add flwr-datasets create CLI" on Feb 4, 2026
yan-gao-GY and others added 6 commits February 4, 2026 20:42
Co-authored-by: Javier <jafermarq@users.noreply.github.com>
Co-authored-by: Javier <jafermarq@users.noreply.github.com>
Co-authored-by: Javier <jafermarq@users.noreply.github.com>
Co-authored-by: Javier <jafermarq@users.noreply.github.com>
yan-gao-GY and others added 2 commits February 4, 2026 22:14
Co-authored-by: Javier <jafermarq@users.noreply.github.com>
Co-authored-by: Javier <jafermarq@users.noreply.github.com>
yan-gao-GY and others added 2 commits February 5, 2026 20:45
Co-authored-by: Javier <jafermarq@users.noreply.github.com>
@jafermarq merged commit 57b3cb4 into main on Feb 5, 2026
76 checks passed
@jafermarq deleted the add-flwr-dataset-cli branch on Feb 5, 2026 15:00

Labels

Contributor Used to determine what PRs (mainly) come from external contributors.

3 participants