- Set-up environment
- Download Dataset
- Download Pretrained Weights
- Train and evaluate
- Configuration files and input arguments
- Monitor experiments
- Project Structure
First, clone the repository to your local machine:
$ git clone https://github.com/rubenpt91/PFL-DocVQA-Competition.git
$ cd PFL-DocVQA-Competition
To install all the dependencies, you need to create a new conda environment with the provided yml file:
$ conda env create -f environment.yml
$ conda activate pfl_docvqa
Then, you need to install the Ray library manually (due to incompatibility issues, it cannot be installed together with the rest of the packages).
$ (pfl_docvqa) pip install ray==1.11
You will see the following error message, but you can ignore it.
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
flwr 1.4.0 requires grpcio!=1.52.0,<2.0.0,>=1.48.2, but you have grpcio 1.43.0 which is incompatible.
To check that the proper Ray version is installed, print the library version:
$ (pfl_docvqa) python
import ray
print(ray.__version__) # Output: 1.11.0
- Download the dataset from the ELSA Benchmarks Platform.
- Modify the following keys in the dataset configuration file `configs/datasets/PFL-DocVQA.yml` (a sketch is shown below):
  - `imdb_dir`: Path to the imdb directory with all train and validation clients.
  - `images_dir`: Path to the dataset images.
  - `provider_docs`: Path to `data_points.json`.
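For reference, a minimal sketch of how those keys might look in `configs/datasets/PFL-DocVQA.yml` (the flat key layout is an assumption, and all paths are placeholders):

```yaml
# configs/datasets/PFL-DocVQA.yml (sketch; flat layout assumed, paths are placeholders)
dataset_name: PFL-DocVQA
imdb_dir: /data/PFL-DocVQA/imdb          # imdb files for all train and validation clients
images_dir: /data/PFL-DocVQA/images      # dataset images
provider_docs: /data/PFL-DocVQA/data_points.json
```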
- Download the weights pretrained on SP-DocVQA.
- Unzip the weights: `unzip vt5_mp-docvqa.ckpt.zip`.
- Change the `model_weights` path in `configs/models/VT5.yml` to point to your local `.ckpt` path, for example as sketched below.
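The edit would look roughly like this (assuming the zip extracts to `vt5_mp-docvqa.ckpt`; the directory is a placeholder):

```yaml
# configs/models/VT5.yml (excerpt; path is a placeholder)
model_weights: /path/to/weights/vt5_mp-docvqa.ckpt
```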
To use the framework, you only need to call the `train.py` or `eval.py` script with the dataset and model you want to use.
The framework is not prepared to perform centralized training. Therefore, you must always specify the `--flower` flag. For example:
$ (pfl_docvqa) python train.py --dataset PFL-DocVQA --model VT5 --flower
The name of the dataset and the model must match the name of the configuration file under `configs/datasets` and `configs/models`, respectively. This allows having different configuration files for the same dataset or model.
In addition, to apply Differential Privacy, you just need to specify the `--use_dp` flag:
$ (pfl_docvqa) python train.py --dataset PFL-DocVQA --model VT5 --flower --use_dp
Below, we show a descriptive list of the possible input arguments.
| Parameter | Input param | Required | Description |
|---|---|---|---|
| Model | `-m`, `--model` | Yes | Name of the model config file |
| Dataset | `-d`, `--dataset` | Yes | Name of the dataset config file |
| Batch size | `-bs`, `--batch-size` | No | Batch size |
| Initialization seed | `--seed` | No | Initialization seed |
| Federated Learning | `--flower` | No | Specify to use Federated Learning or not |
| Federated Learning - Sample Clients | `--sample_clients` | No | Number of sampled clients during FL |
| Federated Learning - Num Rounds | `--num_rounds` | No | Number of FL rounds to train |
| Federated Learning - Iterations per Round | `--iteration_per_fl_round` | No | Number of iterations per provider during each FL round |
| Differential Privacy | `--use_dp` | No | Add Differential Privacy noise |
| Differential Privacy - Sampled providers per Round | `--providers_per_fl_round` | No | Number of groups (providers) sampled in each FL round when DP is used |
| Differential Privacy - Noise sensitivity | `--sensitivity` | No | Upper bound of the contribution per group (provider) |
| Differential Privacy - Noise multiplier | `--noise_multiplier` | No | Noise multiplier |
- Most of these parameters are specified in the configuration files. However, you can override them through the input arguments.
| Parameter | Description | Values |
|---|---|---|
| `dataset_name` | Name of the dataset to use. | PFL-DocVQA |
| `imdb_dir` | Path to the numpy annotations file. | `<Path>` |
| `images_dir` | Path to the images dir. | `<Path>` |
| `provider_docs` | Path to the data_points.json file. | `<Path>` |
| Parameter | Description | Values |
|---|---|---|
| `model_name` | Name of the model to use. | VT5 |
| `model_weights` | Path to the model weights dir. It can be either a local path or a Hugging Face weights id. | `<Path>`, `<Huggingface path>` |
| `max_input_tokens` | Max number of text tokens to input into the model. | Integer: usually 512, 768 or 1024. |
| `save_dir` | Path where the checkpoints and log files will be saved. | `<Path>` |
| `device` | Device to be used. | cpu, cuda |
| `visual_module` | Visual module parameters. | Check section Visual Module |
| `training_parameters` | Training parameters specified in the model config file. | Check section Training parameters |
| `fl_parameters` | Federated Learning parameters specified in the model config file. | Check section FL parameters |
| `dp_parameters` | Differential Privacy parameters specified in the model config file. | Check section DP parameters |
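To make the structure concrete, here is a rough sketch of what `configs/models/VT5.yml` could contain. The keys come from the table above, but the values are placeholders and the exact file layout should be checked against the config shipped with the repository:

```yaml
# configs/models/VT5.yml (sketch; values are placeholders)
model_name: VT5
model_weights: /path/to/weights/vt5_mp-docvqa.ckpt  # local path or Hugging Face weights id
max_input_tokens: 512
save_dir: save/                                     # checkpoints and log files
device: cuda
# visual_module, training_parameters, fl_parameters and dp_parameters are
# nested parameter groups; see the following sections and the sketch after them.
```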
| Parameter | Description | Values |
|---|---|---|
| `model` | Name of the model used to extract visual features. | ViT, DiT |
| `model_weights` | Path to the model weights dir. It can be either a local path or a Hugging Face weights id. | `<Path>`, `<Huggingface path>` |
| `finetune` | Whether the visual module should be fine-tuned during training or not. | Boolean |
| Parameter | Description | Values |
|---|---|---|
| `lr` | Learning rate. | Float (2e-4) |
| `batch_size` | Batch size. | Integer |
| Parameter | Description | Values |
|---|---|---|
| `sample_clients` | Number of sampled clients at each FL round. | Integer (2) |
| `total_clients` | Total number of training clients. | Integer (10) |
| `num_rounds` | Number of FL rounds to train. | Integer (5) |
| `iterations_per_fl_round` | Number of iterations per provider during each FL round. | Integer (1) |
| Parameter | Description | Values |
|---|---|---|
| `providers_per_fl_round` | Number of groups (providers) sampled in each FL round. | Integer (50) |
| `sensitivity` | Differential Privacy noise sensitivity. | Float (0.5) |
| `noise_multiplier` | Differential Privacy noise multiplier. | Float (1.182) |
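Putting the four parameter groups together, the corresponding portion of the model config file might look like the sketch below. The nesting under each group is an assumption; the values are the example defaults listed in the tables above (the batch size and the weight paths are placeholders):

```yaml
# Parameter groups in configs/models/VT5.yml (sketch; nesting assumed,
# values taken from the example defaults in the tables above)
visual_module:
  model: ViT
  model_weights: /path/to/visual/weights  # local path or Hugging Face weights id
  finetune: false
training_parameters:
  lr: 2.0e-4
  batch_size: 16                          # placeholder
fl_parameters:
  sample_clients: 2
  total_clients: 10
  num_rounds: 5
  iterations_per_fl_round: 1
dp_parameters:
  providers_per_fl_round: 50
  sensitivity: 0.5
  noise_multiplier: 1.182
```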
By default, the framework will log all the training and evaluation process to Weights and Biases (wandb). If you prefer not to sync the logs to the wandb servers, you can run wandb in offline mode:
$ (pfl_docvqa) export WANDB_MODE="offline"
Moreover, Flower integrates its own monitoring tools based on Grafana and Prometheus. You can check how to set them up in the Flower Monitor Simulation documentation.
PFL-DocVQA
├── configs
│ ├── datasets
│ │ └── PFL-DocVQA.yml
│ └── models
│ └── VT5.yml
├── datasets
│   └── PFL-DocVQA.py
├── models
│ └── VT5.py
├── communication
│ ├── compute_tensor_size.py
│ ├── log_communication.py
│ └── tests/
├── differential_privacy
│ ├── dp_utils.py
│ └── test_dp_utils.py
├── readme.md
├── environment.yml
├── utils.py
├── utils_parallel.py
├── build_utils.py
├── logger.py
├── checkpoint.py
├── train.py
├── eval.py
└── metrics.py