Here we include some scripts for organizing the Algonauts 2023 dataset in the Hugging Face format.
Run download_data.sh to download the official challenge data. Make sure you have gdown installed.

    bash download_data.sh

You can also copy or link the challenge data to algonauts_2023_challenge_data/ if you already have it downloaded.
Generate private splits derived from the official training split. This will generate a list of indices for each subject and split (train, val, testval), saved in npy format in derived_splits/.

    python generate_splits.py

Generate a processed Hugging Face dataset for each split at a fixed image size. Note that this may take a few hours.

    splits="train val testval test"
    for split in $splits; do
        python generate_dataset.py --split "$split" --img_size 256 --workers 16
    done
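
To make the private-split step more concrete, here is a minimal sketch of how disjoint train/val/testval index lists could be derived from a training set and saved as npy files. It is not the actual logic of generate_splits.py: the function names, the split fractions, the seed, and the file naming pattern are all assumptions made for illustration.

```python
from pathlib import Path

import numpy as np


def make_splits(n_samples, val_frac=0.1, testval_frac=0.1, seed=0):
    """Shuffle indices 0..n_samples-1 and cut them into three disjoint splits.

    The fractions and seed are illustrative defaults, not the values
    used by the real script.
    """
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n_samples)
    n_val = int(round(n_samples * val_frac))
    n_testval = int(round(n_samples * testval_frac))
    return {
        "val": np.sort(perm[:n_val]),
        "testval": np.sort(perm[n_val:n_val + n_testval]),
        "train": np.sort(perm[n_val + n_testval:]),
    }


def save_splits(splits, subject, out_dir="derived_splits"):
    """Save one npy file of indices per split for a given subject."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = {}
    for name, idx in splits.items():
        # Hypothetical naming scheme; the real files may be named differently.
        path = out / f"{subject}_{name}_indices.npy"
        np.save(path, idx)
        paths[name] = path
    return paths
```

A saved file can be read back with `np.load(path)` and used to index into the official training data for that subject. Fixing the seed keeps the splits reproducible across runs.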