# Algonauts 2023

Code for the [CMI-DAIR submission](https://arxiv.org/abs/2308.02351) to the [Algonauts 2023 Challenge](http://algonauts.csail.mit.edu/) (team "BlobGPT").

<p align="center">
  <img src=".github/arch.svg" alt="model architecture" width="360">
</p>

Our model consists of a multi-subject linear encoding head attached to a pretrained trunk model. The multi-subject head has three components: (1) a shared multi-layer feature projection, (2) shared plus subject-specific low-dimensional linear transformations, and (3) a shared frozen PCA embedding. The feature projection is "factorized" as a 1x1 convolution followed by learned depthwise spatial pooling.
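
As a rough illustration of these three components, here is a minimal PyTorch sketch of the head. The class, argument names, and shapes are assumptions for exposition, not the exact implementation in this repo.

```python
import torch
import torch.nn as nn


class MultiSubjectHead(nn.Module):
    """Illustrative sketch of the multi-subject linear encoding head."""

    def __init__(self, in_dim, embed_dim, num_patches, num_subjects, pca_weight):
        super().__init__()
        # (1) Shared factorized feature projection: a 1x1 convolution over
        # channels, then learned depthwise pooling over spatial positions.
        self.channel_proj = nn.Conv2d(in_dim, embed_dim, kernel_size=1)
        self.spatial_pool = nn.Parameter(
            torch.full((embed_dim, num_patches), 1.0 / num_patches)
        )
        # (2) Shared plus subject-specific low-dimensional linear maps.
        self.shared = nn.Linear(embed_dim, embed_dim)
        self.subject_specific = nn.ModuleList(
            [nn.Linear(embed_dim, embed_dim) for _ in range(num_subjects)]
        )
        # (3) Shared frozen PCA embedding mapping the low-dimensional space
        # to voxel space; pca_weight is assumed to be (num_voxels, embed_dim).
        self.pca = nn.Linear(embed_dim, pca_weight.shape[0], bias=False)
        with torch.no_grad():
            self.pca.weight.copy_(pca_weight)
        self.pca.weight.requires_grad_(False)

    def forward(self, features, subject_id):
        # features: (B, in_dim, H, W) feature map from the trunk
        x = self.channel_proj(features)          # (B, embed_dim, H, W)
        x = x.flatten(2)                         # (B, embed_dim, H*W)
        x = (x * self.spatial_pool).sum(dim=-1)  # depthwise spatial pooling
        x = self.shared(x) + self.subject_specific[subject_id](x)
        return self.pca(x)                       # (B, num_voxels)
```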

Our submission model used an `eva02_base_patch14_224.mim_in22k` trunk from [timm](https://github.com/huggingface/pytorch-image-models). We first trained the encoding head only with the trunk frozen (phase 1). Then we unfroze the trunk's attention blocks and fine-tuned the model end-to-end (phase 2). See [our report](https://arxiv.org/abs/2308.02351) for more details.
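
As a sketch of this two-phase setup (again illustrative, not the repo's training code), the trunk can be created with timm and selectively frozen; matching attention parameters by the `.attn.` substring is an assumption about the EVA-02 parameter names.

```python
import timm

# Phase 1: pretrained trunk, fully frozen; only the encoding head trains.
trunk = timm.create_model(
    "eva02_base_patch14_224.mim_in22k", pretrained=True, num_classes=0
)
trunk.requires_grad_(False)

# Phase 2: unfreeze the attention blocks for partial end-to-end fine-tuning.
for name, param in trunk.named_parameters():
    if ".attn." in name:
        param.requires_grad = True
```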

## Results

| Model | Val score | Test score | Config | Weights |
| --- | --- | --- | --- | --- |
| GroupLin-P1 | 20.4% | 58.8% | [config](config/phase1_head_only.yaml) | [weights](https://github.com/cmi-dair/algonauts23/releases/download/v0.1.0/grouplin_phase1.pt) |
| GroupLin-P2 | 20.9% | 60.3% | [config](config/phase2_finetune.yaml) | [weights](https://github.com/cmi-dair/algonauts23/releases/download/v0.1.0/grouplin_phase2.pt) |

**Val score**: median R<sup>2</sup> on our validation set

**Test score**: mean noise-normalized R<sup>2</sup> on the official challenge test set
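
For reference, the snippet below sketches how these two scores can be computed. It assumes R<sup>2</sup> means the squared per-voxel Pearson correlation and that `noise_ceiling` holds the per-voxel noise ceilings distributed with the challenge data; the function and variable names are hypothetical, not from this repo.

```python
import numpy as np


def voxelwise_r2(pred, target):
    """Squared Pearson correlation per voxel.

    pred, target: arrays of shape (num_samples, num_voxels).
    """
    pred = pred - pred.mean(axis=0)
    target = target - target.mean(axis=0)
    r = (pred * target).sum(axis=0) / np.sqrt(
        (pred**2).sum(axis=0) * (target**2).sum(axis=0)
    )
    return r**2


# Val score: median R^2 across voxels on the validation split, e.g.
#   np.median(voxelwise_r2(pred_val, fmri_val))
# Test score: mean noise-normalized R^2, e.g.
#   np.mean(voxelwise_r2(pred_test, fmri_test) / noise_ceiling)
```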

## Installation

Clone the repository.

```bash
git clone https://github.com/cmi-dair/algonauts23.git
```

Create a new environment.

```bash
cd algonauts23
python3.10 -m venv --prompt algonauts23 .venv
source .venv/bin/activate
pip install -U pip
```

Install the dependencies.

```bash
pip install -r requirements.txt
```

Install the package.

```bash
pip install .
```

## Dataset preparation

Follow the steps [here](dataset/) to download and prepare the data for training.

## Training

### WandB setup (optional)

We used [WandB](https://wandb.ai/) for experiment tracking. To set up WandB, create an account if you don't have one already and get your [API key](https://docs.wandb.ai/quickstart#common-questions). Then run these commands before launching training; the `$opts` variable is passed through to the training commands below.

```bash
export WANDB_API_KEY="XXXXXXX"
wandb login
opts="--wandb"
```

### Download PCA weights

Our model uses a frozen group PCA embedding. You can download the weights [here](https://github.com/cmi-dair/algonauts23/releases/download/v0.1.0/group_pca_d-2048.pt).

You can also re-run the group PCA using [`scripts/fit_group_pca.py`](scripts/fit_group_pca.py).
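
For example, the released weights can be fetched and inspected with torch; the file's internal layout is not documented here, so the snippet only downloads and loads it.

```python
import torch

url = (
    "https://github.com/cmi-dair/algonauts23/releases/download/"
    "v0.1.0/group_pca_d-2048.pt"
)
torch.hub.download_url_to_file(url, "group_pca_d-2048.pt")

# Load on CPU and inspect the checkpoint contents.
state = torch.load("group_pca_d-2048.pt", map_location="cpu")
print(type(state))
```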

### Phase 1: Multi-subject linear head only

In the first phase of training, we train only the multi-subject linear head, keeping the trunk model frozen.

```bash
python scripts/train_group_encoder.py config/phase1_head_only.yaml \
  --out_dir results --workers 4 $opts
```

### Phase 2: Partial fine-tuning

Next, we partially fine-tune the full model, starting from the best checkpoint of the first phase.

```bash
# Path to the best phase 1 checkpoint
ckpt_run="PHASE1_RUN_NAME"
ckpt="results/algonauts23-group-encoder/${ckpt_run}/checkpoints/ckpt-best.pt"

python scripts/train_group_encoder.py config/phase2_finetune.yaml \
  --out_dir results --ckpt $ckpt --workers 4 $opts
```

## Submission

Run the [`zip_submission.sh`](scripts/zip_submission.sh) script to prepare a zip file for submission to the [leaderboard](https://codalab.lisn.upsaclay.fr/competitions/9304).

```bash
./scripts/zip_submission.sh RESULT_DIR
```

## Acknowledgements

This code was built with elements and inspiration from [timm](https://github.com/huggingface/pytorch-image-models).

## Citation

If you find this repository helpful, please consider citing:

```bibtex
@article{lane2023algonauts,
  author  = {Connor Lane and Gregory Kiar},
  title   = {A Parameter-efficient Multi-subject Model for Predicting fMRI Activity},
  journal = {arXiv preprint arXiv:2308.02351},
  year    = {2023},
}
```