AI4Good Lab @Mila • Montreal 2025 Cohort
Project Cere develops multimodal machine learning models that integrate visual, textual, and audio data to address pressing social challenges. This repository contains our codebase, experiments, and documentation for creating interpretable AI systems with real-world impact.
- Modular Architecture: Easy to extend with new classifiers
- Multiple Classifier Types: Classical ML, Neural Networks, and Ensemble methods
- Flexible Data Pipeline: Support for various fMRI data formats
- Comprehensive Evaluation: Cross-validation, metrics, and visualization
- Hyperparameter Optimization: Built-in grid search capabilities
- Experiment Management: YAML-based configuration system
- Extensible Design: Factory pattern for seamless classifier addition
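
The factory-pattern extension point mentioned in the last bullet is easiest to see in code. Below is a minimal sketch assuming a registry keyed by classifier name; the `CLASSIFIER_REGISTRY`, `register_classifier`, and `create_classifier` names (and the simplified `BaseClassifier` interface) are illustrative assumptions, not the repository's actual API.

```python
# Minimal sketch of a factory-style classifier registry (names are illustrative,
# not the repository's actual API).
from abc import ABC, abstractmethod

class BaseClassifier(ABC):
    """Common interface every classifier is expected to implement."""

    @abstractmethod
    def fit(self, X, y):
        ...

    @abstractmethod
    def predict(self, X):
        ...

# Registry mapping a string name to a classifier class.
CLASSIFIER_REGISTRY = {}

def register_classifier(name):
    """Decorator that adds a classifier class to the registry."""
    def wrapper(cls):
        CLASSIFIER_REGISTRY[name] = cls
        return cls
    return wrapper

@register_classifier("svm")
class SVMClassifier(BaseClassifier):
    def __init__(self, C=1.0):
        self.C = C

    def fit(self, X, y):
        # A real implementation would delegate to scikit-learn here.
        return self

    def predict(self, X):
        return [0 for _ in X]

def create_classifier(name, **kwargs):
    """Factory entry point: look up the class by name and instantiate it."""
    return CLASSIFIER_REGISTRY[name](**kwargs)

# Adding a new classifier then only requires a new @register_classifier class.
clf = create_classifier("svm", C=0.5)
```
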
- Python 3.10+
- Git
Clone the repository:

```bash
git clone https://github.com/marialagakos/AI4Good-MTL-Group-2.git
cd AI4Good-MTL-Group-2
```

Create and activate a virtual environment:

```bash
python -m venv cere-env

# Linux/MacOS
source cere-env/bin/activate

# Windows (PowerShell)
.\cere-env\Scripts\Activate.ps1
```

Install dependencies:

```bash
pip install --upgrade pip
pip install -e .                  # Editable install for development
pip install -r requirements.txt   # Optional: Full dependency install
```
Here's a concise team onboarding guide and command cheat sheet for working with this repository:
Fork the Repository

- Each member creates a personal fork:
  - Go to github.com/marialagakos/AI4Good-MTL-Group-2 → Click "Fork" (top-right).
Clone Your Fork
```bash
git clone https://github.com/YOUR-USERNAME/AI4Good-MTL-Group-2.git
cd AI4Good-MTL-Group-2
```
Set Up Remotes
```bash
git remote add upstream https://github.com/marialagakos/AI4Good-MTL-Group-2.git
git remote -v   # Verify: origin=your fork, upstream=original
```
Sync with Upstream
```bash
git fetch upstream
git checkout main
git merge upstream/main
git push origin main   # Keep your fork updated
```
```
Project-Cere/
├── main.py                      # Main execution script
├── README.md                    # Project documentation
├── data/                        # Raw and processed datasets
│   ├── feature_extraction.py    # Feature extraction utilities
│   ├── DATA_INSTRUCTIONS.md
│   ├── .ipynb_checkpoints/
│   ├── src/
│   │   ├── telepath/
│   │   ├── telepath.egg-info/
│   │   └── temp_audio_chunks/
│   ├── pca_data/
│   ├── loaders.py               # Data loading (fMRI, audio, text, visual)
│   ├── preprocessors.py         # Preprocessing
│   ├── transforms.py            # Transformations
│   ├── fmri/                    # fMRI data
│   ├── audio/                   # Audio samples
│   ├── transcripts/             # Text corpora
│   └── visual/                  # Image/video data
├── models/                      # Classifier implementations
│   ├── base_classifier.py       # Abstract base class
│   ├── classical/               # Traditional ML methods
│   │   ├── svm.py
│   │   ├── random_forest.py
│   │   └── logistic_regression.py
│   ├── neural/                  # Neural network methods
│   │   ├── mlp.py
│   │   ├── cnn.py
│   │   ├── lstm.py
│   │   └── transformer.py
│   └── ensemble/                # Ensemble methods
│       ├── voting.py
│       └── stacking.py
├── utils/                       # Utility functions
│   ├── metrics.py               # Evaluation metrics
│   ├── visualization.py         # Plotting functions
│   └── io_utils.py              # I/O operations
├── experiments/                 # Experiment management
│   ├── experiment_runner.py
│   └── hyperparameter_search.py
├── .gitignore                   # Git ignore rules
├── docs/                        # Technical documentation
├── tests/                       # Unit and integration tests
└── LICENSE.md
```
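
The `experiments/` entries above (`experiment_runner.py`, `hyperparameter_search.py`) pair with the built-in grid-search feature. Here is a minimal sketch of how such a search could be wired with scikit-learn; the parameter grid, scoring choice, and toy data are illustrative assumptions, not the project's actual settings or code.

```python
# Minimal grid-search sketch (illustrative of the hyperparameter-search idea,
# not the actual contents of experiments/hyperparameter_search.py).
import numpy as np
from sklearn.model_selection import GridSearchCV, StratifiedKFold
from sklearn.svm import SVC

# Toy stand-in for extracted fMRI features and binary labels.
X = np.random.rand(60, 16)
y = np.random.randint(0, 2, size=60)

# Hypothetical parameter grid; in this project such settings would live in a YAML config.
param_grid = {"C": [0.1, 1.0, 10.0], "kernel": ["linear", "rbf"]}

search = GridSearchCV(
    estimator=SVC(),
    param_grid=param_grid,
    cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=0),
    scoring="accuracy",
)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```
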
Run the main pipeline:

```bash
python src/main.py --modality all --config configs/default.yaml
```

- `--modality`: Choose `audio`, `text`, `visual`, or `all`
- `--config`: Path to YAML configuration file
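
A rough sketch of how these two flags could be consumed is shown below. The argument names mirror the command above, but the config keys (`classifier`, etc.) are hypothetical and not the actual schema of `configs/default.yaml`.

```python
# Sketch of CLI/config wiring (assumes PyYAML is installed; config keys are hypothetical).
import argparse
import yaml

parser = argparse.ArgumentParser(description="Run Project Cere experiments")
parser.add_argument("--modality", choices=["audio", "text", "visual", "all"], default="all")
parser.add_argument("--config", default="configs/default.yaml")
args = parser.parse_args()

# Load the YAML experiment configuration.
with open(args.config) as f:
    config = yaml.safe_load(f)

# Read a hypothetical key from the config and report what would run.
classifier_name = config.get("classifier", "svm")
print(f"Running modality={args.modality} with classifier={classifier_name}")
```
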
Launch JupyterLab for the notebooks:

```bash
jupyter lab notebooks/
```

Create a feature branch and commit your work:

```bash
git checkout -b feature/your-feature-name   # e.g., feature/login-form
git add .
git commit -m "Description of changes"
git push -u origin feature/your-feature-name
```

Keep your `main` branch in sync with upstream:

```bash
git checkout main
git fetch upstream
git merge upstream/main   # Or use `git rebase upstream/main`
git push origin main
```

Rebase your feature branch on the updated `main`:

```bash
git checkout feature/your-feature-name
git rebase main      # Apply your changes on top of latest updates
git push --force     # Only if you've rebased
```

Open a pull request:

- Go to your fork on GitHub.
- Click "Compare & Pull Request" for your branch.
- Target `marialagakos/AI4Good-MTL-Group-2:main` as the base.
- Never push directly to `upstream` (only PRs).
- Always branch from `main` (no direct commits to `main`).
- Rebase instead of merge to keep history clean (use `git rebase main`).
- Permission denied?

  ```bash
  git remote set-url origin https://github.com/YOUR-USERNAME/AI4Good-MTL-Group-2.git
  ```

- Broken branch?

  ```bash
  git checkout main
  git branch -D broken-branch
  ```
| Phase | Key Deliverables |
|---|---|
| Data Analysis | EDA reports, preprocessing pipelines |
| Modeling | Multimodal fusion architectures |
| Evaluation | Cross-modal attention visualizations |
| Deployment | Flask API for model serving |
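
The deployment deliverable in the table above could look roughly like the following minimal Flask endpoint. The route, payload shape, and `load_model` helper are hypothetical placeholders, not part of the current codebase.

```python
# Hypothetical minimal Flask serving endpoint (illustrative only; the real routes
# and model-loading code are not defined in this repository snapshot).
from flask import Flask, jsonify, request

app = Flask(__name__)

def load_model():
    """Placeholder for loading a trained multimodal classifier."""
    class DummyModel:
        def predict(self, features):
            return [0 for _ in features]
    return DummyModel()

model = load_model()

@app.route("/predict", methods=["POST"])
def predict():
    payload = request.get_json(force=True)
    features = payload.get("features", [])
    prediction = model.predict([features])
    return jsonify({"prediction": prediction[0]})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```
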
- Maria Lagakos - Feature Extraction
- Sophie Strassmann - Creative Director, Pipeline Architecture, and Classification Team
- Yujie Chen - Classification Team
- Keyu Liang - Feature Extraction and Data Migration
- Maria Gallamoso - Feature Extraction Team
- Catherina Medeiros - Director of Imaging, Feature Extraction Team
This project is licensed under the MIT License - see LICENSE.md for details.
We gratefully acknowledge:
- Jennifer Addison and Yosra Kazemi for their expertise and leadership
- The AI4Good Lab Montreal and Mila team for their support
- Our TAs, Hugo Berard and Laetitia Constantin
Consulting Scholars and Mentors:
- Rose Landry - Mila
- Adel Halawa - McGill University
- Dr. Lune Bellec - Université de Montréal
- Dr. Mayada Elsabbagh - Transforming Autism Care Consortium
- The Algonauts Project
- Compute Canada for their computational resources
- The Digital Research Alliance of Canada