ColliderFM is a self-supervised learning project on ColliderML Release 1. The current runtime path is calo-only: build point-cloud views from calo_hits, train a Panda-inspired student-teacher model, save checkpoints and metrics, and generate diagnostics and run-level plots.
The project uses a shared OmegaConf config in config/default.yaml. Runtime scripts keep the merged config as a DictConfig and only convert it to plain containers at JSON or logging boundaries.
src/collider_fm/data.pyloads ColliderML and applies the project split conventionssrc/collider_fm/views.pybuilds the current point-view and masking pipelinesrc/collider_fm/model.pycontains the current point-level student-teacher training scaffoldscripts/train.pyruns training with checkpointing, JSONL metrics, optional Comet logging, and mixed precisionscripts/plot_diagnostics.pyandscripts/plot_training_run.pycover diagnostics and completed-run plottingnotebooks/dataset_walkthrough.ipynb,notebooks/sonata_views.ipynb, andnotebooks/model_walkthrough.ipynbexplain the data and training path
/
|- src/collider_fm/ # Package code for data, views, model, diagnostics, and vendored PTv3 pieces
|- tests/ # Unit tests for the current pipeline
|- scripts/ # Download, inspection, training, smoke-test, and plotting scripts
|- notebooks/ # Newcomer walkthrough notebooks
|- slurm/ # Cluster job scripts for setup, downloads, smoke tests, and training
|- apptainer/ # Container helper scripts
|- Panda_repo/ # Panda reference submodule
|- markdown/ # Project markdown docs except README.md and AGENTS.md
|- README.md # User-facing overview
|- AGENTS.md # Short operational guidance for coding agents
|- pyproject.toml # Project metadata and dependencies
`- uv.lock # Locked dependency resolution for uv
The project targets Python 3.12 and uses uv for dependency management.
uv venv --python 3.12
source .venv/bin/activate
uv sync --devFor cluster-specific setup and SLURM usage, see markdown/HPC.md.
markdown/WORKFLOWS.md: runtime details, config overrides, caching, local commands, outputs, and loggingmarkdown/HPC.md: cluster setup and checked-in SLURM jobsmarkdown/PLAN.md: current roadmap and next prioritiesAGENTS.md: short operational instructions for coding agents
- Panda paper: https://arxiv.org/abs/2512.01324
- ColliderML dataset: https://huggingface.co/datasets/ColliderML/ColliderML
- ColliderML website: https://colliderml.github.io/