MedSeg-TTA is a benchmark for test-time adaptation in medical image segmentation. This repository centers on the public leaderboard, the benchmark figures behind it, and the currently available local method implementations organized by paradigm.
π Click here to explore the full leaderboard in detail.
The benchmark unifies medical TTA evaluation around a shared surface that connects source-target dataset pairs, paradigm-level comparisons, and local method code roots.
MedSeg-TTA covers seven modalities and multiple cross-domain source-target pairs spanning MRI, CT, US, PATH, DER, OCT, and CXR.
MedSeg-TTA/
βββ medseg_tta/
βββ site/
βββ feature_level_alignment/
β βββ GraTa/
β βββ DANN/
β βββ UDA-MIMA/
β βββ Testfit/
βββ input_level_transformation/
β βββ SFDA-FSM/
β βββ DLTTA/
β βββ AIF-SFDA/
β βββ STDR/
β βββ RSA/
βββ output_level_regularization/
β βββ DG-TTA/
β βββ SaTTCA/
β βββ UPL-SFDA/
β βββ tent/
βββ prior_estimation/
β βββ ExploringTTA/
β βββ AdaMI/
β βββ PASS/
β βββ ProSFDA/
βββ ASSETS.md
Dataset provenance, third-party code sources, redistribution notes, and license details are documented in ASSETS.md.
If you find this project useful, please cite:
@article{anonymous2025medsegtta,
title = {A Large Scale Benchmark for Test Time Adaptation Methods in Medical Image Segmentation},
author = {Anonymous Authors},
journal = {Anonymous preprint},
year = {2025}
}This project is released under the MIT License. See LICENSE for details.

