Skip to content

dmhdmhdmh/Awesome-Foundation-Models-for-Pathology-Image-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

42 Commits
Β 
Β 

Repository files navigation

Awesome-Foundation-Models-for-Pathology-Image-Analysis

Awesome License: MIT PRs Welcome

πŸ”₯πŸ”₯ This is a collection of awesome articles about Foundation Models in Pathology Image AnalysisπŸ”₯πŸ”₯

Introduction

Foundation models have gained popularity in recent years for a broad range of pathological imaging applications.

With the aim of providing easier access for researchers, this repo contains a comprehensive paper list of Foundation models in Pathology Image Analysis, including papers, codes, and related websites.
We considered a sum of 102 research papers spanning from 2022 to 2026.


papers

(Each section is ordered by the publication dates)

Large-scale Pre-trained Models


Vision Foundation Models


Visual Representation Learning Models

  1. πŸ“œ Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning

    • πŸ“– Conference: CVPR, 2022
    • πŸ“„ PDF
    • πŸ’» Code
  2. πŸ“œ Transformer-based unsupervised contrastive learning for histopathological image classification

    • πŸ“– Journal: Medical Image Analysis, 2022
    • πŸ“„ PDF
    • πŸ’» Code
  3. πŸ“œ Benchmarking Self-Supervised Learning on Diverse Pathology Datasets

    • πŸ“– Conference: CVPR, 2023
    • πŸ“„ PDF
    • πŸ’» Code
  4. πŸ“œ Scaling Self-Supervised Learning for Histopathology with Masked Image Modeling

    • πŸ“– Preprint: MedRxiv, 2023
    • πŸ“„ PDF
  5. πŸ“œ A foundation model for clinical-grade computational pathology and rare cancers detection

    • πŸ“– Journal: Nature Medicine, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  6. πŸ“œ Rotation-Agnostic Image Representation Learning for Digital Pathology

    • πŸ“– Conference: CVPR, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  7. πŸ“œ RudolfV: A Foundation Model by Pathologists for Pathologists

    • πŸ“– Preprint: arXiv, 2024
    • πŸ“„ PDF
  8. πŸ“œ Towards a general-purpose foundation model for computational pathology

    • πŸ“– Journal: Nature Medicine, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  9. πŸ“œ Computational Pathology at Health System Scale- Self-Supervised Foundation Models from Billions of Images

    • πŸ“– AAAI 2024 Spring Symposium
    • πŸ“„ PDF
  10. πŸ“œ A whole-slide foundation model for digital pathology from real-world data

    • πŸ“– Nature 2024
    • πŸ“„ PDF
    • πŸ’» Code
  11. πŸ“œ PLUTO: Pathology-Universal Transformer

    • πŸ“– ICML 2024 FM-Wild Workshop
    • πŸ“„ PDF
  12. πŸ“œ A generalizable pathology foundation model using a unified knowledge distillation pretraining framework

    • πŸ“– Journal: Nature BME 2025
    • πŸ“„ PDF
    • πŸ’» Code
  13. πŸ“œ PathoDuet: Foundation models for pathological slide analysis of H&E and IHC stains

    • πŸ“– Journal: Medical Image Analysis, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  14. πŸ“œ Multistain Pretraining for Slide Representation Learning in Pathology

    • πŸ“– ECCV, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  15. πŸ“œ VIRCHOW 2: SCALING SELF-SUPERVISED MIXED MAGNIFICATION MODELS IN PATHOLOGY

    • πŸ“– Preprint: arXiv, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  16. πŸ“œ Rotation-agnostic image representation learning for digital pathology

    • πŸ“– CVPR, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  17. πŸ“œ Tissue Concepts: supervised foundation models in computational pathology

    • πŸ“– Journal: Computers in Biology and Medicine
    • πŸ“„ PDF
    • πŸ’» Code
  18. πŸ“œ A foundation model for generalizable cancer diagnosis and survival prediction from histopathological images

    • πŸ“– Journal: Nature Communications
    • πŸ“„ PDF
    • πŸ’» Code

Task-specific Pre-trained Vision Models

  1. πŸ“œ Foundation models for fast, label-free detection of glioma infiltration

    • πŸ“– Journal: Nature, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  2. πŸ“œ SegAnyPath: A Foundation Model for Multi-resolution Stain-variant and Multi-task Pathology Image Segmentation

    • πŸ“– Journal: IEEE Transactions on Medical Imaging
    • πŸ“„ PDF
    • πŸ’» Code

Return to List


Multi-modal Foundation Models


Multi-modal Representation Learning Models

  1. πŸ“œ Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images

    • πŸ“– Conference: CVPR, 2023
    • πŸ“„ PDF
    • πŸ’» Code
  2. πŸ“œ A visual-language foundation model for pathology image analysis using medical Twitter

    • πŸ“– Journal: Nature Medicine, 2023
    • πŸ“„ PDF
    • πŸ’» Code
  3. πŸ“œ Quilt-1M: One Million Image-Text Pairs for Histopathology

    • πŸ“– Conference: NeurIPS 2023
    • πŸ“„ PDF
    • πŸ’» Code
  4. πŸ“œ A visual-language foundation model for computational pathology

    • πŸ“– Journal: Nature Medicine, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  5. πŸ“œ Knowledge-Enhanced Visual-Language Pretraining for Computational Pathology

    • πŸ“– Conference: ECCV, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  6. πŸ“œ PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology

    • πŸ“– Preprint: arXiv, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  7. πŸ“œ Transcriptomics-guided Slide Representation Learning in Computational Pathology

    • πŸ“– Conference: CVPR, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  8. πŸ“œ CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

    • πŸ“– Conference: CVPR, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  9. πŸ“œ A Multimodal Knowledge-enhanced Whole-slide Pathology Foundation Model

    • πŸ“– Preprint: arXiv, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  10. πŸ“œ PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration

    • πŸ“– Conference: ICLR, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  11. πŸ“œ Benchmarking PathCLIP for Pathology Image Analysis

    • πŸ“– Journal: Journal of Imaging Informatics in Medicine, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  12. πŸ“œ A pathology foundation model for cancer diagnosis and prognosis prediction

    • πŸ“– Journal: Nature, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  13. πŸ“œ CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology

    • πŸ“– Conference: CVPR, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  14. πŸ“œ A vision–language foundation model for precision oncology

    • πŸ“– Journal: Nature, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  15. πŸ“œ A visual–omics foundation model to bridge histopathology with spatial transcriptomics

    • πŸ“– Journal: Nature Methods, 2025
    • πŸ“„ PDF
    • πŸ’» Code

Return to List


Multi-modal Large Language Models

  1. πŸ“œ PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology

    • πŸ“– Conference: AAAI, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  2. πŸ“œ A multimodal generative AI copilot for human pathology

    • πŸ“– Journal: Nature, 2024
    • πŸ“„ PDF
  3. πŸ“œ PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration

    • πŸ“– Conference: ICLR, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  4. πŸ“œ Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos

    • πŸ“– Conference: CVPR, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  5. πŸ“œ SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding

    • πŸ“– Conference: CVPR, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  6. πŸ“œ CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology

    • πŸ“– Conference: CVPR, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  7. πŸ“œ WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image

    • πŸ“– Conference: ICCV, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  8. πŸ“œ Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner

    • πŸ“– Conference: AAAI, 2026
    • πŸ“„ PDF
    • πŸ’» Code
  9. πŸ“œ Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning

    • πŸ“– Conference: AAAI, 2026
    • πŸ“„ PDF
    • πŸ’» Code

Task-specific Pre-trained Multi-modal Models

  1. πŸ“œ Generating dermatopathology reports from gigapixel whole slide images with HistoGPT
    • πŸ“– Journal: Nature Communications, 2025
    • πŸ“„ PDF
    • πŸ’» Code

Return to List

Adaptation of Foundation Models

Pathological Classification:

  1. πŸ“œ Text-Guided Foundation Model Adaptation for Pathological Image Classification

    • πŸ“– Conference: MICCAI, 2023
    • πŸ“„ PDF
    • πŸ’» Code
  2. πŸ“œ Prompt-MIL: Boosting Multi-instance Learning Schemes via Task-Specific Prompt Tuning

    • πŸ“– Conference: MICCAI, 2023
    • πŸ“„ PDF
    • πŸ’» Code
  3. πŸ“œ CLIPath: Fine-tune CLIP with Visual Feature Fusion for Pathology Image Analysis Towards Minimizing Data Collection Efforts

    • πŸ“– Conference: ICCVW, 2023
    • πŸ“„ PDF
  4. πŸ“œ Prompt-MIL: Boosting Multi-instance Learning Schemes via Task-Specific Prompt Tuning

    • πŸ“– Conference: MICCAI, 2023
    • πŸ“„ PDF
    • πŸ’» Code
  5. πŸ“œ The Rise of AI Language Pathologists: Exploring Two-level Prompt Learning for Few-shot Weakly-supervised Whole Slide Image Classification

    • πŸ“– Conference: NeurIPS, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  6. πŸ“œ Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction

    • πŸ“– Conference: CVPR, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  7. πŸ“œ Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology

    • πŸ“– Conference: CVPR, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  8. πŸ“œ Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning

    • πŸ“– Conference: MICCAI, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  9. πŸ“œ PathoTune: Adapting Visual Foundation Model to Pathological Specialists

    • πŸ“– Conference: MICCAI, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  10. πŸ“œ VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification

    • πŸ“– Journal: IEEE Transactions on Medical Imaging, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  11. πŸ“œ Prompting Vision Foundation Models for Pathology Image Analysis

    • πŸ“– Conference: CVPR, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  12. πŸ“œ ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification

    • πŸ“– Conference: CVPR, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  13. πŸ“œ Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification

    • πŸ“– Conference: ECCV, 2024
    • πŸ“„ PDF
  14. πŸ“œ Prompting Whole Slide Image Based Genetic Biomarker Prediction

    • πŸ“– Conference: MICCAI, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  15. πŸ“œ MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning

    • πŸ“– Conference: IEEE Transactions on Medical Imaging, 2025
    • πŸ“„ PDF
    • πŸ’» Code

Pathological Component Segmentation:

  1. πŸ“œ AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder

    • πŸ“– Conference: BMVC, 2023
    • πŸ“„ PDF
  2. πŸ“œ CellViT: Vision Transformers for precise cell segmentation and classification

    • πŸ“– Journal: Medical Image Analysis, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  3. πŸ“œ All-in-SAM: from Weak Annotation to Pixel-wise Nuclei Segmentation with Prompt-based Finetuning

    • πŸ“– Journal of Physics: Conference Series
    • πŸ“„ PDF
  4. πŸ“œ SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology

    • πŸ“– MICCAI 2023 Workshops
    • πŸ“„ PDF
  5. πŸ“œ TPRO: Text-Prompting-Based Weakly Supervised Histopathology Tissue Segmentation

    • πŸ“– MICCAI 2023
    • πŸ“„ PDF
    • πŸ’» Code
  6. πŸ“œ SPPNet: A Single-Point Prompt Network for Nuclei Image Segmentation

    • πŸ“– MLMI 2023
    • πŸ“„ PDF
    • πŸ’» Code
  7. πŸ“œ Evaluation and Improvement of Segment Anything Model for Interactive Histopathology Image Segmentation

    • πŸ“– MICCAI 2023 Workshops
    • πŸ“„ PDF
    • πŸ’» Code
  8. πŸ“œ Unleashing the Power of Prompt-driven Nucleus Instance Segmentation

    • πŸ“– ECCV 2024
    • πŸ“„ PDF
    • πŸ’» Code
  9. πŸ“œ Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation

    • πŸ“– Preprint: arXiv, 2024
    • πŸ“„ PDF
  10. πŸ“œ WSI-SAM: Multi-resolution Segment Anything Model (SAM) for histopathology whole-slide images

    • πŸ“– MICCAI 2024 Workshop
    • πŸ“„ PDF
    • πŸ’» Code
  11. πŸ“œ GlandSAM: Injecting Morphology Knowledge Into Segment Anything Model for Label-Free Gland Segmentation

    • πŸ“– Journal: IEEE Transactions on Medical Imaging, 2025
    • πŸ“„ PDF
    • πŸ’» Code

Other Applications:

  1. πŸ“œ Improving Mitosis Detection on Histopathology Images Using Large Vision-Language Models

    • πŸ“– ISBI, 2024
    • πŸ“„ PDF
  2. πŸ“œ Zero-Shot Nuclei Detection via Visual-Language Pre-trained Models

    • πŸ“– MICCAI, 2023
    • πŸ“„ PDF
    • πŸ’» Code
  3. πŸ“œ SAMMS: Multi-modality Deep Learning with the Foundation Model for the Prediction of Cancer Patient Survival

    • πŸ“– BIBM, 2024
    • πŸ“„ PDF
  4. πŸ“œ SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification

    • πŸ“– ACM International Conference on Multimedia, 2024
    • πŸ“„ PDF
    • πŸ’» Code
  5. πŸ“œ Automatic Report Generation for Histopathology Images Using Pre-Trained Vision Transformers and BERT

    • πŸ“– ISBI, 2024
    • πŸ“„ PDF
  6. πŸ“œ Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology

    • πŸ“– ICLR, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  7. πŸ“œ Distilled Prompt Learning for Incomplete Multimodal Survival Prediction

    • πŸ“– CVPR, 2025
    • πŸ“„ PDF
    • πŸ’» Code
  8. πŸ“œ ToPoFM: Topology-Guided Pathology Foundation Model for High-Resolution Pathology Image Synthesis with Cellular-Level Control

    • πŸ“– IEEE Transactions on Medical Imaging, 2025
    • πŸ“„ PDF

About

A Comprehensive Survey of Foundation Models in Pathology Image Analysis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors