Awesome-Foundation-Models-for-Pathology-Image-Analysis

🔥🔥 This is a collection of awesome articles about Foundation Models in Pathology Image Analysis🔥🔥

Introduction

Foundation models have gained popularity in recent years for a broad range of pathological imaging applications.

With the aim of providing easier access for researchers, this repo contains a comprehensive paper list of Foundation models in Pathology Image Analysis, including papers, codes, and related websites.
We considered a sum of 102 research papers spanning from 2022 to 2026.

papers

Large-scale Pre-trained Models
- Vision Foundation Models
  - Visual Representation Learning Models
  - Task-specific Pre-trained Vision Models
- Multi-modal Foundation Models
Adaptation of Foundation Models

(Each section is ordered by the publication dates)

Large-scale Pre-trained Models

Vision Foundation Models

Visual Representation Learning Models

📜 Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning
- 📖 Conference: CVPR, 2022
- 📄 PDF
- 💻 Code
📜 Transformer-based unsupervised contrastive learning for histopathological image classification
- 📖 Journal: Medical Image Analysis, 2022
- 📄 PDF
- 💻 Code
📜 Benchmarking Self-Supervised Learning on Diverse Pathology Datasets
- 📖 Conference: CVPR, 2023
- 📄 PDF
- 💻 Code
📜 Scaling Self-Supervised Learning for Histopathology with Masked Image Modeling
- 📖 Preprint: MedRxiv, 2023
- 📄 PDF
📜 A foundation model for clinical-grade computational pathology and rare cancers detection
- 📖 Journal: Nature Medicine, 2024
- 📄 PDF
- 💻 Code
📜 Rotation-Agnostic Image Representation Learning for Digital Pathology
- 📖 Conference: CVPR, 2024
- 📄 PDF
- 💻 Code
📜 RudolfV: A Foundation Model by Pathologists for Pathologists
- 📖 Preprint: arXiv, 2024
- 📄 PDF
📜 Towards a general-purpose foundation model for computational pathology
- 📖 Journal: Nature Medicine, 2024
- 📄 PDF
- 💻 Code
📜 Computational Pathology at Health System Scale- Self-Supervised Foundation Models from Billions of Images
- 📖 AAAI 2024 Spring Symposium
- 📄 PDF
📜 A whole-slide foundation model for digital pathology from real-world data
- 📖 Nature 2024
- 📄 PDF
- 💻 Code
📜 PLUTO: Pathology-Universal Transformer
- 📖 ICML 2024 FM-Wild Workshop
- 📄 PDF
📜 A generalizable pathology foundation model using a unified knowledge distillation pretraining framework
- 📖 Journal: Nature BME 2025
- 📄 PDF
- 💻 Code
📜 PathoDuet: Foundation models for pathological slide analysis of H&E and IHC stains
- 📖 Journal: Medical Image Analysis, 2024
- 📄 PDF
- 💻 Code
📜 Multistain Pretraining for Slide Representation Learning in Pathology
- 📖 ECCV, 2024
- 📄 PDF
- 💻 Code
📜 VIRCHOW 2: SCALING SELF-SUPERVISED MIXED MAGNIFICATION MODELS IN PATHOLOGY
- 📖 Preprint: arXiv, 2024
- 📄 PDF
- 💻 Code
📜 Rotation-agnostic image representation learning for digital pathology
- 📖 CVPR, 2024
- 📄 PDF
- 💻 Code
📜 Tissue Concepts: supervised foundation models in computational pathology
- 📖 Journal: Computers in Biology and Medicine
- 📄 PDF
- 💻 Code
📜 A foundation model for generalizable cancer diagnosis and survival prediction from histopathological images
- 📖 Journal: Nature Communications
- 📄 PDF
- 💻 Code

Task-specific Pre-trained Vision Models

📜 Foundation models for fast, label-free detection of glioma infiltration
- 📖 Journal: Nature, 2025
- 📄 PDF
- 💻 Code
📜 SegAnyPath: A Foundation Model for Multi-resolution Stain-variant and Multi-task Pathology Image Segmentation
- 📖 Journal: IEEE Transactions on Medical Imaging
- 📄 PDF
- 💻 Code

_{Return to List}

Multi-modal Foundation Models

Multi-modal Representation Learning Models

📜 Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images
- 📖 Conference: CVPR, 2023
- 📄 PDF
- 💻 Code
📜 A visual-language foundation model for pathology image analysis using medical Twitter
- 📖 Journal: Nature Medicine, 2023
- 📄 PDF
- 💻 Code
📜 Quilt-1M: One Million Image-Text Pairs for Histopathology
- 📖 Conference: NeurIPS 2023
- 📄 PDF
- 💻 Code
📜 A visual-language foundation model for computational pathology
- 📖 Journal: Nature Medicine, 2024
- 📄 PDF
- 💻 Code
📜 Knowledge-Enhanced Visual-Language Pretraining for Computational Pathology
- 📖 Conference: ECCV, 2024
- 📄 PDF
- 💻 Code
📜 PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology
- 📖 Preprint: arXiv, 2024
- 📄 PDF
- 💻 Code
📜 Transcriptomics-guided Slide Representation Learning in Computational Pathology
- 📖 Conference: CVPR, 2024
- 📄 PDF
- 💻 Code
📜 CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment
- 📖 Conference: CVPR, 2024
- 📄 PDF
- 💻 Code
📜 A Multimodal Knowledge-enhanced Whole-slide Pathology Foundation Model
- 📖 Preprint: arXiv, 2024
- 📄 PDF
- 💻 Code
📜 PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration
- 📖 Conference: ICLR, 2025
- 📄 PDF
- 💻 Code
📜 Benchmarking PathCLIP for Pathology Image Analysis
- 📖 Journal: Journal of Imaging Informatics in Medicine, 2025
- 📄 PDF
- 💻 Code
📜 A pathology foundation model for cancer diagnosis and prognosis prediction
- 📖 Journal: Nature, 2024
- 📄 PDF
- 💻 Code
📜 CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology
- 📖 Conference: CVPR, 2025
- 📄 PDF
- 💻 Code
📜 A vision–language foundation model for precision oncology
- 📖 Journal: Nature, 2025
- 📄 PDF
- 💻 Code
📜 A visual–omics foundation model to bridge histopathology with spatial transcriptomics
- 📖 Journal: Nature Methods, 2025
- 📄 PDF
- 💻 Code

_{Return to List}

Multi-modal Large Language Models

📜 PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology
- 📖 Conference: AAAI, 2024
- 📄 PDF
- 💻 Code
📜 A multimodal generative AI copilot for human pathology
- 📖 Journal: Nature, 2024
- 📄 PDF
📜 PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration
- 📖 Conference: ICLR, 2025
- 📄 PDF
- 💻 Code
📜 Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos
- 📖 Conference: CVPR, 2024
- 📄 PDF
- 💻 Code
📜 SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding
- 📖 Conference: CVPR, 2025
- 📄 PDF
- 💻 Code
📜 CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology
- 📖 Conference: CVPR, 2025
- 📄 PDF
- 💻 Code
📜 WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image
- 📖 Conference: ICCV, 2025
- 📄 PDF
- 💻 Code
📜 Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner
- 📖 Conference: AAAI, 2026
- 📄 PDF
- 💻 Code
📜 Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning
- 📖 Conference: AAAI, 2026
- 📄 PDF
- 💻 Code

Task-specific Pre-trained Multi-modal Models

📜 Generating dermatopathology reports from gigapixel whole slide images with HistoGPT
- 📖 Journal: Nature Communications, 2025
- 📄 PDF
- 💻 Code

_{Return to List}

Adaptation of Foundation Models

Pathological Classification:

📜 Text-Guided Foundation Model Adaptation for Pathological Image Classification
- 📖 Conference: MICCAI, 2023
- 📄 PDF
- 💻 Code
📜 Prompt-MIL: Boosting Multi-instance Learning Schemes via Task-Specific Prompt Tuning
- 📖 Conference: MICCAI, 2023
- 📄 PDF
- 💻 Code
📜 CLIPath: Fine-tune CLIP with Visual Feature Fusion for Pathology Image Analysis Towards Minimizing Data Collection Efforts
- 📖 Conference: ICCVW, 2023
- 📄 PDF
📜 Prompt-MIL: Boosting Multi-instance Learning Schemes via Task-Specific Prompt Tuning
- 📖 Conference: MICCAI, 2023
- 📄 PDF
- 💻 Code
📜 The Rise of AI Language Pathologists: Exploring Two-level Prompt Learning for Few-shot Weakly-supervised Whole Slide Image Classification
- 📖 Conference: NeurIPS, 2024
- 📄 PDF
- 💻 Code
📜 Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction
- 📖 Conference: CVPR, 2024
- 📄 PDF
- 💻 Code
📜 Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology
- 📖 Conference: CVPR, 2024
- 📄 PDF
- 💻 Code
📜 Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning
- 📖 Conference: MICCAI, 2024
- 📄 PDF
- 💻 Code
📜 PathoTune: Adapting Visual Foundation Model to Pathological Specialists
- 📖 Conference: MICCAI, 2024
- 📄 PDF
- 💻 Code
📜 VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification
- 📖 Journal: IEEE Transactions on Medical Imaging, 2025
- 📄 PDF
- 💻 Code
📜 Prompting Vision Foundation Models for Pathology Image Analysis
- 📖 Conference: CVPR, 2024
- 📄 PDF
- 💻 Code
📜 ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification
- 📖 Conference: CVPR, 2024
- 📄 PDF
- 💻 Code
📜 Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification
- 📖 Conference: ECCV, 2024
- 📄 PDF
📜 Prompting Whole Slide Image Based Genetic Biomarker Prediction
- 📖 Conference: MICCAI, 2024
- 📄 PDF
- 💻 Code
📜 MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning
- 📖 Conference: IEEE Transactions on Medical Imaging, 2025
- 📄 PDF
- 💻 Code

Pathological Component Segmentation:

📜 AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder
- 📖 Conference: BMVC, 2023
- 📄 PDF
📜 CellViT: Vision Transformers for precise cell segmentation and classification
- 📖 Journal: Medical Image Analysis, 2024
- 📄 PDF
- 💻 Code
📜 All-in-SAM: from Weak Annotation to Pixel-wise Nuclei Segmentation with Prompt-based Finetuning
- 📖 Journal of Physics: Conference Series
- 📄 PDF
📜 SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology
- 📖 MICCAI 2023 Workshops
- 📄 PDF
📜 TPRO: Text-Prompting-Based Weakly Supervised Histopathology Tissue Segmentation
- 📖 MICCAI 2023
- 📄 PDF
- 💻 Code
📜 SPPNet: A Single-Point Prompt Network for Nuclei Image Segmentation
- 📖 MLMI 2023
- 📄 PDF
- 💻 Code
📜 Evaluation and Improvement of Segment Anything Model for Interactive Histopathology Image Segmentation
- 📖 MICCAI 2023 Workshops
- 📄 PDF
- 💻 Code
📜 Unleashing the Power of Prompt-driven Nucleus Instance Segmentation
- 📖 ECCV 2024
- 📄 PDF
- 💻 Code
📜 Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation
- 📖 Preprint: arXiv, 2024
- 📄 PDF
📜 WSI-SAM: Multi-resolution Segment Anything Model (SAM) for histopathology whole-slide images
- 📖 MICCAI 2024 Workshop
- 📄 PDF
- 💻 Code
📜 GlandSAM: Injecting Morphology Knowledge Into Segment Anything Model for Label-Free Gland Segmentation
- 📖 Journal: IEEE Transactions on Medical Imaging, 2025
- 📄 PDF
- 💻 Code

Other Applications:

📜 Improving Mitosis Detection on Histopathology Images Using Large Vision-Language Models
- 📖 ISBI, 2024
- 📄 PDF
📜 Zero-Shot Nuclei Detection via Visual-Language Pre-trained Models
- 📖 MICCAI, 2023
- 📄 PDF
- 💻 Code
📜 SAMMS: Multi-modality Deep Learning with the Foundation Model for the Prediction of Cancer Patient Survival
- 📖 BIBM, 2024
- 📄 PDF
📜 SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification
- 📖 ACM International Conference on Multimedia, 2024
- 📄 PDF
- 💻 Code
📜 Automatic Report Generation for Histopathology Images Using Pre-Trained Vision Transformers and BERT
- 📖 ISBI, 2024
- 📄 PDF
📜 Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology
- 📖 ICLR, 2025
- 📄 PDF
- 💻 Code
📜 Distilled Prompt Learning for Incomplete Multimodal Survival Prediction
- 📖 CVPR, 2025
- 📄 PDF
- 💻 Code
📜 ToPoFM: Topology-Guided Pathology Foundation Model for High-Resolution Pathology Image Synthesis with Cellular-Level Control
- 📖 IEEE Transactions on Medical Imaging, 2025
- 📄 PDF

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome-Foundation-Models-for-Pathology-Image-Analysis

Introduction

papers

Large-scale Pre-trained Models

Vision Foundation Models

Visual Representation Learning Models

Task-specific Pre-trained Vision Models

Multi-modal Foundation Models

Multi-modal Representation Learning Models

Multi-modal Large Language Models

Task-specific Pre-trained Multi-modal Models

Adaptation of Foundation Models

Pathological Classification:

Pathological Component Segmentation:

Other Applications:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Awesome-Foundation-Models-for-Pathology-Image-Analysis

Introduction

papers

Large-scale Pre-trained Models

Vision Foundation Models

Visual Representation Learning Models

Task-specific Pre-trained Vision Models

Multi-modal Foundation Models

Multi-modal Representation Learning Models

Multi-modal Large Language Models

Task-specific Pre-trained Multi-modal Models

Adaptation of Foundation Models

Pathological Classification:

Pathological Component Segmentation:

Other Applications:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages