π₯π₯ This is a collection of awesome articles about Foundation Models in Pathology Image Analysisπ₯π₯
Foundation models have gained popularity in recent years for a broad range of pathological imaging applications.
With the aim of providing easier access for researchers, this repo contains a comprehensive paper list of Foundation models in Pathology Image Analysis, including papers, codes, and related websites.
We considered a sum of 102 research papers spanning from 2022 to 2026.
(Each section is ordered by the publication dates)
-
π Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning
-
π Transformer-based unsupervised contrastive learning for histopathological image classification
-
π Benchmarking Self-Supervised Learning on Diverse Pathology Datasets
-
π Scaling Self-Supervised Learning for Histopathology with Masked Image Modeling
- π Preprint: MedRxiv, 2023
- π PDF
-
π A foundation model for clinical-grade computational pathology and rare cancers detection
-
π Rotation-Agnostic Image Representation Learning for Digital Pathology
-
π RudolfV: A Foundation Model by Pathologists for Pathologists
- π Preprint: arXiv, 2024
- π PDF
-
π Towards a general-purpose foundation model for computational pathology
-
π Computational Pathology at Health System Scale- Self-Supervised Foundation Models from Billions of Images
- π AAAI 2024 Spring Symposium
- π PDF
-
π A whole-slide foundation model for digital pathology from real-world data
-
π PLUTO: Pathology-Universal Transformer
- π ICML 2024 FM-Wild Workshop
- π PDF
-
π A generalizable pathology foundation model using a unified knowledge distillation pretraining framework
-
π PathoDuet: Foundation models for pathological slide analysis of H&E and IHC stains
-
π Multistain Pretraining for Slide Representation Learning in Pathology
-
π VIRCHOW 2: SCALING SELF-SUPERVISED MIXED MAGNIFICATION MODELS IN PATHOLOGY
-
π Rotation-agnostic image representation learning for digital pathology
-
π Tissue Concepts: supervised foundation models in computational pathology
-
π A foundation model for generalizable cancer diagnosis and survival prediction from histopathological images
-
π Foundation models for fast, label-free detection of glioma infiltration
-
π SegAnyPath: A Foundation Model for Multi-resolution Stain-variant and Multi-task Pathology Image Segmentation
-
π Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images
-
π A visual-language foundation model for pathology image analysis using medical Twitter
-
π Quilt-1M: One Million Image-Text Pairs for Histopathology
-
π A visual-language foundation model for computational pathology
-
π Knowledge-Enhanced Visual-Language Pretraining for Computational Pathology
-
π PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology
-
π Transcriptomics-guided Slide Representation Learning in Computational Pathology
-
π CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment
-
π A Multimodal Knowledge-enhanced Whole-slide Pathology Foundation Model
-
π PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration
-
π Benchmarking PathCLIP for Pathology Image Analysis
-
π A pathology foundation model for cancer diagnosis and prognosis prediction
-
π CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology
-
π A visionβlanguage foundation model for precision oncology
-
π A visualβomics foundation model to bridge histopathology with spatial transcriptomics
-
π PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology
-
π A multimodal generative AI copilot for human pathology
- π Journal: Nature, 2024
- π PDF
-
π PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration
-
π Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos
-
π SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding
-
π CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology
-
π WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image
-
π Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner
-
π Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning
- π Generating dermatopathology reports from gigapixel whole slide images with HistoGPT
-
π Text-Guided Foundation Model Adaptation for Pathological Image Classification
-
π Prompt-MIL: Boosting Multi-instance Learning Schemes via Task-Specific Prompt Tuning
-
π CLIPath: Fine-tune CLIP with Visual Feature Fusion for Pathology Image Analysis Towards Minimizing Data Collection Efforts
- π Conference: ICCVW, 2023
- π PDF
-
π Prompt-MIL: Boosting Multi-instance Learning Schemes via Task-Specific Prompt Tuning
-
π The Rise of AI Language Pathologists: Exploring Two-level Prompt Learning for Few-shot Weakly-supervised Whole Slide Image Classification
-
π Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction
-
π Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology
-
π Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning
-
π PathoTune: Adapting Visual Foundation Model to Pathological Specialists
-
π VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification
-
π Prompting Vision Foundation Models for Pathology Image Analysis
-
π ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification
-
π Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification
- π Conference: ECCV, 2024
- π PDF
-
π Prompting Whole Slide Image Based Genetic Biomarker Prediction
-
π MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning
-
π AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder
- π Conference: BMVC, 2023
- π PDF
-
π CellViT: Vision Transformers for precise cell segmentation and classification
-
π All-in-SAM: from Weak Annotation to Pixel-wise Nuclei Segmentation with Prompt-based Finetuning
- π Journal of Physics: Conference Series
- π PDF
-
π SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology
- π MICCAI 2023 Workshops
- π PDF
-
π TPRO: Text-Prompting-Based Weakly Supervised Histopathology Tissue Segmentation
-
π SPPNet: A Single-Point Prompt Network for Nuclei Image Segmentation
-
π Evaluation and Improvement of Segment Anything Model for Interactive Histopathology Image Segmentation
-
π Unleashing the Power of Prompt-driven Nucleus Instance Segmentation
-
π Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation
- π Preprint: arXiv, 2024
- π PDF
-
π WSI-SAM: Multi-resolution Segment Anything Model (SAM) for histopathology whole-slide images
-
π GlandSAM: Injecting Morphology Knowledge Into Segment Anything Model for Label-Free Gland Segmentation
-
π Improving Mitosis Detection on Histopathology Images Using Large Vision-Language Models
- π ISBI, 2024
- π PDF
-
π Zero-Shot Nuclei Detection via Visual-Language Pre-trained Models
-
π SAMMS: Multi-modality Deep Learning with the Foundation Model for the Prediction of Cancer Patient Survival
- π BIBM, 2024
- π PDF
-
π SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification
-
π Automatic Report Generation for Histopathology Images Using Pre-Trained Vision Transformers and BERT
- π ISBI, 2024
- π PDF
-
π Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology
-
π Distilled Prompt Learning for Incomplete Multimodal Survival Prediction
-
π ToPoFM: Topology-Guided Pathology Foundation Model for High-Resolution Pathology Image Synthesis with Cellular-Level Control
- π IEEE Transactions on Medical Imaging, 2025
- π PDF