Episode Details
Back to Episodes241: Foundation Models in Pathology: Strong on Paper, Ready for Labs?
Description
Are pathology foundation models actually ready for labs, or are they still stronger on paper than in practice?
In this episode of DigiPath Digest #49, I unpack a timely review on pathology foundation models and ask the question that matters most to me: not just what these models can do, but what has to be true before they are genuinely useful in real pathology workflows.
I walk through how pathology AI moved from narrow, task-specific models into the era of transformer-based foundation models. That shift matters because pathology is no longer only about looking at H&E in isolation. Today, pathologists are expected to integrate morphology, immunohistochemistry, molecular assays, genomics, and clinical context. That growing complexity is one reason foundation models are getting so much attention.
In this discussion, I explain how transformers entered pathology, why image patches are treated like tokens, and how shared embeddings can support classification, regression, segmentation, and multimodal retrieval. I also go through the major pathology foundation models mentioned in the paper, including Virchow/Virchow2, Mayo Clinic Atlas, UNI, CONCH, H-Optimus, GigaPath, and TITAN, and why scale alone is not the full story.
A big part of this episode is about the gap between benchmark performance and clinical readiness. I talk about the persistent limitations in training data diversity, the overuse of TCGA, and why public benchmarks can still miss what real pathology practice looks like. I also cover where foundation models still struggle, especially in cytopathology, hematopathology, and underrepresented disease areas, along with the real-world problems of artifacts, domain shift, concept drift, infrastructure burden, regulatory complexity, and workflow disruption.
For me, one of the most important themes is this: AI in pathology should augment, not replace, pathologists. The future is not about handing diagnosis to a model. It is about building tools that support pathologists better, fit real workflows, and can be validated in ways that deserve trust.
I also spend time on what comes next: explainable AI, counterfactual explanations, conversational interfaces, retrieval-augmented systems, multimodal fusion, and the need for deployment-centric validation rather than paper-only excitement.
If you are trying to understand where pathology foundation models really stand today, this episode will help you separate the promise from the practical barriers.
Episode Highlights
00:01 – Why I chose this paper, what is changing at Digital Pathology Place, and why foundation models are worth paying attention to now.
02:15 – The core questions: what pathology foundation models are, where they are, and how difficult they are to apply in pathology.
04:50 – Why pathology is becoming more cognitively demanding, and how multimodal complexity is driving interest in scalable AI.
07:02 – From narrow AI to transformers: how pathology moved beyond single-task CNN models.
10:16 – How transformers work in pathology: image patches as tokens, self-attention, embeddings, and downstream tasks.
14:16 – Why multimodality matters, and what kinds of data foundation models may eventually integrate.
15:27 – Timeline of key model developments, from “Attention Is All You Need” to gigapixel-scale pathology foundation models.
17:13 – The leading models and what scale really looks like: Virchow, Mayo Clinic Atlas, UNI, CONCH, H-Optimus, and GigaPath.
19:51 – Why dataset diversity matters more than sheer volume, and why TCGA is not enough.
23:17 – Where foundation models still struggle: cytopathology, hematopathology, rare disease, artifacts,