A clinical benchmark of public self-supervised pathology foundation models.

Journal: Nature communications
PMID:

Abstract

The use of self-supervised learning to train pathology foundation models has increased substantially in the past few years. Notably, several models trained on large quantities of clinical data have been made publicly available in recent months. This will significantly enhance scientific research in computational pathology and help bridge the gap between research and clinical deployment. With the increase in availability of public foundation models of different sizes, trained using different algorithms on different datasets, it becomes important to establish a benchmark to compare the performance of such models on a variety of clinically relevant tasks spanning multiple organs and diseases. In this work, we present a collection of pathology datasets comprising clinical slides associated with clinically relevant endpoints including cancer diagnoses and a variety of biomarkers generated during standard hospital operation from three medical centers. We leverage these datasets to systematically assess the performance of public pathology foundation models and provide insights into best practices for training foundation models and selecting appropriate pretrained models. To enable the community to evaluate their models on our clinical datasets, we make available an automated benchmarking pipeline for external use.

Authors

  • Gabriele Campanella
    Weill Cornell Medicine, New York, USA; Department of Medical Physics, Memorial Sloan Kettering Cancer Center, New York, USA.
  • Shengjia Chen
    Windreich Department of AI and Human Health, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA.
  • Manbir Singh
    The Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, United States.
  • Ruchika Verma
  • Silke Muehlstedt
    Windreich Department of AI and Human Health, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA.
  • Jennifer Zeng
    Department of Pathology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA.
  • Aryeh Stock
    Department of Pathology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA.
  • Matt Croken
    Department of Pathology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA.
  • Brandon Veremis
    Department of Pathology, Molecular and Cell-based Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA.
  • Abdulkadir Elmas
    Department of Electrical Engineering, Columbia University, New York, NY, United States of America.
  • Ivan Shujski
    Department of Clinical Pathology, Sahlgrenska University Hospital, Gothenburg, Sweden.
  • Noora Neittaanmaki
  • Kuan-Lin Huang
    Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA.
  • Ricky Kwan
    Department of Pathology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA.
  • Jane Houldsworth
    Department of Pathology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA.
  • Adam J Schoenfeld
    Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, 10065, NY, USA.
  • Chad Vanderbilt
    Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA.