Interactive phenotyping of large-scale histology imaging data with HistomicsML.

Journal: Scientific reports
PMID:

Abstract

Whole-slide imaging of histologic sections captures tissue microenvironments and cytologic details in expansive high-resolution images. These images can be mined to extract quantitative features that describe tissues, yielding measurements for hundreds of millions of histologic objects. A central challenge in utilizing this data is enabling investigators to train and evaluate classification rules for identifying objects related to processes like angiogenesis or immune response. In this paper we describe HistomicsML, an interactive machine-learning system for digital pathology imaging datasets. This framework uses active learning to direct user feedback, making classifier training efficient and scalable in datasets containing 10+ histologic objects. We demonstrate how this system can be used to phenotype microvascular structures in gliomas to predict survival, and to explore the molecular pathways associated with these phenotypes. Our approach enables researchers to unlock phenotypic information from digital pathology datasets to investigate prognostic image biomarkers and genotype-phenotype associations.

Authors

  • Michael Nalisnik
    Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, USA.
  • Mohamed Amgad
    Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, 30322, USA.
  • Sanghoon Lee
    Department of Biomedical Informatics, University of Pittsburgh, 5607 Baum Blvd, Pittsburgh, PA 15206, USA.
  • Sameer H Halani
    Emory University School of Medicine, Atlanta, GA, 30322, USA.
  • Jose Enrique Velazquez Vega
    Department of Pathology and Laboratory Medicine, Emory University School of Medicine, Atlanta, GA, 30322, USA.
  • Daniel J Brat
    Department of Pathology and Laboratory Medicine, Emory University School of Medicine, Atlanta, GA, 30322, USA.
  • David A Gutman
    Department of Neurology, Emory University School of Medicine, Atlanta, GA, 30322, USA.
  • Lee A D Cooper
    Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, 30322, USA. lee.cooper@emory.edu.