Comparative analysis of machine learning approaches to classify tumor mutation burden in lung adenocarcinoma using histopathology images.

Journal: Scientific reports
PMID:

Abstract

Both histologic subtypes and tumor mutation burden (TMB) represent important biomarkers in lung cancer, with implications for patient prognosis and treatment decisions. Typically, TMB is evaluated by comprehensive genomic profiling but this requires use of finite tissue specimens and costly, time-consuming laboratory processes. Histologic subtype classification represents an established component of lung adenocarcinoma histopathology, but can be challenging and is associated with substantial inter-pathologist variability. Here we developed a deep learning system to both classify histologic patterns in lung adenocarcinoma and predict TMB status using de-identified Hematoxylin and Eosin (H&E) stained whole slide images. We first trained a convolutional neural network to map histologic features across whole slide images of lung cancer resection specimens. On evaluation using an external data source, this model achieved patch-level area under the receiver operating characteristic curve (AUC) of 0.78-0.98 across nine histologic features. We then integrated the output of this model with clinico-demographic data to develop an interpretable model for TMB classification. The resulting end-to-end system was evaluated on 172 held out cases from TCGA, achieving an AUC of 0.71 (95% CI 0.63-0.80). The benefit of using histologic features in predicting TMB is highlighted by the significant improvement this approach offers over using the clinical features alone (AUC of 0.63 [95% CI 0.53-0.72], p = 0.002). Furthermore, we found that our histologic subtype-based approach achieved performance similar to that of a weakly supervised approach (AUC of 0.72 [95% CI 0.64-0.80]). Together these results underscore that incorporating histologic patterns in biomarker prediction for lung cancer provides informative signals, and that interpretable approaches utilizing these patterns perform comparably with less interpretable, weakly supervised approaches.

Authors

  • Apaar Sadhwani
    Google Health, Google LLC, Palo Alto, California, United States of America.
  • Huang-Wei Chang
    Verily Life Sciences, South San Francisco, CA, USA.
  • Ali Behrooz
    Verily Life Sciences, South San Francisco, CA, USA.
  • Trissia Brown
    Google Health via Advanced Clinical, Deerfield, IL USA.
  • Isabelle Auvigne-Flament
    Google Health via Vituity, Emeryville, CA, USA.
  • Hardik Patel
    Verily Life Sciences, South San Francisco, CA, USA.
  • Robert Findlater
    Verily Life Sciences, South San Francisco, CA, USA.
  • Vanessa Velez
    Verily Life Sciences, South San Francisco, CA, USA.
  • Fraser Tan
    Google Health, Palo Alto, CA USA.
  • Kamilla Tekiela
    Verily Life Sciences, South San Francisco, CA, USA.
  • Ellery Wulczyn
    Google Health, Palo Alto, CA USA.
  • Eunhee S Yi
    Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA.
  • Craig H Mermel
    Google Health, Palo Alto, CA USA.
  • Debra Hanks
    Verily Life Sciences, South San Francisco, CA, USA.
  • Po-Hsuan Cameron Chen
    Google Health, Palo Alto, CA USA.
  • Kimary Kulig
    Verily Life Sciences, South San Francisco, CA, USA.
  • Cory Batenchuk
    Verily Life Sciences, South San Francisco, CA, USA.
  • David F Steiner
    Google Health, Palo Alto, CA USA.
  • Peter Cimermančič
    Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California, USA.