Comparative analysis of high- and low-level deep learning approaches in microsatellite instability prediction.

Journal: Scientific reports
PMID:

Abstract

Deep learning-based approaches in histopathology can be largely divided into two categories: a high-level approach using an end-to-end model and a low-level approach using feature extractors. Although the advantages and disadvantages of both approaches are empirically well known, there exists no scientific basis for choosing a specific approach in research, and direct comparative analysis of the two approaches has rarely been performed. Using the Cancer Genomic Atlas (TCGA)-based dataset, we compared these two different approaches in microsatellite instability (MSI) prediction and analyzed morphological image features associated with MSI. Our high-level approach was based solely on EfficientNet, while our low-level approach relied on LightGBM and multiple deep learning models trained on publicly available multiclass tissue, nuclei, and gland datasets. We compared their performance and important image features. Our high-level approach showed superior performance compared to our low-level approach. In both approaches, debris, lymphocytes, and necrotic cells were revealed as important features of MSI, which is consistent with clinical knowledge. Then, during qualitative analysis, we discovered the weaknesses of our low-level approach and demonstrated that its performance can be improved by using different image features in a complementary way. We performed our study using open-access data, and we believe this study can serve as a useful basis for discovering imaging biomarkers for clinical application.

Authors

  • Jeonghyuk Park
    VUNO Inc., Seocho-gu, Seoul, South Korea.
  • Yul Ri Chung
    Department of Pathology, Green Cross Laboratories, Yongin, Gyeonggi, South Korea.
  • Akinao Nose
    Department of Physics, Graduate School of Science, The University of Tokyo, Tokyo, Japan.