Overall survival prediction of non-small cell lung cancer by integrating microarray and clinical data with deep learning.

Journal: Scientific reports
Published Date:

Abstract

Non-small cell lung cancer (NSCLC) is one of the most common lung cancers worldwide. Accurate prognostic stratification of NSCLC can become an important clinical reference when designing therapeutic strategies for cancer patients. With this clinical application in mind, we developed a deep neural network (DNN) combining heterogeneous data sources of gene expression and clinical data to accurately predict the overall survival of NSCLC patients. Based on microarray data from a cohort set (614 patients), seven well-known NSCLC biomarkers were used to group patients into biomarker- and biomarker+ subgroups. Then, by using a systems biology approach, prognosis relevance values (PRV) were then calculated to select eight additional novel prognostic gene biomarkers. Finally, the combined 15 biomarkers along with clinical data were then used to develop an integrative DNN via bimodal learning to predict the 5-year survival status of NSCLC patients with tremendously high accuracy (AUC: 0.8163, accuracy: 75.44%). Using the capability of deep learning, we believe that our prediction can be a promising index that helps oncologists and physicians develop personalized therapy and build the foundation of precision medicine in the future.

Authors

  • Yu-Heng Lai
    Department of Chemistry, Chinese Culture University, Taipei, 11114, Taiwan.
  • Wei-Ning Chen
    Department of Electrical Engineering, National Tsing Hua University, Hsinchu, 30013, Taiwan.
  • Te-Cheng Hsu
    Department of Electrical Engineering, National Tsing Hua University, Hsinchu, 30013, Taiwan.
  • Che Lin
    Department of Electrical Engineering and Graduate Institute of Communication Engineering, National Taiwan University, Taipei, 10617, Taiwan. che.lin@gmail.com.
  • Yu Tsao
  • Semon Wu
    Department of Life Science, Chinese Culture University, Taipei, 11114, Taiwan.