Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform.

Journal: BMC bioinformatics
Published Date:

Abstract

BACKGROUND: The aim of gene expression-based clinical modelling in tumorigenesis is not only to accurately predict the clinical endpoints, but also to reveal the genome characteristics for downstream analysis for the purpose of understanding the mechanisms of cancers. Most of the conventional machine learning methods involved a gene filtering step, in which tens of thousands of genes were firstly filtered based on the gene expression levels by a statistical method with an arbitrary cutoff. Although gene filtering procedure helps to reduce the feature dimension and avoid overfitting, there is a risk that some pathogenic genes important to the disease will be ignored.

Authors

  • Yiru Zhao
    College of Computer Science, Sichuan University, Chengdu, 610064, Sichuan, China.
  • Yifan Zhou
    Department of Pharmacology, University of Oxford, Oxford, United Kingdom.
  • Yuan Liu
    Department of General Surgery, Wuxi People's Hospital Affiliated to Nanjing Medical University, Wuxi, China.
  • Yinyi Hao
    College of Chemistry, Sichuan University, Chengdu, 610064, Sichuan, China.
  • Menglong Li
    College of Chemistry, Sichuan University, Chengdu 610064, PR China. Electronic address: liml@scu.edu.cn.
  • Xuemei Pu
    College of Chemistry, Sichuan University Chengdu 610064 People's Republic of China xmpuscu@scu.edu.cn +86 028 8541 2290.
  • Chuan Li
    State Key Laboratory for Molecular Virology and Genetic Engineering, National Institute for Viral Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing 102206, China.
  • Zhining Wen
    College of Chemistry, Sichuan University, Chengdu, Sichuan, China.