Integrating multi-omics data through deep learning for accurate cancer prognosis prediction.

Journal: Computers in biology and medicine
Published Date:

Abstract

BACKGROUND: Genomic information is nowadays widely used for precise cancer treatments. Since the individual type of omics data only represents a single view that suffers from data noise and bias, multiple types of omics data are required for accurate cancer prognosis prediction. However, it is challenging to effectively integrate multi-omics data due to the large number of redundant variables but relatively small sample size. With the recent progress in deep learning techniques, Autoencoder was used to integrate multi-omics data for extracting representative features. Nevertheless, the generated model is fragile from data noises. Additionally, previous studies usually focused on individual cancer types without making comprehensive tests on pan-cancer. Here, we employed the denoising Autoencoder to get a robust representation of the multi-omics data, and then used the learned representative features to estimate patients' risks.

Authors

  • Hua Chai
    Faculty of Information Technology & State Key Laboratory of Quality Research in Chinese Medicines, Macau University of Science and Technology, Avenida Wai Long,Taipa, Macau, 999078, China.
  • Xiang Zhou
    Department of Sociology, Harvard University, Cambridge, Massachusetts, USA.
  • Zhongyue Zhang
    School of Data and Computer Science , Sun Yat-sen University , Guangzhou 510006 , China.
  • Jiahua Rao
    School of Data and Computer Science , Sun Yat-sen University , Guangzhou 510006 , China.
  • Huiying Zhao
    Sun Yat-sen Memorial Hospital, Sun Yat-sen University, 107 Yan Jiang West Road, Guangzhou 510120, China.
  • Yuedong Yang
    Institute for Glycomics and School of Information and Communication Technique, Griffith University, Parklands Dr. Southport, QLD 4222, Australia.