Survey and comparative assessments of computational multi-omics integrative methods with multiple regulatory networks identifying distinct tumor compositions across pan-cancer data sets.

Journal: Briefings in bioinformatics

Published Date: May 20, 2021

Abstract

Pan-cancer analysis has recently gained wide recognition in cancer research. It categorizes a cancer by its molecular pathology rather than by its organ of origin. Molecular similarities found in multi-omics data across different cancer types can inform both the understanding of biological processes and therapeutic development. Integrated analysis of various genomic data is therefore frequently used to reveal novel genetic and molecular mechanisms. Although a variety of multi-omics clustering algorithms have been proposed in different fields, how these computational clustering methods compare in pan-cancer analysis remains unclear. To increase the utilization of current integrative methods in pan-cancer analysis, we first provide an overview of five popular computational integrative tools: similarity network fusion (SNF), integrative clustering of multiple genomic data types (iCluster), cancer integration via multi-kernel learning (CIMLR), perturbation clustering for data integration and disease subtyping (PINS) and low-rank clustering (LRACluster). Then, a priori interactions in multi-omics data were incorporated to detect prominent molecular patterns in pan-cancer data sets. Finally, we present comparative assessments of these methods and discuss key issues in applying these algorithms. We found that all five methods can identify distinct tumor compositions. The pan-cancer samples can be reclassified into several groups in different proportions. Interestingly, each method can classify the tumors into categories that differ from the original cancer types or subtypes, especially for ovarian serous cystadenocarcinoma (OV) and breast invasive carcinoma (BRCA) tumors. In addition, the clusters produced by all five computational methods show notable prognostic value. Furthermore, 9 recurrent differential genes and 15 common pathway characteristics were identified across all the methods. The results and discussion can help the community select appropriate integrative tools according to different research tasks or aims in pan-cancer analysis.

Authors

Zhuohui Wei

Computer Science and Engineering, South China University of Technology.

Yue Zhang

Department of Ophthalmology, Beijing Hospital, National Center of Gerontology, Institute of Geriatric Medicine, Chinese Academy of Medical Sciences, Beijing, China.

Wanlin Weng

Computer Science and Engineering, South China University of Technology.

Jiazhou Chen

School of Computer Science and Engineering, South China University of Technology, Guangzhou 510000, China.

Hongmin Cai

School of Computer Science & Engineering, South China University of Technology, Guangdong, China. hmcai@scut.edu.cn.

Keywords

Breast Neoplasms; Computational Biology; Cystadenocarcinoma, Serous; Databases, Genetic; Female; Gene Regulatory Networks; Genomics; Humans; Machine Learning; Neoplasms; Ovarian Neoplasms

External Resources

PubMed ID: 32533167
