Multi-scale supervised clustering-based feature selection for tumor classification and identification of biomarkers and targets on genomic data.

Journal: BMC genomics
Published Date:

Abstract

BACKGROUND: The small number of samples and the curse of dimensionality hamper the better application of deep learning techniques for disease classification. Additionally, the performance of clustering-based feature selection algorithms is still far from being satisfactory due to their limitation in using unsupervised learning methods. To enhance interpretability and overcome this problem, we developed a novel feature selection algorithm. In the meantime, complex genomic data brought great challenges for the identification of biomarkers and therapeutic targets. The current some feature selection methods have the problem of low sensitivity and specificity in this field.

Authors

  • Da Xu
    School of Mathematics and Statistics, Shandong University, Weihai, 264209, China.
  • Jialin Zhang
    School of Mathematics and Statistics, Shandong University, Weihai, 264209, China.
  • Hanxiao Xu
    School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710071, PR China.
  • Yusen Zhang
    School of Mathematics and Statistics, Shandong University at Weihai, Weihai 264209, China. Electronic address: zhangys@sdu.edu.cn.
  • Wei Chen
    Department of Urology, Zigong Fourth People's Hospital, Sichuan, China.
  • Rui Gao
    School of Control Science and Engineering, Shandong University, Jinan, China.
  • Matthias Dehmer
    Department of Mechatronics and Biomedical Computer Science, UMIT, Hall in Tyrol, IL, Austria.