Prediction of core cancer genes using a hybrid of feature selection and machine learning methods.

Journal: Genetics and molecular research : GMR
Published Date:

Abstract

Machine learning techniques are of great importance in the analysis of microarray expression data, and provide a systematic and promising way to predict core cancer genes. In this study, a hybrid strategy was introduced based on machine learning techniques to select a small set of informative genes, which will lead to improving classification accuracy. First feature filtering algorithms were applied to select a set of top-ranked genes, and then hierarchical clustering and collapsing dense clusters were used to select core cancer genes. Through empirical study, our approach is capable of selecting relatively few core cancer genes while making high-accuracy predictions. The biological significance of these genes was evaluated using systems biology analysis. Extensive functional pathway and network analyses have confirmed findings in previous studies and can bring new insights into common cancer mechanisms.

Authors

  • Y X Liu
    School of Basic Medical Science, Harbin Medical University, Harbin, Heilongjiang, China.
  • N N Zhang
    Modern Laboratory Centre, Harbin Normal University, Harbin, China.
  • Y He
    Network & Information Centre, Harbin Medical University, Harbin, Heilongjiang, China.
  • L J Lun
    College of Computer Science and Information Engineering, Harbin Normal University, Harbin, China.