Stable feature selection based on the ensemble L -norm support vector machine for biomarker discovery.

Journal: BMC genomics
Published Date:

Abstract

BACKGROUND: Lately, biomarker discovery has become one of the most significant research issues in the biomedical field. Owing to the presence of high-throughput technologies, genomic data, such as microarray data and RNA-seq, have become widely available. Many kinds of feature selection techniques have been applied to retrieve significant biomarkers from these kinds of data. However, they tend to be noisy with high-dimensional features and consist of a small number of samples; thus, conventional feature selection approaches might be problematic in terms of reproducibility.

Authors

  • Myungjin Moon
    Department of Computational Biology and Medical Sciences, Graduate school of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba-ken, 277-8562, Japan.
  • Kenta Nakai
    Department of Computational Biology and Medical Sciences, Graduate school of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba-ken, 277-8562, Japan. knakai@ims.u-tokyo.ac.jp.