scFSNN: a feature selection method based on neural network for single-cell RNA-seq data.

Journal: BMC genomics
PMID:

Abstract

While single-cell RNA sequencing (scRNA-seq) allows researchers to analyze gene expression in individual cells, its unique characteristics like over-dispersion, zero-inflation, high gene-gene correlation, and large data volume with many features pose challenges for most existing feature selection methods. In this paper, we present a feature selection method based on neural network (scFSNN) to solve classification problem for the scRNA-seq data. scFSNN is an embedded method that can automatically select features (genes) during model training, control the false discovery rate of selected features and adaptively determine the number of features to be eliminated. Extensive simulation and real data studies demonstrate its excellent feature selection ability and predictive performance.

Authors

  • Minjiao Peng
    College of Mathematics and Statistics, Institute of Statistical Sciences, Shenzhen Key Laboratory of Advanced Machine Learning and Applications, Shenzhen University, Shenzhen, China.
  • Baoqin Lin
    School of Chinese Materia Medica, Guangzhou University of Chinese Medicine, Guangzhou, China. Electronic address: linbaoqin@gzucm.edu.cn.
  • Jun Zhang
    First School of Clinical Medicine, Shandong University of Traditional Chinese Medicine, Jinan, China.
  • Yan Zhou
    Department of Computer Science, University of Texas at Dallas, Richardson, Texas 75080, United States.
  • Bingqing Lin
    School of Mathematical Sciences, Shenzhen University, Nanshan, Shenzhen, 518060, Guangdong, China. bqlin@szu.edu.cn.