Enhancing single-cell classification accuracy using image conversion and deep learning.

Journal: Yi chuan = Hereditas
PMID:

Abstract

Single-cell transcriptome sequencing (scRNA-seq) is widely used in the fields of animal and plant developmental biology and important trait analysis by obtaining single-cell transcript abundance data in high throughput, which can deeply reveal cell types, subtype composition, specific gene markers and functional differences. However, scRNA-seq data are often accompanied by problems such as high noise, high dimensionality and batch effect, resulting in a large number of low-expressed genes and variants, which seriously affect the accuracy and reliability of data analysis. This not only increases the complexity of data processing, but also limits the effectiveness of feature selection and downstream analysis. Although several statistical inference and machine learning methods have been used to address these challenges, the existing methods still have limitations in cell type identification, feature selection, and batch effect correction, which are difficult to meet the needs of complex biological research. In this study, we proposes an innovative single-cell classification method, scIC (single-cell image classification), which converts scRNA-seq data into image form and combines it with deep learning techniques for cell classification. Through this image conversion, we are able to capture complex patterns in the data more efficiently, and then construct efficient classification models using convolutional neural networks (CNN) and residual networks (ResNet). After testing scRNA-seq data from four cell types (mouse skin basal cells, mouse lymphocytes, human neuronal cells, and mouse spinal cord cells), the accuracy of the classification models exceeded 94%, with the mouse skin basal cell dataset achieving a classification accuracy of 99.8% when using the ResNet50 model. These results indicate that image transformation of scRNA-seq data and combining it with deep learning techniques can significantly improve the classification accuracy, providing new ideas and effective tools for solving key challenges in single-cell data analysis. The code for this study is publicly available at: https://github.com/Bingxi-Gao/SCImageClassify.

Authors

  • Bingxi Gao
    College of Animal Science and Technology, Yangtze University, Jingzhou 434025, China.
  • Huaxuan Wu
    School of Electrical and Electronic Engineering, Hubei University of Technology, Wuhan, China.
  • Zhiqiang Du
    Yangzhou University, School of Nursing, School of Public Health, Yangzhou, China.