BEENE: deep learning-based nonlinear embedding improves batch effect estimation.

Journal: Bioinformatics (Oxford, England)
PMID:

Abstract

MOTIVATION: Analyzing large-scale single-cell transcriptomic datasets generated using different technologies is challenging due to the presence of batch-specific systematic variations known as batch effects. Since biological and technological differences are often interspersed, detecting and accounting for batch effects in RNA-seq datasets are critical for effective data integration and interpretation. Low-dimensional embeddings, such as principal component analysis (PCA) are widely used in visual inspection and estimation of batch effects. Linear dimensionality reduction methods like PCA are effective in assessing the presence of batch effects, especially when batch effects exhibit linear patterns. However, batch effects are inherently complex and existing linear dimensionality reduction methods could be inadequate and imprecise in the presence of sophisticated nonlinear batch effects.

Authors

  • Md Ashiqur Rahman
    Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh.
  • Abdullah Aman Tutul
    Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh.
  • Mahfuza Sharmin
    Department of Genetics, Stanford University, Stanford, CA 94305, United States.
  • Md Shamsuzzoha Bayzid
    Department of Computer Science and Engineering, Bangladesh University of Engineering & Technology, Dhaka, Bangladesh.