Are batch effects still relevant in the age of big data?

Journal: Trends in biotechnology
Published Date:

Abstract

Batch effects (BEs) are technical biases that may confound analysis of high-throughput biotechnological data. BEs are complex and effective mitigation is highly context-dependent. In particular, the advent of high-resolution technologies such as single-cell RNA sequencing presents new challenges. We first cover how BE modeling differs between traditional datasets and the new data landscape. We also discuss new approaches for measuring and mitigating BEs, including whether a BE is significant enough to warrant correction. Even with the advent of machine learning and artificial intelligence, the increased complexity of next-generation biotechnological data means increased complexities in BE management. We forecast that BEs will not only remain relevant in the age of big data but will become even more important.

Authors

  • Wilson Wen Bin Goh
    School of Biological Sciences, Nanyang Technological University, Singapore 637551, Republic of Singapore. Electronic address: wilsongoh@ntu.edu.sg.
  • Chern Han Yong
    Department of Computer Science, National University of Singapore, 117417, Singapore. Electronic address: dcshan@nus.edu.sg.
  • Limsoon Wong
    Department of Computer Science, National University of Singapore, Singapore; Department of Pathology, National University of Singapore, Singapore.