A practical guide to FAIR data management in the age of multi-OMICS and AI.

Journal: Frontiers in immunology
PMID:

Abstract

Multi-cellular biological systems, including the immune system, are highly complex, dynamic, and adaptable. Systems biologists aim to understand such complexity at a quantitative level. However, these ambitious efforts are often limited by access to a variety of high-density intra-, extra- and multi-cellular measurements resolved in time and space and across a variety of perturbations. The advent of automation, OMICs and single-cell technologies now allows high dimensional multi-modal data acquisition from the same biological samples multiplexed at scale (multi-OMICs). As a result, systems biologists -theoretically- have access to more data than ever. However, the mathematical frameworks and computational tools needed to analyze and interpret such data are often still nascent, limiting the biological insights that can be obtained without years of computational method development and validation. More pressingly, much of the data sits in silos in formats that are incomprehensible to other scientists or machines limiting its value to the vaster scientific community, especially the computational biologists tasked with analyzing these vast amounts of data in more nuanced ways. With the rapid development and increasing interest in using artificial intelligence (AI) for the life sciences, improving how biologic data is organized and shared is more pressing than ever for scientific progress. Here, we outline a practical approach to multi-modal data management and FAIR sharing, which are in line with the latest US and EU funders' data sharing policies. This framework can help extend the longevity and utility of data by allowing facile use and reuse, accelerating scientific discovery in the biomedical sciences.

Authors

  • Douaa Mugahid
    Department of Immunology and Infectious Diseases, T.H. Chan School of Public Health, Harvard University, Boston, MA, United States.
  • Jared Lyon
    BioMicro Center, Massachusetts Institute of Technology, Cambridge, MA, United States.
  • Charlie Demurjian
    BioMicro Center, Massachusetts Institute of Technology, Cambridge, MA, United States.
  • Nathan Eolin
    BioMicro Center, Massachusetts Institute of Technology, Cambridge, MA, United States.
  • Charlie Whittaker
    BioMicro Center, Massachusetts Institute of Technology, Cambridge, MA, United States.
  • Mark Godek
    Ragon Institute of Massachusetts General Hospital (MGH), Massachusetts Institute of Technology (MIT), and Harvard, Cambridge, MA, United States.
  • Douglas Lauffenburger
    Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, United States.
  • Sarah Fortune
    Department of Immunology and Infectious Diseases, T.H. Chan School of Public Health, Harvard University, Boston, MA, United States.
  • Stuart Levine
    BioMicro Center, Massachusetts Institute of Technology, Cambridge, MA, United States.