Perceptual and technical barriers in sharing and formatting metadata accompanying omics studies.

Journal: Cell genomics
Published Date:

Abstract

Metadata, or "data about data," is essential for organizing, understanding, and managing large-scale omics datasets. It enhances data discovery, integration, and interpretation, enabling reproducibility, reusability, and secondary analysis. However, metadata sharing remains hindered by perceptual and technical barriers, including the lack of uniform standards, privacy concerns, study design limitations, insufficient incentives, inadequate infrastructure, and a shortage of trained personnel. These challenges compromise data reliability and obstruct integrative meta-analyses. Addressing these issues requires standardization, education, stronger roles for journals and funding agencies, and improved incentives and infrastructure. Looking ahead, emerging technologies such as artificial intelligence and machine learning may offer promising solutions to automate metadata processes, increasing accuracy and scalability. Fostering a collaborative culture of metadata sharing will maximize the value of omics data, accelerating innovation and scientific discovery.

Authors

  • Yu-Ning Huang
    Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA.
  • Viorel Munteanu
    Department of Computers, Informatics, and Microelectronics, Technical University of Moldova, 2045 Chisinau, Moldova; Department of Biological and Morphofunctional Sciences, College of Medicine and Biological Sciences, Stefan cel Mare University of Suceava, 720229 Suceava, Romania.
  • Michael I Love
    Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA; Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA.
  • Cynthia Flaire Ronkowski
    Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA.
  • Dhrithi Deshpande
    Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA.
  • Annie Wong-Beringer
    Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
  • Russell Corbett-Detig
    Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
  • Mihai Dimian
    Department of Computers, Electronics, and Automation, Stefan cel Mare University of Suceava, 720229 Suceava, Romania.
  • Jason H Moore
    University of Pennsylvania, Philadelphia, PA, USA.
  • Lana X Garmire
    Epidemiology Program, University of Hawaii Cancer Center, Honolulu, Hawaii. lgarmire@cc.hawaii.edu.
  • T B K Reddy
    Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, California, USA.
  • Atul J Butte
    Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, CA.
  • Mark D Robinson
    Institute of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland.
  • Eleazar Eskin
    1 Department of Computer Science, University of California, Los Angeles, California.
  • Malak S Abedalthagafi
    Department of Pathology and Laboratory Medicine, Emory University Hospital, Atlanta, GA, USA.
  • Serghei Mangul
    Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA; Department of Computers, Informatics, and Microelectronics, Technical University of Moldova, 2045 Chisinau, Moldova; Department of Biological and Morphofunctional Sciences, College of Medicine and Biological Sciences, Stefan cel Mare University of Suceava, 720229 Suceava, Romania; Sage Bionetworks, Seattle, WA, USA. Electronic address: serghei.mangul@gmail.com.