Cognitive analysis of metabolomics data for systems biology.

Journal: Nature protocols
PMID:

Abstract

Cognitive computing is revolutionizing the way big data are processed and integrated, with artificial intelligence (AI)-based natural language processing (NLP) platforms helping researchers to efficiently search and digest the vast scientific literature. Most available platforms have been developed for biomedical researchers, but new NLP tools are emerging for biologists in other fields, and metabolomics is an important example. NLP provides literature-based contextualization of metabolic features, which decreases the time and expert-level subject knowledge required during the prioritization, identification and interpretation steps of the metabolomics data analysis pipeline. Here, we describe and demonstrate four workflows that combine metabolomics data with NLP-based literature searches of scientific databases to aid in the analysis of metabolomics data and their biological interpretation. The four procedures can be used in isolation or consecutively, depending on the research question. The first, used for initial metabolite annotation and prioritization, creates a list of metabolites that merit follow-up. The second finds literature evidence of the activity of metabolites and metabolic pathways in governing the biological condition at a systems biology level. The third is used to identify candidate biomarkers, and the fourth looks for metabolic conditions or drug-repurposing targets that two diseases have in common. The protocol takes 1-4 h or more to complete, depending on the processing time of the software used.

Authors

  • Erica L-W Majumder
    Center for Mass Spectrometry and Metabolomics, The Scripps Research Institute, La Jolla, CA, USA.
  • Elizabeth M Billings
    Center for Mass Spectrometry and Metabolomics, The Scripps Research Institute, La Jolla, CA, USA.
  • H Paul Benton
  • Richard L Martin
    IBM Almaden Research Lab, 650 Harry Road, San Jose, California 95120, United States.
  • Amelia Palermo
    Center for Mass Spectrometry and Metabolomics, The Scripps Research Institute, La Jolla, CA, USA.
  • Carlos Guijas
    Scripps Center for Metabolomics, The Scripps Research Institute, La Jolla, CA, USA.
  • Markus M Rinschen
    Center for Mass Spectrometry and Metabolomics, The Scripps Research Institute, La Jolla, CA, USA.
  • Xavier Domingo-Almenara
  • J Rafael Montenegro-Burke
  • Bradley A Tagtow
    IBM Watson Health, Cambridge, MA, USA.
  • Robert S Plumb
    Waters Corporation, Milford, MA, USA.
  • Gary Siuzdak