Neural network based integration of assays to assess pathogenic potential.

Journal: Scientific reports
Published Date:

Abstract

Limited data significantly hinders our capability of biothreat assessment of novel bacterial strains. Integration of data from additional sources that can provide context about the strain can address this challenge. Datasets from different sources, however, are generated with a specific objective and which makes integration challenging. Here, we developed a deep learning-based approach called the neural network embedding model (NNEM) that integrates data from conventional assays designed to classify species with new assays that interrogate hallmarks of pathogenicity for biothreat assessment. We used a dataset of metabolic characteristics from a de-identified set of known bacterial strains that the Special Bacteriology Reference Laboratory (SBRL) of the Centers for Disease Control and Prevention (CDC) has curated for use in species identification. The NNEM transformed results from SBRL assays into vectors to supplement unrelated pathogenicity assays from de-identified microbes. The enrichment resulted in a significant improvement in accuracy of 9% for biothreat. Importantly, the dataset used in our analysis is large, but noisy. Therefore, the performance of our system is expected to improve as additional types of pathogenicity assays are developed and deployed. The proposed NNEM strategy thus provides a generalizable framework for enrichment of datasets with previously collected assays indicative of species.

Authors

  • Mohammed Eslami
    Data Science, Netrias, LLC, Annapolis, MD 21409, USA.
  • Yi-Pei Chen
    Netrias, LLC, Cambridge, MA 02142.
  • Ainsley C Nicholson
    Special Bacteriology Reference Laboratory, Bacterial Special Pathogens Branch, Division of High-Consequence Pathogens and Pathology, Centers for Disease Control and Prevention, Atlanta, GA, 30333, USA.
  • Mark Weston
    Data Science, Netrias, LLC, Annapolis, MD 21409, USA.
  • Melissa Bell
    Special Bacteriology Reference Laboratory, Bacterial Special Pathogens Branch, Division of High-Consequence Pathogens and Pathology, Centers for Disease Control and Prevention, Atlanta, GA, 30333, USA.
  • John R McQuiston
    Special Bacteriology Reference Laboratory, Bacterial Special Pathogens Branch, Division of High-Consequence Pathogens and Pathology, Centers for Disease Control and Prevention, Atlanta, GA, 30333, USA.
  • James Samuel
    Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX, 77807, USA.
  • Erin J van Schaik
    Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX 77807.
  • Paul de Figueiredo
    Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX 77807.