A Convolutional Neural Network-Based Approach for the Rapid Annotation of Molecularly Diverse Natural Products.

Journal: Journal of the American Chemical Society
Published Date:

Abstract

This report describes the first application of the novel NMR-based machine learning tool "Small Molecule Accurate Recognition Technology" (SMART 2.0) for mixture analysis and subsequent accelerated discovery and characterization of new natural products. The concept was applied to the extract of a filamentous marine cyanobacterium known to be a prolific producer of cytotoxic natural products. This environmental extract was roughly fractionated, and then prioritized and guided by cancer cell cytotoxicity, NMR-based SMART 2.0, and MS-based molecular networking. This led to the isolation and rapid identification of a new chimeric swinholide-like macrolide, symplocolide A, as well as the annotation of swinholide A, samholides A-I, and several new derivatives. The planar structure of symplocolide A was confirmed to be a structural hybrid between swinholide A and luminaolide B by 1D/2D NMR and LC-MS analysis. A second example applies SMART 2.0 to the characterization of structurally novel cyclic peptides, and compares this approach to the recently appearing "atomic sort" method. This study exemplifies the revolutionary potential of combined traditional and deep learning-assisted analytical approaches to overcome longstanding challenges in natural products drug discovery.

Authors

  • Raphael Reher
    Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
  • Hyun Woo Kim
    Chemical Data-Driven Research Center, Korea Research Institute of Chemical Technology (KRICT), Daejeon 34114, Korea.
  • Chen Zhang
    Department of Dermatology, Affiliated Jinling Hospital, Medical School of Nanjing University, Nanjing, China.
  • Huanru Henry Mao
    Department of Computer Science and Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
  • Mingxun Wang
    Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
  • Louis-Félix Nothias
    Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
  • Andres Mauricio Caraballo-Rodriguez
    Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
  • Evgenia Glukhov
    Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
  • Bahar Teke
    Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
  • Tiago Leao
    Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
  • Kelsey L Alexander
    Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
  • Brendan M Duggan
    Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California, 92093, United States of America.
  • Ezra L Van Everbroeck
    Director's Office, Scripps Institution of Oceanography, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
  • Pieter C Dorrestein
    Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA 92093; Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, La Jolla, CA 92037. pdorrestein@ucsd.edu.
  • Garrison W Cottrell
    Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA. gary@ucsd.edu. http://cseweb.ucsd.edu/~gary/
  • William H Gerwick
    Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, La Jolla, California, 92037, United States of America. wgerwick@ucsd.edu.