Drug target prediction through deep learning functional representation of gene signatures.

Journal: Nature communications
Published Date:

Abstract

Many machine learning applications in bioinformatics currently rely on matching gene identities when analyzing input gene signatures and fail to take advantage of preexisting knowledge about gene functions. To further enable comparative analysis of OMICS datasets, including target deconvolution and mechanism of action studies, we develop an approach that represents gene signatures projected onto their biological functions, instead of their identities, similar to how the word2vec technique works in natural language processing. We develop the Functional Representation of Gene Signatures (FRoGS) approach by training a deep learning model and demonstrate that its application to the Broad Institute's L1000 datasets results in more effective compound-target predictions than models based on gene identities alone. By integrating additional pharmacological activity data sources, FRoGS significantly increases the number of high-quality compound-target predictions relative to existing approaches, many of which are supported by in silico and/or experimental evidence. These results underscore the general utility of FRoGS in machine learning-based bioinformatics applications. Prediction networks pre-equipped with the knowledge of gene functions may help uncover new relationships among gene signatures acquired by large-scale OMICs studies on compounds, cell types, disease models, and patient cohorts.

Authors

  • Hao Chen
    The First School of Medicine, Wenzhou Medical University, Wenzhou, China.
  • Frederick J King
    Novartis Institutes for BioMedical Research, San Diego, California 92121, United States.
  • Bin Zhou
    Department of Oral and Maxillofacial Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.
  • Yu Wang
    Clinical and Technical Support, Philips Healthcare, Shanghai, China.
  • Carter J Canedy
    Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA.
  • Joel Hayashi
    Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA.
  • Yang Zhong
    Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai, 200032, China.
  • Max W Chang
    Department of Medicine, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.
  • Lars Pache
    NCI Designated Cancer Center, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, 92037, USA.
  • Julian L Wong
    Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA.
  • Yong Jia
    Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA.
  • John Joslin
    Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA.
  • Tao Jiang
    Department of Respiratory and Critical Care Medicine, Center for Respiratory Medicine, the Fourth Affiliated Hospital of School of Medicine, and International School of Medicine, International Institutes of Medicine, Zhejiang University, Yiwu, China.
  • Christopher Benner
    Department of Medicine, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.
  • Sumit K Chanda
    Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA.
  • Yingyao Zhou
    Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA. yingyao.zhou@novartis.com.