Prediction and Interpretable Visualization of Retrosynthetic Reactions Using Graph Convolutional Networks.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Recently, many research groups have been addressing data-driven approaches for (retro)synthetic reaction prediction and retrosynthetic analysis. Although the performances of the data-driven approach have progressed because of recent advances of machine learning and deep learning techniques, problems such as improving capability of reaction prediction and the black-box problem of neural networks persist for practical use by chemists. To spread data-driven approaches to chemists, we focused on two challenges: improvement of retrosynthetic reaction prediction and interpretability of the prediction. In this paper, we propose an interpretable prediction framework using graph convolutional networks (GCN) for retrosynthetic reaction prediction and integrated gradients (IG) for visualization of contributions to the prediction to address these challenges. As a result, from the viewpoint of balanced accuracies, our model showed better performances than the approach using an extended-connectivity fingerprint. Furthermore, IG-based visualization of the GCN prediction successfully highlighted reaction-related atoms.

Authors

  • Shoichi Ishida
    Graduate School of Medical Life Science, Yokohama City University, Yokohama, Kanagawa 230-0045, Japan.
  • Kei Terayama
    Graduate School of Medical Life Science, Yokohama City University, Yokohama, Kanagawa 230-0045, Japan.
  • Ryosuke Kojima
    Department of Biomedical Data Intelligence, Kyoto University Graduate School of Medicine, Sakyo-ku, Kyoto, Kyoto, Japan.
  • Kiyosei Takasu
    Graduate School of Pharmaceutical Sciences , Kyoto University , Yoshida, Sakyo-ku, Kyoto 606-8501 , Japan.
  • Yasushi Okuno
    Graduate School of Medicine, Kyoto University, Shogoin-kawaharacho, city/>Sakyo-ku Kyoto, 606-8507, Japan.