Extracting chemical-protein relations using attention-based neural networks.

Journal: Database : the journal of biological databases and curation
Published Date:

Abstract

Relation extraction is an important task in the field of natural language processing. In this paper, we describe our approach for the BioCreative VI Task 5: text mining chemical-protein interactions. We investigate multiple deep neural network (DNN) models, including convolutional neural networks, recurrent neural networks (RNNs) and attention-based (ATT-) RNNs (ATT-RNNs) to extract chemical-protein relations. Our experimental results indicate that ATT-RNN models outperform the same models without using attention and the ATT-gated recurrent unit (ATT-GRU) achieves the best performing micro average F1 score of 0.527 on the test set among the tested DNNs. In addition, the result of word-level attention weights also shows that attention mechanism is effective on selecting the most important trigger words when trained with semantic relation labels without the need of semantic parsing and feature engineering. The source code of this work is available at https://github.com/ohnlp/att-chemprot.

Authors

  • Sijia Liu
    These authors contributed equally to this study and Dr. Li is now working at IBM; Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA; Department of Computer Science and Engineering, University at Buffalo, Buffalo, NY, USA.
  • Feichen Shen
    Department of Health Sciences Research, Rochester MN.
  • Ravikumar Komandur Elayavilli
    Mayo Clinic, Rochester, MN, USA.
  • Yanshan Wang
    Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.
  • Majid Rastegar-Mojarad
    Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.
  • Vipin Chaudhary
    Department of Computer Science and Engineering, University at Buffalo, Buffalo, NY, USA.
  • Hongfang Liu
    Department of Artificial Intelligence & Informatics, Mayo Clinic, Rochester, MN, United States.