Improving neural protein-protein interaction extraction with knowledge selection.

Journal: Computational biology and chemistry
Published Date:

Abstract

Protein-protein interaction (PPI) extraction from published scientific literature provides additional support for precision medicine efforts. Meanwhile, knowledge bases (KBs) contain huge amounts of structured information of protein entities and their relations, which can be encoded in entity and relation embeddings to help PPI extraction. However, the prior knowledge of protein-protein pairs must be selectively used so that it is suitable for different contexts. This paper proposes a Knowledge Selection Model (KSM) to fuse the selected prior knowledge and context information for PPI extraction. Firstly, two Transformers encode the context sequence of a protein pair according to each protein embedding, respectively. Then, the two outputs are fed to a mutual attention to capture the important context features towards the protein pair. Next, the context features are used to distill the relation embedding by a knowledge selector. Finally, the selected relation embedding and the context features are concatenated for PPI extraction. Experiments on the BioCreative VI PPI dataset show that KSM achieves a new state-of-the-art performance (38.08 % F1-score) by adding knowledge selection.

Authors

  • Huiwei Zhou
    School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning, China.
  • Xuefei Li
    Wuhan National High Magnetic Field Center, Huazhong University of Science and Technology, Wuhan, China.
  • Weihong Yao
    School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, Liaoning, China. Electronic address: weihongy@dlut.edu.cn.
  • Zhuang Liu
    Department of Radiology, Fudan University Shanghai Cancer Center, Shanghai, 200032, China.
  • Shixian Ning
    School of Computer Science and Technology, Dalian University of Technology, Chuangxinyuan Building, No. 2 Linggong Road, Ganjingzi District, Dalian, Liaoning, China.
  • Chengkun Lang
    School of Computer Science and Technology, Dalian University of Technology, Chuangxinyuan Building, No. 2 Linggong Road, Ganjingzi District, Dalian, Liaoning, China.
  • Lei Du
    School of Mathematical Sciences, Dalian University of Technology, Chuangxinyuan Building, No.2 Linggong Road, Ganjingzi District, Dalian, 116024, Liaoning, China.