Extracting Drug-Protein Relation from Literature Using Ensembles of Biomedical Transformers.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Automatic extraction of relations between drugs/chemicals and proteins from ever-growing biomedical literature is required to build up-to-date knowledge bases in biomedicine. To promote the development of automated methods, BioCreative-VII organized a shared task - the DrugProt track, to recognize drug-protein entity relations from PubMed abstracts. We participated in the shared task and leveraged deep learning-based transformer models pre-trained on biomedical data to build ensemble approaches to automatically extract drug-protein relation from biomedical literature. On the main corpora of 10,750 abstracts, our best system obtained an F1-score of 77.60% (ranked 4th among 30 participating teams), and on the large-scale corpus of 2.4M documents, our system achieved micro-averaged F1-score of 77.32% (ranked 2nd among 9 system submissions). This demonstrates the effectiveness of domain-specific transformer models and ensemble approaches for automatic relation extraction from biomedical literature.

Authors

  • Avisha Das
    Arizona Advanced AI & Innovation (A3I) Hub, Mayo Clinic Arizona, Phoenix, AZ, USA.
  • Zhao Li
    Research Center for Data Hub and Security, Zhejiang Lab, Hangzhou, China. lzjoey@gmail.com.
  • Qiang Wei
    School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.
  • Jianfu Li
    Mayo Clinic.
  • Liang-Chin Huang
    School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA.
  • Yan Hu
    Department of Thoracic Surgery, The Second Xiangya Hospital of Central South University, Changsha, Hunan, China.
  • Rongbin Li
    School of Biomedical Informatics, University of Texas Health Science Center at Houston; Yale University; Melax Technologies, Houston.
  • Wenjin Jim Zheng
    School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.
  • Hua Xu
    Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.