Drug knowledge discovery via multi-task learning and pre-trained models.

Journal: BMC medical informatics and decision making
Published Date:

Abstract

BACKGROUND: Drug repurposing is to find new indications of approved drugs, which is essential for investigating new uses for approved or investigational drug efficiency. The active gene annotation corpus (named AGAC) is annotated by human experts, which was developed to support knowledge discovery for drug repurposing. The AGAC track of the BioNLP Open Shared Tasks using this corpus is organized by EMNLP-BioNLP 2019, where the "Selective annotation" attribution makes AGAC track more challenging than other traditional sequence labeling tasks. In this work, we show our methods for trigger word detection (Task 1) and its thematic role identification (Task 2) in the AGAC track. As a step forward to drug repurposing research, our work can also be applied to large-scale automatic extraction of medical text knowledge.

Authors

  • Dongfang Li
    Fuzhou university, Fuzhou, Fujian, China; Beijing Institute of Technology, Beijing, China. Electronic address: 188377985@qq.com.
  • Ying Xiong
    Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China.
  • Baotian Hu
    Harbin Institute of Technology (Shenzhen), Shenzhen, China. hubaotian@hit.edu.cn.
  • Buzhou Tang
  • Weihua Peng
    Baidu, International Technology (Shenzhen) Co., Ltd, Shenzhen, China.
  • Qingcai Chen
    Key Laboratory of Network Oriented Intelligent Computation, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong, China.