A transfer learning model with multi-source domains for biomedical event trigger extraction.

Journal: BMC genomics
Published Date:

Abstract

BACKGROUND: Automatic extraction of biomedical events from literature, that allows for faster update of the latest discoveries automatically, is a heated research topic now. Trigger word recognition is a critical step in the process of event extraction. Its performance directly influences the results of the event extraction. In general, machine learning-based trigger recognition approaches such as neural networks must to be trained on a dataset with plentiful annotations to achieve high performances. However, the problem of the datasets in wide coverage event domains is that their annotations are insufficient and imbalance. One of the methods widely used to deal with this problem is transfer learning. In this work, we aim to extend the transfer learning to utilize multiple source domains. Multiple source domain datasets can be jointly trained to help achieve a higher recognition performance on a target domain with wide coverage events.

Authors

  • Yifei Chen
    Department of Computer Science and Center for Complex Biological Systems, University of California, Irvine, CA 92697, USA.