BioADAPT-MRC: adversarial learning-based domain adaptation improves biomedical machine reading comprehension task.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: Biomedical machine reading comprehension (biomedical-MRC) aims to comprehend complex biomedical narratives and assist healthcare professionals in retrieving information from them. The high performance of modern neural network-based MRC systems depends on high-quality, large-scale, human-annotated training datasets. In the biomedical domain, a crucial challenge in creating such datasets is the requirement for domain knowledge, inducing the scarcity of labeled data and the need for transfer learning from the labeled general-purpose (source) domain to the biomedical (target) domain. However, there is a discrepancy in marginal distributions between the general-purpose and biomedical domains due to the variances in topics. Therefore, direct-transferring of learned representations from a model trained on a general-purpose domain to the biomedical domain can hurt the model's performance.

Authors

  • Maria Mahbub
    Cyber Resilience and Intelligence Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States of America.
  • Sudarshan Srinivasan
    Cyber Resilience and Intelligence Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States of America.
  • Edmon Begoli
    University of Tennessee, Knoxville, TN, USA; Oak Ridge National Laboratory, Knoxville, TN, USA.
  • Gregory D Peterson
    Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN, United States of America.