Effects of data and entity ablation on multitask learning models for biomedical entity recognition.

Journal: Journal of Biomedical Informatics

Abstract

MOTIVATION: Training domain-specific named entity recognition (NER) models requires high-quality, hand-curated gold standard datasets, which are time-consuming and expensive to create. Furthermore, the storage and memory required to deploy NLP models can be prohibitive when the number of tasks is large. In this work, we explore multi-task learning as a way to reduce the amount of training data needed to train new domain-specific models. We evaluate our system on 22 distinct biomedical NER datasets and use two forms of ablation to assess the extent to which transfer learning improves task performance.
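As a rough illustration of the kind of multi-task setup the abstract describes (a shared encoder with one lightweight classification head per NER dataset), the sketch below shows a minimal PyTorch model. All class names, dimensions, and task names are illustrative assumptions and do not reflect the authors' actual implementation.

    # Minimal sketch of a multi-task NER model: a shared BiLSTM encoder with one
    # token-classification head per dataset. Names and sizes are assumptions.
    import torch
    import torch.nn as nn


    class MultiTaskNER(nn.Module):
        def __init__(self, vocab_size, num_labels_per_task, emb_dim=100, hidden_dim=128):
            super().__init__()
            # Parameters shared across all NER datasets.
            self.embedding = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
            self.encoder = nn.LSTM(emb_dim, hidden_dim, batch_first=True,
                                   bidirectional=True)
            # One task-specific output layer per dataset.
            self.heads = nn.ModuleDict({
                task: nn.Linear(2 * hidden_dim, n_labels)
                for task, n_labels in num_labels_per_task.items()
            })

        def forward(self, token_ids, task):
            emb = self.embedding(token_ids)    # (batch, seq, emb_dim)
            enc, _ = self.encoder(emb)         # (batch, seq, 2 * hidden_dim)
            return self.heads[task](enc)       # (batch, seq, n_labels)


    if __name__ == "__main__":
        # Two hypothetical biomedical NER tasks with different label sets.
        model = MultiTaskNER(vocab_size=5000,
                             num_labels_per_task={"chemicals": 3, "diseases": 5})
        tokens = torch.randint(1, 5000, (2, 10))   # fake batch of token ids
        logits = model(tokens, task="chemicals")
        print(logits.shape)                        # torch.Size([2, 10, 3])

In a setup like this, data ablation would correspond to shrinking the training set of one task while keeping the shared encoder trained on the others, and entity ablation to removing whole tasks (heads) from the mix.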

Authors

  • Nicholas E Rodriguez
    Department of Computer Science, Virginia Commonwealth University, Richmond 23284, USA.
  • Mai Nguyen
    San Diego Supercomputer Center, University of California, San Diego 92093, USA.
  • Bridget T McInnes
    Department of Computer Science, Virginia Commonwealth University, 401 S. Main St., Rm E4225, Richmond, VA 23284, USA. Electronic address: btmcinnes@vcu.edu.