Towards reliable named entity recognition in the biomedical domain.

Journal: Bioinformatics (Oxford, England)

Published Date: Jan 1, 2020

Abstract

MOTIVATION: Automatic biomedical named entity recognition (BioNER) is a key task in biomedical information extraction. For some time, state-of-the-art BioNER has been dominated by machine learning methods, particularly conditional random fields (CRFs), with a recent focus on deep learning. However, recent work has suggested that the high performance of CRFs for BioNER may not generalize to corpora other than the one it was trained on. In our analysis, we find that a popular deep learning-based approach to BioNER, known as bidirectional long short-term memory network-conditional random field (BiLSTM-CRF), is correspondingly poor at generalizing. To address this, we evaluate three modifications of BiLSTM-CRF for BioNER to improve generalization: improved regularization via variational dropout, transfer learning and multi-task learning.

Authors

John M Giorgi

Department of Computer Science, University of Toronto, Toronto, Canada.
Gary D Bader

The Donnelly Centre, University of Toronto, Toronto, Canada.

Keywords

Computational Biology Deep Learning Information Storage and Retrieval Machine Learning Models, Genetic Software

External Resources

View on PubMed Access via DOI PubMed (31218364)

Towards reliable named entity recognition in the biomedical domain.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals