A comparison of word embeddings for the biomedical natural language processing.
Journal:
Journal of biomedical informatics
Published Date:
Nov 1, 2018
Abstract
BACKGROUND: Word embeddings have been prevalently used in biomedical Natural Language Processing (NLP) applications due to the ability of the vector representations being able to capture useful semantic properties and linguistic relationships between words. Different textual resources (e.g., Wikipedia and biomedical literature corpus) have been utilized in biomedical NLP to train word embeddings and these word embeddings have been commonly leveraged as feature input to downstream machine learning models. However, there has been little work on evaluating the word embeddings trained from different textual resources.
Authors
Keywords
Adolescent
Adult
Aged
Electronic Health Records
Family Health
Female
Humans
Information Storage and Retrieval
Linguistics
Machine Learning
Male
Medical Informatics
Middle Aged
Minnesota
Models, Statistical
Natural Language Processing
Probability
PubMed
Semantics
Unified Medical Language System
Young Adult