Incorporating linguistic knowledge for learning distributed word representations.

Journal: PLOS ONE
PMID:

Abstract

Combined with neural language models, distributed word representations achieve significant advantages in computational linguistics and text mining. Most existing models estimate distributed word vectors from large-scale data in an unsupervised fashion and therefore do not take rich linguistic knowledge into consideration. Linguistic knowledge can be represented as either link-based knowledge or preference-based knowledge, and we propose knowledge-regularized word representation models (KRWR) to incorporate this prior knowledge into the learning of distributed word representations. Experimental results demonstrate that our estimated word representations achieve better performance on the task of semantic relatedness ranking. This indicates that our methods can efficiently encode both prior knowledge from knowledge bases and statistical knowledge from large-scale text corpora into a unified word representation model, which will benefit many tasks in text mining.
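To give a concrete picture of what regularizing word vectors with prior knowledge can look like, the sketch below adds a link-based penalty to a single negative-sampling-style skip-gram update. This is an illustrative toy in NumPy, not the authors' KRWR objective: the vocabulary, the `links` list, the squared-distance penalty lambda * ||w_a - w_b||^2, and all hyperparameters are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy vocabulary and randomly initialized embeddings (dimension 8).
vocab = ["king", "queen", "man", "woman", "apple"]
idx = {w: i for i, w in enumerate(vocab)}
dim = 8
W = rng.normal(scale=0.1, size=(len(vocab), dim))   # word vectors
C = rng.normal(scale=0.1, size=(len(vocab), dim))   # context vectors

# Hypothetical link-based knowledge: word pairs connected in a knowledge base
# (e.g. synonym or related-entry links). Purely illustrative.
links = [("king", "queen"), ("man", "woman")]

def skipgram_loss_and_grad(w_i, c_j, label):
    """Negative-sampling style loss and gradient for one (word, context, label) triple."""
    score = W[w_i] @ C[c_j]
    p = 1.0 / (1.0 + np.exp(-score))          # sigmoid of the dot product
    loss = -np.log(p if label == 1 else 1.0 - p)
    grad_score = p - label                     # d loss / d score
    return loss, grad_score

def knowledge_regularizer(lmbda):
    """Penalty that pulls vectors of linked words together: lambda * ||w_a - w_b||^2."""
    loss = 0.0
    grad = np.zeros_like(W)
    for a, b in links:
        diff = W[idx[a]] - W[idx[b]]
        loss += lmbda * (diff @ diff)
        grad[idx[a]] += 2.0 * lmbda * diff
        grad[idx[b]] -= 2.0 * lmbda * diff
    return loss, grad

# One SGD step on a toy positive (word, context) pair plus the knowledge penalty.
lr, lmbda = 0.1, 0.01
w_i, c_j = idx["king"], idx["queen"]
corpus_loss, g = skipgram_loss_and_grad(w_i, c_j, label=1)
dW, dC = g * C[c_j], g * W[w_i]                # gradients from old parameter values
W[w_i] -= lr * dW
C[c_j] -= lr * dC
reg_loss, reg_grad = knowledge_regularizer(lmbda)
W -= lr * reg_grad
print(f"corpus loss = {corpus_loss:.4f}, knowledge penalty = {reg_loss:.4f}")
```

In a full training loop, the corpus-driven updates and the knowledge-driven penalty would be applied jointly over many epochs, so the final vectors reflect both co-occurrence statistics and the knowledge-base links; the specific weighting between the two terms here is purely hypothetical.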

Authors

  • Yan Wang
    College of Animal Science and Technology, Beijing University of Agriculture, Beijing, China.
  • Zhiyuan Liu
    State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing, China.
  • Maosong Sun
    State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing, China; Jiangsu Collaborative Innovation Center for Language Competence, Jiangsu, China.