Unsupervised concept extraction from clinical text through semantic composition.

Journal: Journal of Biomedical Informatics

Published Date: Feb 10, 2019

Abstract

Concept extraction is an important step in clinical natural language processing. Once extracted, concepts can improve the accuracy and generalization of downstream systems. We present a new unsupervised system for extracting concepts from clinical text. The system creates representations of concepts from the Unified Medical Language System (UMLS®) by combining natural language descriptions of concepts with word representations and composing these into higher-order concept vectors. These concept vectors are then used to assign labels to candidate phrases, which are extracted using a syntactic chunker. Our approach achieves an exact F-score of 0.32 and an inexact F-score of 0.45 on the well-known i2b2-2010 challenge corpus, outperforming the only other unsupervised concept extraction method. Because our approach relies only on word representations and a chunker, it is completely unsupervised and can therefore be applied to languages and corpora for which no prior annotations exist. All our code is open-source and can be found at www.github.com/clips/conch.
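The pipeline described in the abstract (compose word representations into concept vectors, then label candidate phrases by comparing them to those vectors) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy word vectors, the concept descriptions, the use of averaging as the composition function, cosine similarity as the comparison, and the names `compose`, `build_concept_vectors`, and `label_phrase` are all assumptions made for illustration. The three labels mirror the concept types commonly associated with the i2b2-2010 task, and the candidate phrases are assumed to come from an external chunker.

```python
import math

# Toy 3-dimensional word vectors (hypothetical values, illustration only).
WORD_VECTORS = {
    "chest": [0.9, 0.1, 0.0],
    "pain": [0.8, 0.2, 0.1],
    "ache": [0.7, 0.3, 0.0],
    "aspirin": [0.0, 0.9, 0.1],
    "tablet": [0.1, 0.8, 0.2],
    "x-ray": [0.1, 0.1, 0.9],
    "scan": [0.0, 0.2, 0.8],
}

# Hypothetical natural language descriptions per concept label,
# standing in for descriptions drawn from a resource such as UMLS.
CONCEPT_DESCRIPTIONS = {
    "problem": ["chest pain", "ache"],
    "treatment": ["aspirin tablet"],
    "test": ["x-ray scan"],
}


def compose(words):
    """Average the vectors of known words (assumed composition function).

    Returns None when no word has a vector.
    """
    known = [WORD_VECTORS[w] for w in words if w in WORD_VECTORS]
    if not known:
        return None
    return [sum(dim) / len(known) for dim in zip(*known)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def build_concept_vectors(descriptions):
    """Compose one vector per label from all words in its descriptions."""
    vectors = {}
    for label, texts in descriptions.items():
        words = [w for text in texts for w in text.lower().split()]
        vec = compose(words)
        if vec is not None:
            vectors[label] = vec
    return vectors


def label_phrase(phrase, concept_vectors):
    """Assign the label whose concept vector is most similar to the phrase."""
    vec = compose(phrase.lower().split())
    if vec is None:
        return None  # no known words, so no label can be assigned
    return max(concept_vectors, key=lambda lab: cosine(vec, concept_vectors[lab]))


concept_vectors = build_concept_vectors(CONCEPT_DESCRIPTIONS)

# Candidate phrases would normally come from a syntactic chunker.
for candidate in ["chest ache", "aspirin", "scan"]:
    print(candidate, "->", label_phrase(candidate, concept_vectors))
```

No annotated training data is involved at any step in this sketch, which is the property the abstract emphasizes: only word vectors, concept descriptions, and a source of candidate phrases are required.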

Authors

Stéphan Tulkens

Computational Linguistics and Psycholinguistics (CLiPS) Research Center, University of Antwerp, Prinsstraat 13, 2000 Antwerp, Belgium. Electronic address: stephan.tulkens@uantwerpen.be.

Simon Šuster

Computational Linguistics and Psycholinguistics (CLiPS) Research Center, University of Antwerp, Prinsstraat 13, 2000 Antwerp, Belgium. Electronic address: simon.suster@uantwerpen.be.

Walter Daelemans

University of Antwerp, Computational Linguistics and Psycholinguistics (CLiPS) Research Center, Lange Winkelstraat 40-42, B-2000 Antwerp, Belgium.

Keywords

Semantics; Unified Medical Language System; Unsupervised Machine Learning

External Resources

PubMed ID: 30753949
