Automatically Expanding the Synonym Set of SNOMED CT using Wikipedia.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Clinical terminologies and ontologies are often used in natural language processing/understanding tasks as a method for semantically tagging text. One ontology commonly used for this task is SNOMED CT. Natural language is rich and varied: many different combinations of words may be used to express the same idea. It is therefore essential that ontologies and terminologies have a rich set of synonyms. One source of synonyms is Wikipedia. We examine methods for aligning concepts in SNOMED CT with articles in Wikipedia so that newly-found synonyms may be added to SNOMED CT. Our experiments show promising results and provide guidance to researchers who wish to use Wikipedia for similar tasks.

Authors

  • Daniel R Schlegel
    Department of Biomedical Informatics, University at Buffalo, SUNY, Buffalo, NY, USA.
  • Chris Crowner
    Department of Biomedical Informatics, University at Buffalo, SUNY, Buffalo, NY, USA.
  • Peter L Elkin
    Department of Biomedical Informatics, University at Buffalo, Buffalo, NY.