Leveraging PubMed to Create a Specialty-Based Sense Inventory for Spanish Acronym Resolution.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Acronyms frequently occur in clinical text, which makes their identification, disambiguation and resolution an important task in clinical natural language processing. This paper contributes to acronym resolution in Spanish through the creation of a set of sense inventories organized by clinical specialty containing acronyms, their expansions, and corpus-driven features. The new acronym resource is composed of 51 clinical specialties with 3,603 acronyms in total, from which we identified 228 language independent acronyms and 391 language dependent expansions. We further analyzed the sense inventory across specialties and present novel insights of acronym usage in biomedical Spanish texts.

Authors

  • Alexandra Pomares-Quimbaya
    Pontificia Universidad Javeriana, Bogotá, Colombia.
  • Pilar López-Úbeda
    Universidad de Jaén, Jaén, Andalucía, Spain.
  • Michel Oleynik
    Institute of Mathematics and Statistics, University of São Paulo, São Paulo, Brazil.
  • Stefan Schulz
    Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria.