Cross-lingual semantic annotation of biomedical literature: experiments in Spanish and English.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: Biomedical literature is one of the most relevant sources of information for knowledge mining in the field of Bioinformatics. Although English is the most widely addressed language in the field, in recent years there has been growing interest from the natural language processing community in dealing with languages other than English. However, the availability of language resources and tools for the appropriate treatment of non-English texts is lagging behind. Our research addresses the semantic annotation of biomedical texts in Spanish, which can be considered an under-resourced language where biomedical text processing is concerned.

Authors

  • Naiara Perez
    Vicomtech, SNLT, Donostia 20009, Spain.
  • Pablo Accuosto
    Barcelona, TALN/DTIC, Universitat Pompeu Fabra, Barcelona 08018, Spain.
  • Àlex Bravo
    Research Programme on Biomedical Informatics (GRIB), IMIM-Universitat Pompeu Fabra, Barcelona, Spain.
  • Montse Cuadros
    Vicomtech, SNLT, Donostia 20009, Spain.
  • Eva Martínez-Garcia
    Vicomtech, SNLT, Donostia 20009, Spain.
  • Horacio Saggion
    Barcelona, TALN/DTIC, Universitat Pompeu Fabra, Barcelona 08018, Spain.
  • German Rigau
    IXA Group, HiTZ Centre, University of the Basque Country UPV/EHU, Donostia 20018, Spain.