Cimind: A phonetic-based tool for multilingual named entity recognition in biomedical texts.

Journal: Journal of biomedical informatics
Published Date:

Abstract

BACKGROUND: Extracting concepts from biomedical texts is a key to support many advanced applications such as biomedical information retrieval. However, in clinical notes Named Entity Recognition (NER) has to deal with various types of errors such as spelling errors, grammatical errors, truncated sentences, and non-standard abbreviations. Moreover, in numerous countries, NER is challenged by the availability of many resources originally developed and only suitable for English texts. This paper presents the Cimind system, a multilingual system dedicated to named entity recognition in medical texts based on a phonetic similarity measure.

Authors

  • ChloĆ© Cabot
    SIBM, Rouen University Hospital & TIBS, LITIS EA 4108, Rouen, France.
  • Stefan Darmoni
    Department of Biomedical Informatics, Rouen University Hospital, TIBS, LITIS EA 4108 Rouen University, France.
  • Lina F Soualmia
    SIBM, Rouen University Hospital & TIBS, LITIS EA 4108, Rouen, France.