Minimalistic Approach to Coreference Resolution in Lithuanian Medical Records.

Journal: Computational and mathematical methods in medicine
PMID:

Abstract

Coreference resolution is a challenging part of natural language processing (NLP) with applications in machine translation, semantic search and other information retrieval, and decision support systems. Coreference resolution requires linguistic preprocessing and rich language resources for automatically identifying and resolving such expressions. Many rarer and under-resourced languages (such as Lithuanian) lack the required language resources and tools. We present a method for coreference resolution in Lithuanian language and its application for processing e-health records from a hospital reception. Our novelty is the ability to process coreferences with minimal linguistic resources, which is important in linguistic applications for rare and endangered languages. The experimental results show that coreference resolution is applicable to the development of NLP-powered online healthcare services in Lithuania.

Authors

  • Voldemaras Žitkus
    Faculty of Informatics, Kaunas University of Technology, 51386 Kaunas, Lithuania.
  • Rita Butkienė
    Faculty of Informatics, Kaunas University of Technology, 51386 Kaunas, Lithuania.
  • Rimantas Butleris
    Faculty of Informatics, Kaunas University of Technology, 51386 Kaunas, Lithuania.
  • Rytis Maskeliūnas
    Department of Multimedia Engineering, Kaunas University of Technology, Kaunas, Lithuania.
  • Robertas Damaševičius
    Faculty of Applied Mathematics, Silesian University of Technology, Gliwice, Poland.
  • Marcin Wozniak
    Faculty of Applied Mathematics, Silesian University of Technology, 44-100 Gliwice, Poland.