Leveraging large language models to mimic domain expert labeling in unstructured text-based electronic healthcare records in non-english languages.

Journal: BMC medical informatics and decision making
PMID:

Abstract

BACKGROUND: The integration of big data and artificial intelligence (AI) in healthcare, particularly through the analysis of electronic health records (EHR), presents significant opportunities for improving diagnostic accuracy and patient outcomes. However, the challenge of processing and accurately labeling vast amounts of unstructured data remains a critical bottleneck, necessitating efficient and reliable solutions. This study investigates the ability of domain specific, fine-tuned large language models (LLMs) to classify unstructured EHR texts with typographical errors through named entity recognition tasks, aiming to improve the efficiency and reliability of supervised learning AI models in healthcare.

Authors

  • Izzet Turkalp Akbasli
    Division of Pediatric Emergency, Department of Pediatrics, Faculty of Medicine, Hacettepe University, Ankara, Turkey. iakbasli@hacettepe.edu.tr.
  • Ahmet Ziya Birbilen
    Division of Pediatric Emergency, Department of Pediatrics, Faculty of Medicine, Hacettepe University, Ankara, Turkey. ahmet.birbilen@hacettepe.edu.tr.
  • Ozlem Teksam
    Division of Pediatric Emergency, Department of Pediatrics, Faculty of Medicine, Hacettepe University, Ankara, Turkey.