The aluminum standard: using generative Artificial Intelligence tools to synthesize and annotate non-structured patient data.

Journal: BMC medical informatics and decision making
PMID:

Abstract

BACKGROUND: Medical narratives are fundamental to the correct identification of a patient's health condition. This is not only because it describes the patient's situation. It also contains relevant information about the patient's context and health state evolution. Narratives are usually vague and cannot be categorized easily. On the other hand, once the patient's situation is correctly identified based on a narrative, it is then possible to map the patient's situation into precise classification schemas and ontologies that are machine-readable. To this end, language models can be trained to read and extract elements from these narratives. However, the main problem is the lack of data for model identification and model training in languages other than English. First, gold standard annotations are usually not available due to the high level of data protection for patient data. Second, gold standard annotations (if available) are difficult to access. Alternative available data, like MIMIC (Sci Data 3:1, 2016) is written in English and for specific patient conditions like intensive care. Thus, when model training is required for other types of patients, like oncology (and not intensive care), this could lead to bias. To facilitate clinical narrative model training, a method for creating high-quality synthetic narratives is needed.

Authors

  • Juan G Diaz Ochoa
    PerMediQ GmbH, Pelargusstr. 2, 70180 Stuttgart, Germany. Electronic address: juan.diaz@permediq.de.
  • Faizan E Mustafa
    QUIBIQ GmbH, Heßbrühlstr. 11, D-70565 Stuttgart, Germany. Electronic address: faizan.e.mustafa@quibiq.de.
  • Felix Weil
    QuiBiQ GmbH, Heßbrühlstr. 11, Stuttgart, D-70565, Germany.
  • Yi Wang
    Department of Neurology, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, China.
  • Kudret Kama
    Klinikum Stuttgart, Stuttgart Cancer Center - Tumorzentrum Eva Mayr-Stihl DE, Kriegsbergstraße 60, Stuttgart, D-70174, Germany.
  • Markus Knott
    Klinikum Stuttgart, Stuttgart Cancer Center - Tumorzentrum Eva Mayr-Stihl DE, Kriegsbergstraße 60, Stuttgart, D-70174, Germany.