Evaluation of Multilingual Simplifications of IR Procedural Reports Using GPT-4.
Journal:
Journal of vascular and interventional radiology : JVIR
PMID:
39793700
Abstract
This study assessed the feasibility of large language models such as GPT-4 (OpenAI, San Francisco, California) to summarize interventional radiology procedural reports to improve layperson understanding and translate medical texts into multiple languages. Two hundred reports from 8 categories were summarized using GPT-4. Readability was assessed with Flesch-Kincaid reading level (FKRL) and Flesch reading ease score (FRES). Accuracy was assessed by 8 interventional radiologists. Summaries were translated into Spanish, Korean, Chinese, and Swahili, and their accuracy were assessed by 8 bilingual interventional radiologists. The original reports' FKRL of 10.7 and FRES of 41.9 improved to 7.0 and 73.0, respectively. Summaries were mostly accurate, with minimal misinformation. Translations introduced an increase in number of misinformation but no significant increase in critically wrong information. Layperson comprehension scores improved significantly from 2.5 to 4.3 out of 5 after summarization. Overall, GPT-4 enhanced report readability and comprehension, suggesting potential for broader application in improving patient communication.