Large language models for structured reporting in radiology: performance of GPT-4, ChatGPT-3.5, Perplexity and Bing.

Journal: La Radiologia medica
Published Date:

Abstract

Structured reporting may improve the radiological workflow and communication among physicians. Artificial intelligence applications in medicine are growing fast. Large language models (LLMs) are recently gaining importance as valuable tools in radiology and are currently being tested for the critical task of structured reporting. We compared four LLMs models in terms of knowledge on structured reporting and templates proposal. LLMs hold a great potential for generating structured reports in radiology but additional formal validations are needed on this topic.

Authors

  • Carlo A Mallio
    Research Unit of Radiology, Department of Medicine and Surgery, Fondazione Policlinico Universitario Campus Bio-Medico, Università Campus Bio-Medico di Roma, Via Álvaro del Portillo, 200, 00128, Rome, RM, Italy. c.mallio@policlinicocampus.it.
  • Andrea C Sertorio
    Research Unit of Radiology, Department of Medicine and Surgery, Fondazione Policlinico Universitario Campus Bio-Medico, Università Campus Bio-Medico di Roma, Via Álvaro del Portillo, 200, 00128, Rome, RM, Italy.
  • Caterina Bernetti
    Research Unit of Radiology, Department of Medicine and Surgery, Fondazione Policlinico Universitario Campus Bio-Medico, Università Campus Bio-Medico di Roma, Via Álvaro del Portillo, 200, 00128, Rome, RM, Italy.
  • Bruno Beomonte Zobel
    Research Unit of Radiology, Department of Medicine and Surgery, Fondazione Policlinico Universitario Campus Bio-Medico, Università Campus Bio-Medico di Roma, Via Álvaro del Portillo, 200, 00128, Rome, RM, Italy.