[Evaluating the accuracy of large language models in answering mammography screening questions in Italian and English: a study based on the Eusobi guidelines.].

Journal: Recenti progressi in medicina
PMID:

Abstract

INTRODUCTION: Artificial intelligence (AI) is transforming various aspects of everyday life, including healthcare, through large language models (LLMs) like ChatGPT, Gemini, and Copilot. These systems are increasingly used to disseminate medical information, allowing patients to access simplified explanations. This study aims to compare responses to breast imaging-related questions formulated in Italian and English, based on Eusobi guidelines, evaluating the LLMs' ability to provide accurate and complete answers on mammography screening concepts.

Authors

  • Manuel Signorini
    Unità operativa complessa di Radiologia, Azienda Ulss 5 Polesana, Rovigo.
  • Silvia Fontani
    Unità operativa complessa di Radiologia, Azienda Ulss 5 Polesana, Rovigo.
  • Paola Minichetti
    Istituto di Radiologia, Dipartimento di Medicina, Università di Udine.
  • Silvia Teggi
    Unità operativa complessa di Radiologia, Azienda Ulss 5 Polesana, Rovigo.
  • Alessandra Barusco
    Unità operativa complessa di Radiologia, Azienda Ulss 5 Polesana, Rovigo.
  • Massimo Favat
    Unità operativa complessa di Radiologia, Azienda Ulss 5 Polesana, Rovigo.