Evaluating large language models as a supplementary patient information resource on antimalarial use in systemic lupus erythematosus.

Journal: Lupus
PMID:

Abstract

ObjectiveTo assess the accuracy, completeness, and reproducibility of Large Language Models (LLMs) (Copilot, GPT-3.5, and GPT-4) on antimalarial use in systemic lupus erythematosus (SLE).Materials and MethodsWe utilized 13 questions derived from patient surveys and common inquiries from the National Health Service. Two independent rheumatologists assessed responses from the LLMs using predefined Likert scales for accuracy, completeness, and reproducibility.ResultsThe GPT models and Copilot achieved high scores in accuracy. However, the completeness of outputs was rated at 38.5%, 55.9%, and 92.3% for Copilot, GPT-3.5, and GPT-4. When questions related to "mechanism of action" and "lifestyle", were analyzed for completeness ( = 8), ChatGPT-4 scored significantly higher (100%) compared to Copilot (37.5%). In contrast, questions related to "side-effects" ( = 5) scored higher for ChatGPT models than Copilot, and the differences were not statistically significant. All three LLMs demonstrated high reproducibility, with rates ranging from 84.6% to 92.3%.ConclusionsAdvanced LLMs like GPT -4 offer significant promise in enhancing patients' understanding of antimalarial therapy in SLE. Although chatbots' capability can potentially bridge the information gap patients face, the performance and limitations of such tools need further exploration to optimize their use in clinical settings.

Authors

  • Pamela Munguía-Realpozo
    Systemic Rheumatic Autoimmune Diseases Research Unit, Specialties Hospital UMAE- CIBIOR, Mexican Social Security Institute, Puebla, Mexico.
  • Claudia Mendoza-Pinto
    Systemic Rheumatic Autoimmune Diseases Research Unit, Specialties Hospital UMAE- CIBIOR, Mexican Social Security Institute, Puebla, Mexico.
  • Ivet Etchegaray-Morales
    Department of Rheumatology, Medicine School, Autonomous University of Puebla, Puebla, Mexico.
  • Edith Ramírez-Lara
    Systemic Rheumatic Autoimmune Diseases Research Unit, Specialties Hospital UMAE- CIBIOR, Mexican Social Security Institute, Puebla, Mexico.
  • Juan Carlos Solis-Poblano
    Department of Hematology, Specialties Hospital UMAE, Mexican Social Security Institute, Puebla, Mexico.
  • Socorro Méndez-Martínez
    Coordination of Health Research, Mexican Social Security Institute, Puebla, Mexico.
  • Laura Serrano Vertiz
    Systemic Rheumatic Autoimmune Diseases Research Unit, Specialties Hospital UMAE- CIBIOR, Mexican Social Security Institute, Puebla, Mexico.
  • Jorge Ayón-Aguilar
    Coordination of Health Research, Mexican Social Security Institute, Puebla, Mexico.