Evaluating large language models as a supplementary patient information resource on antimalarial use in systemic lupus erythematosus.

Journal: Lupus

PMID: 40013495

Abstract

ObjectiveTo assess the accuracy, completeness, and reproducibility of Large Language Models (LLMs) (Copilot, GPT-3.5, and GPT-4) on antimalarial use in systemic lupus erythematosus (SLE).Materials and MethodsWe utilized 13 questions derived from patient surveys and common inquiries from the National Health Service. Two independent rheumatologists assessed responses from the LLMs using predefined Likert scales for accuracy, completeness, and reproducibility.ResultsThe GPT models and Copilot achieved high scores in accuracy. However, the completeness of outputs was rated at 38.5%, 55.9%, and 92.3% for Copilot, GPT-3.5, and GPT-4. When questions related to "mechanism of action" and "lifestyle", were analyzed for completeness ( = 8), ChatGPT-4 scored significantly higher (100%) compared to Copilot (37.5%). In contrast, questions related to "side-effects" ( = 5) scored higher for ChatGPT models than Copilot, and the differences were not statistically significant. All three LLMs demonstrated high reproducibility, with rates ranging from 84.6% to 92.3%.ConclusionsAdvanced LLMs like GPT -4 offer significant promise in enhancing patients' understanding of antimalarial therapy in SLE. Although chatbots' capability can potentially bridge the information gap patients face, the performance and limitations of such tools need further exploration to optimize their use in clinical settings.

Authors

Pamela Munguía-Realpozo

Systemic Rheumatic Autoimmune Diseases Research Unit, Specialties Hospital UMAE- CIBIOR, Mexican Social Security Institute, Puebla, Mexico.
Claudia Mendoza-Pinto

Systemic Rheumatic Autoimmune Diseases Research Unit, Specialties Hospital UMAE- CIBIOR, Mexican Social Security Institute, Puebla, Mexico.
Ivet Etchegaray-Morales

Department of Rheumatology, Medicine School, Autonomous University of Puebla, Puebla, Mexico.
Edith Ramírez-Lara

Systemic Rheumatic Autoimmune Diseases Research Unit, Specialties Hospital UMAE- CIBIOR, Mexican Social Security Institute, Puebla, Mexico.
Juan Carlos Solis-Poblano

Department of Hematology, Specialties Hospital UMAE, Mexican Social Security Institute, Puebla, Mexico.
Socorro Méndez-Martínez

Coordination of Health Research, Mexican Social Security Institute, Puebla, Mexico.
Laura Serrano Vertiz

Systemic Rheumatic Autoimmune Diseases Research Unit, Specialties Hospital UMAE- CIBIOR, Mexican Social Security Institute, Puebla, Mexico.
Jorge Ayón-Aguilar

Coordination of Health Research, Mexican Social Security Institute, Puebla, Mexico.

Keywords

Antimalarials Female Humans Language Large Language Models Lupus Erythematosus, Systemic Male Patient Education as Topic Reproducibility of Results Surveys and Questionnaires

External Resources

View on PubMed Access via DOI PubMed (40013495)

Evaluating large language models as a supplementary patient information resource on antimalarial use in systemic lupus erythematosus.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals