Comparative evaluation of artificial intelligence models GPT-4 and GPT-3.5 in clinical decision-making in sports surgery and physiotherapy: a cross-sectional study.

Journal: BMC Medical Informatics and Decision Making
PMID:

Abstract

BACKGROUND: The integration of artificial intelligence (AI) in healthcare has rapidly expanded, particularly in clinical decision-making. Large language models (LLMs) such as GPT-4 and GPT-3.5 have shown potential in various medical applications, including diagnostics and treatment planning. However, their efficacy in specialized fields like sports surgery and physiotherapy remains underexplored. This study aims to compare the performance of GPT-4 and GPT-3.5 in clinical decision-making within these domains using a structured assessment approach.

Authors

  • Sönmez Saglam
    Department of Orthopaedics and Traumatology, Faculty of Medicine, Duzce University, Duzce, Türkiye. dr.sonmezsaglam@gmail.com.
  • Veysel Uludag
    Department of Physiotherapy and Rehabilitation, Faculty of Health Sciences, Duzce University, Duzce, Türkiye.
  • Zekeriya Okan Karaduman
    Department of Orthopaedics and Traumatology, Faculty of Medicine, Duzce University, Duzce, Türkiye.
  • Mehmet Arıcan
    Department of Orthopaedics and Traumatology, Faculty of Medicine, Duzce University, Duzce, Türkiye.
  • Mücahid Osman Yücel
    Department of Orthopaedics and Traumatology, Faculty of Medicine, Duzce University, Duzce, Türkiye.
  • Raşit Emin Dalaslan
    Department of Orthopaedics and Traumatology, Faculty of Medicine, Duzce University, Duzce, Türkiye.