Advancements in large language model accuracy for answering physical medicine and rehabilitation board review questions.

Journal: PM & R : the journal of injury, function, and rehabilitation
Published Date:

Abstract

BACKGROUND: There have been significant advances in machine learning and artificial intelligence technology over the past few years, leading to the release of large language models (LLMs) such as ChatGPT. There are many potential applications for LLMs in health care, but it is critical to first determine how accurate LLMs are before putting them into practice. No studies have evaluated the accuracy and precision of LLMs in responding to questions related to the field of physical medicine and rehabilitation (PM&R).

Authors

  • Jason Bitterman
    Division of Physical Medicine and Rehabilitation, Hartford Healthcare Medical Group, Hartford, Connecticut, USA.
  • Alexander D'Angelo
    Nebraska Medicine Department of Physical Medicine and Rehabilitation, University of Nebraska Medical Center, Omaha, Nebraska, USA.
  • Alexandra Holachek
  • James E Eubanks
    Department of Orthopedics and Physical Medicine, Division of Physical Medicine and Rehabilitation, Medical University of South Carolina (MUSC), Charleston, South Carolina, USA.

Keywords

No keywords available for this article.