Evaluating Large Language Model's accuracy in current procedural terminology coding given operative note templates across various plastic surgery sub-specialties.

Journal: Journal of plastic, reconstructive & aesthetic surgery : JPRAS
Published Date:

Abstract

BACKGROUND: Manual CPT coding from operative notes is a time-intensive process that adds to the administrative burden in healthcare. Large Language Models (LLMs) offer a promising solution, but their accuracy in assigning CPT codes based on full operative note templates remains largely untested. Thus, this study evaluates the ability of three LLMs - GPT-4, Gemini, and Copilot - to generate accurate CPT codes from operative note templates across diverse plastic surgery procedures.

Authors

  • Mia J Carrarini
    Department of Plastic Surgery, University of Pittsburgh Medical Center, Pittsburgh, PA 15219, USA.
  • Hilary Y Liu
    Department of Plastic Surgery, University of Pittsburgh Medical Center, 1350 Locust Street, Pittsburgh, PA, G10315219, USA.
  • Catherine K Perez
    Department of Plastic Surgery, University of Pittsburgh Medical Center, Pittsburgh, PA 15219, USA.
  • Francesco M Egro
    Department of Plastic Surgery, University of Pittsburgh Medical Center, 1350 Locust Street, Pittsburgh, PA, G10315219, USA. francescoegro@gmail.com.