Evaluating the Reliability of GPT-4o in Histological Image Interpretation.

Journal: Clinical anatomy (New York, N.Y.)
Published Date:

Abstract

Advanced large language models with multimodal capabilities offer potential new applications in medical education. This study evaluated GPT-4o's performance in interpreting normal histology images. We assessed GPT-4o's ability to interpret 120 histological images across four tissue types at three magnification levels. Three histology experts evaluated the responses using a 4-point rubric across three assessment criteria: tissue/organ identification, structure identification, and structure function assessment. Statistical analysis included ANOVA with Tukey post hoc tests, three-way ANOVA for interaction effects, Pearson's correlation, and the intraclass correlation coefficient (ICC) for inter-rater reliability. GPT-4o achieved an overall mean score of 2.71 (SE 0.07), with 59.01% of responses rated "Good" or "Excellent." Performance varied significantly across tissues, with epithelial tissue scoring highest (mean 3.11, SE 0.06) and muscle lowest (mean 2.43, SE 0.07). Combining all three magnifications yielded better results (mean 3.03, SE 0.07) than low magnification alone (mean 2.41, SE 0.07, p < 0.001). Tissue/organ identification questions received higher scores (mean 2.83) than structure identification (mean 2.65) and structure function assessment (mean 2.64). Inter-rater reliability was excellent (ICC = 0.89). GPT-4o demonstrates moderate histological interpretation ability that varies by tissue type and magnification level, and it performs best when given multiple magnification views. These findings suggest potential use in medical education but indicate the need for instructor supervision.
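
As an illustration of the summary statistics named in the abstract, the following is a minimal Python sketch, not the authors' analysis code. It assumes rubric scores run from 1 to 4 with "Good" and "Excellent" mapped to 3 and 4, and it assumes the two-way random-effects, single-rater form ICC(2,1); the abstract does not state which ICC form was used. The function names and the ratings in the demo are hypothetical.

```python
import math
import statistics

# Assumption: rubric scores are 1-4, with "Good" = 3 and "Excellent" = 4.
GOOD_THRESHOLD = 3


def summarize(scores):
    """Return (mean, standard error, percent rated Good or Excellent)."""
    mean = statistics.mean(scores)
    se = statistics.stdev(scores) / math.sqrt(len(scores))
    pct_good = 100 * sum(s >= GOOD_THRESHOLD for s in scores) / len(scores)
    return mean, se, pct_good


def icc2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` is a list of rows, one row per rated response and one
    column per rater. Assumed form; the source does not specify it.
    """
    n = len(ratings)       # number of rated responses
    k = len(ratings[0])    # number of raters
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    denominator = ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    return (ms_rows - ms_error) / denominator


if __name__ == "__main__":
    # Hypothetical ratings: 5 responses scored by 3 raters (not study data).
    demo = [[3, 3, 4], [2, 2, 2], [4, 4, 4], [1, 2, 1], [3, 3, 3]]
    flat = [score for row in demo for score in row]
    print(summarize(flat))
    print(icc2_1(demo))
```

The ICC function divides by a denominator that is zero when every rating is identical, so real input should contain some variation between responses.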

Authors

  • Volodymyr Mavrych
    College of Medicine, Alfaisal University, Riyadh, Kingdom of Saudi Arabia.
  • Einas M Yousef
    College of Medicine, Alfaisal University, Riyadh, Kingdom of Saudi Arabia.
  • Ahmed Yaqinuddin
    College of Medicine, Alfaisal University, Riyadh, Kingdom of Saudi Arabia.
  • Aftab Ahmed Shaikh
    College of Medicine, Alfaisal University, Riyadh, Kingdom of Saudi Arabia.
  • Olena Bolgova
    College of Medicine, Alfaisal University, Riyadh, Kingdom of Saudi Arabia.
