An investigative analysis - ChatGPT's capability to excel in the Polish speciality exam in pathology.

Journal: Polish journal of pathology : official journal of the Polish Society of Pathologists
PMID:

Abstract

This study evaluates the effectiveness of the ChatGPT-3.5 language model in providing correct answers to pathomorphology questions as required by the State Speciality Examination (PES). Artificial intelligence (AI) in medicine is generating increasing interest, but its potential needs thorough evaluation. A set of 119 exam questions by type and subtype were used, which were posed to the ChatGPT-3.5 model. Performance was analysed with regard to the success rate in different question categories and subtypes. ChatGPT-3.5 achieved a performance of 45.38%, which is significantly below the minimum PES pass threshold. The results achieved varied by question type and subtype, with better results in questions requiring "comprehension and critical thinking" than "memory". The analysis shows that, although ChatGPT-3.5 can be a useful teaching tool, its performance in providing correct answers to pathomorphology questions is significantly lower than that of human respondents. This conclusion highlights the need to further improve the AI model, taking into account the specificities of the medical field. Artificial intelligence can be helpful, but it cannot fully replace the experience and knowledge of specialists.

Authors

  • Michał Bielówka
    Students' Scientific Association of Computer Analysis and Artificial Intelligence at the Department of Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland.
  • Jakub Kufel
    Department of Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland.
  • Marcin Rojek
    Students' Scientific Association of Computer Analysis and Artificial Intelligence at the Department of Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland.
  • Dominika Kaczyńska
    Students' Scientific Association of Computer Analysis and Artificial Intelligence at the Department of Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland.
  • Łukasz Czogalik
    Students' Scientific Association of Computer Analysis and Artificial Intelligence at the Department of Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland.
  • Adam Mitręga
    Students' Scientific Association of Computer Analysis and Artificial Intelligence at the Department of Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland.
  • Wiktoria Bartnikowska
    Faculty of Medical Sciences in Katowice, Medical University of Silesia, Katowice, Poland.
  • Dominika Kondoł
    Dr B. Hager Memorial Multi-Specialty District Hospital, Tarnowskie Góry, Poland.
  • Kacper Palkij
    Dr B. Hager Memorial Multi-Specialty District Hospital, Tarnowskie Góry, Poland.
  • Sylwia Mielcarska
    Department of Medical and Molecular Biology, Faculty of Medical Sciences in Zabrze, Medical University of Silesia, Zabrze, Poland.