Comparative study of ChatGPT and human evaluators on the assessment of medical literature according to recognised reporting standards.
Journal: BMJ Health & Care Informatics
Published Date: Oct 1, 2023
Abstract
INTRODUCTION: Clinicians struggle to stay up to date with medical research. Artificial intelligence (AI) tools such as the large language model (LLM) ChatGPT could automate the appraisal of research quality, saving time and reducing bias. This study compares the proficiency of ChatGPT3 with that of human evaluators in scoring abstracts, to determine its potential as a tool for evidence synthesis.