Evaluating Large Language Models for the National Premedical Exam in India: Comparative Analysis of GPT-3.5, GPT-4, and Bard.

Journal: JMIR medical education
Published Date:

Abstract

BACKGROUND: Large language models (LLMs) have revolutionized natural language processing with their ability to generate human-like text through extensive training on large data sets. These models, including Generative Pre-trained Transformers (GPT)-3.5 (OpenAI), GPT-4 (OpenAI), and Bard (Google LLC), find applications beyond natural language processing, attracting interest from academia and industry. Students are actively leveraging LLMs to enhance learning experiences and prepare for high-stakes exams, such as the National Eligibility cum Entrance Test (NEET) in India.

Authors

  • Faiza Farhat
    Department of Zoology, Aligarh Muslim University, Aligarh, India.
  • Beenish Moalla Chaudhry
    School of Computing and Informatics, The University of Louisiana, Lafayette, LA, United States.
  • Mohammad Nadeem
    Department of Computer Science, Aligarh Muslim University, Aligarh, India.
  • Shahab Saquib Sohail
    Department of Computer Science and Engineering, Jamia Hamdard, New Delhi, India.
  • Dag Øivind Madsen
    USN School of Business, University of South-Eastern Norway, Hønefoss, Norway.