AIMC Topic: Educational Measurement

Showing 271 to 280 of 311 articles

Generative AI vs. human expertise: a comparative analysis of case-based rational pharmacotherapy question generation.

European journal of clinical pharmacology
PURPOSE: This study evaluated the performance of three generative AI models (ChatGPT-4o, Gemini 1.5 Advanced Pro, and Claude 3.5 Sonnet) in producing case-based rational pharmacology questions compared to expert educators.

Chatbots' Role in Generating Single Best Answer Questions for Undergraduate Medical Student Assessment: Comparative Analysis.

JMIR medical education
BACKGROUND: Programmatic assessment supports flexible learning and individual progression but challenges educators to develop frequent assessments reflecting different competencies. The continuous creation of large volumes of assessment items, in a c...

Assessing ChatGPT-4's performance on the US prosthodontic exam: impact of fine-tuning and contextual prompting vs. base knowledge, a cross-sectional study.

BMC medical education
BACKGROUND: Artificial intelligence (AI), such as ChatGPT-4 from OpenAI, has the potential to transform medical education and assessment. However, its effectiveness in specialized fields like prosthodontics, especially when comparing base to fine-tun...

Bridging AI and Medical Expertise: ChatGPT's Success on the Medical Specialization Residency Admission Exam in Spain.

Studies in health technology and informatics
The growing use of Artificial Intelligence (AI) in healthcare, particularly the potential of generative AI models like ChatGPT-4, is a trending topic. The study examines how ChatGPT-4 performed on the national Medicine Residency exam in Sp...

The Advanced Reasoning Capabilities of Large Language Models for Detecting Contraindicated Options in Medical Exams.

JMIR medical informatics
Enhancing clinical reasoning and reducing diagnostic errors are essential in medical practice; OpenAI-o1, with advanced reasoning capabilities, performed better than GPT-4 on 15 Japanese National Medical Licensing Examination questions (accuracy: 100...

Accurate multi-category student performance forecasting at early stages of online education using neural networks.

Scientific reports
The ability to accurately predict and analyze student performance in online education, both at the outset and throughout the semester, is vital. Most of the published studies focus on binary classification (Fail or Pass) but there is still a signific...

Is artificial intelligence successful in the Turkish neurology board exam?

Neurological research
OBJECTIVES: OpenAI declared that GPT-4 performed better in academic and certain specialty areas. Medical licensing exams assess the clinical competence of doctors. We aimed to investigate for the first time how ChatGPT will perform in the Turkish Neur...

Evaluating the Performance of Large Language Models (LLMs) in Answering and Analysing the Chinese Dental Licensing Examination.

European journal of dental education : official journal of the Association for Dental Education in Europe
BACKGROUND: This study aimed to simulate diverse scenarios of students employing LLMs for CDLE examination preparation, providing a detailed evaluation of their performance in medical education.

Comparative Accuracy of Generative Artificial Intelligence Platforms on Predoctoral Pediatric Dentistry Examination.

Pediatric dentistry
To determine the comparative accuracy of seven generative artificial intelligence (GenAI) platforms in answering multiple-choice questions on a predoctoral pediatric dentistry examination. This study evaluated the impact of question type and GenAI t...