Educational Measurement - AI Medical Compendium

Generative AI vs. human expertise: a comparative analysis of case-based rational pharmacotherapy question generation.

European journal of clinical pharmacology Jun 1, 2025

PURPOSE: This study evaluated the performance of three generative AI models (ChatGPT-4o, Gemini 1.5 Advanced Pro, and Claude 3.5 Sonnet) in producing case-based rational pharmacology questions compared to expert educators.

Humans; Educational Measurement; Diabetes Mellitus, Type 2; Hypertension; Artificial Intelligence; Students, Medical


Chatbots' Role in Generating Single Best Answer Questions for Undergraduate Medical Student Assessment: Comparative Analysis.

JMIR medical education May 30, 2025

BACKGROUND: Programmatic assessment supports flexible learning and individual progression but challenges educators to develop frequent assessments reflecting different competencies. The continuous creation of large volumes of assessment items, in a c...

Generative Artificial Intelligence; Education, Medical, Undergraduate; Clinical Competence; Educational Measurement; Reproducibility of Results; Students, Medical; Humans; Artificial Intelligence


Assessing ChatGPT-4's performance on the US prosthodontic exam: impact of fine-tuning and contextual prompting vs. base knowledge, a cross-sectional study.

BMC medical education May 23, 2025

BACKGROUND: Artificial intelligence (AI), such as ChatGPT-4 from OpenAI, has the potential to transform medical education and assessment. However, its effectiveness in specialized fields like prosthodontics, especially when comparing base to fine-tun...

Generative Artificial Intelligence; United States; Cross-Sectional Studies; Educational Measurement; Internship and Residency; Prosthodontics; Humans; Artificial Intelligence


Bridging AI and Medical Expertise: ChatGPT's Success on the Medical Specialization Residency Admission Exam in Spain.

Studies in health technology and informatics May 15, 2025

The growing use of artificial intelligence (AI) in healthcare, particularly the potential of generative AI models like ChatGPT-4, is a trending topic. The study examines how ChatGPT-4 performed on the national Medicine Residency exam in Sp...

Specialization; Humans; Artificial Intelligence; Spain; Educational Measurement; Internship and Residency; Generative Artificial Intelligence


The Advanced Reasoning Capabilities of Large Language Models for Detecting Contraindicated Options in Medical Exams.

JMIR medical informatics May 12, 2025

Enhancing clinical reasoning and reducing diagnostic errors are essential in medical practice; OpenAI-o1, with advanced reasoning capabilities, performed better than GPT-4 on 15 Japanese National Medical Licensing Examination questions (accuracy: 100...

Large Language Models; Educational Measurement; Japan; Clinical Reasoning; Licensure, Medical; Humans


Accurate multi-category student performance forecasting at early stages of online education using neural networks.

Scientific reports May 9, 2025

The ability to accurately predict and analyze student performance in online education, both at the outset and throughout the semester, is vital. Most published studies focus on binary classification (fail or pass), but there is still a signific...

Students; Forecasting; Universities; Educational Measurement; Female; Education, Distance; Humans; Neural Networks, Computer


The Performance of AI in Dermatology Exams: The Exam Success and Limits of ChatGPT.

Journal of cosmetic dermatology May 1, 2025

BACKGROUND: Artificial intelligence holds significant potential in dermatology.

Humans; Artificial Intelligence; Clinical Competence; Educational Measurement; Dermatology; Generative Artificial Intelligence; Internship and Residency


Is artificial intelligence successful in the Turkish neurology board exam?

Neurological research May 1, 2025

OBJECTIVES: OpenAI declared that GPT-4 performed better in academic and certain specialty areas. Medical licensing exams assess the clinical competence of doctors. We aimed to investigate for the first time how ChatGPT will perform in the Turkish Neur...

Turkey; Neurology; Clinical Competence; Educational Measurement; Humans; Artificial Intelligence


Evaluating the Performance of Large Language Models (LLMs) in Answering and Analysing the Chinese Dental Licensing Examination.

European journal of dental education : official journal of the Association for Dental Education in Europe May 1, 2025

BACKGROUND: This study aimed to simulate diverse scenarios of students employing LLMs for CDLE examination preparation, providing a detailed evaluation of their performance in medical education.

China; Educational Measurement; Education, Dental; Large Language Models; Licensure, Dental


Comparative Accuracy of Generative Artificial Intelligence Platforms on Predoctoral Pediatric Dentistry Examination.

Pediatric dentistry Mar 15, 2025

To determine the comparative accuracy of seven generative artificial intelligence (GenAI) platforms in answering multiple-choice questions on a predoctoral pediatric dentistry examination. This study evaluated the impact of question type and GenAI t...

Educational Measurement; Generative Artificial Intelligence; Humans; Artificial Intelligence; Education, Dental; Pediatric Dentistry

