Theory of Mind Imitation by LLMs for Physician-Like Human Evaluation

Journal: medRxiv
Published Date:

Abstract

Aligning the Theory of Mind (ToM) capabilities of Large Language Models (LLMs) with human cognitive processes enables them to imitate physician behavior. This study evaluates LLMs' abilities, such as Belief and Knowledge, Reasoning and Problem-Solving, Communication and Language Skills, Emotional and Social Intelligence, Self-Awareness, and Metacognition, in performing human-like evaluations of Foundation Models. We used a dataset composed of clinical questions, reference answers, and LLM-generated responses based on guidelines for the prevention of heart disease. Comparing GPT-4 to human experts across ToM abilities, we found that agreement, measured with the Brennan-Prediger coefficient, was highest for Emotional and Social Intelligence. This study contributes to a deeper understanding of LLMs' cognitive capabilities and highlights their potential role in augmenting or complementing human clinical assessments.
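
For readers unfamiliar with the agreement statistic named above, the following is a minimal sketch of the two-rater Brennan-Prediger coefficient, which corrects observed agreement for a uniform chance level of 1/q over q rating categories. The function name, the 5-point scale, and the example ratings are hypothetical and chosen only for illustration; the abstract does not state the scale or the data used in the study.

```python
from typing import Hashable, Sequence


def brennan_prediger(
    rater_a: Sequence[Hashable],
    rater_b: Sequence[Hashable],
    num_categories: int,
) -> float:
    """Two-rater Brennan-Prediger coefficient: (p_o - 1/q) / (1 - 1/q)."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("Both raters must score the same non-empty set of items.")
    if num_categories < 2:
        raise ValueError("At least two rating categories are required.")

    # Observed agreement: fraction of items given the same category by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / len(rater_a)

    # Chance agreement is assumed uniform across the q categories.
    chance = 1.0 / num_categories

    return (observed - chance) / (1.0 - chance)


# Hypothetical ratings on an assumed 5-point scale (not data from the study).
human_scores = [5, 4, 4, 3, 5, 2, 4, 5]
model_scores = [5, 4, 3, 3, 5, 2, 4, 4]

# 6 of 8 items match, so p_o = 0.75 and the coefficient is approximately 0.69.
print(brennan_prediger(human_scores, model_scores, num_categories=5))
```

Unlike Cohen's kappa, the chance term here does not depend on how often each rater uses each category, so the coefficient stays stable when ratings cluster in one or two categories.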

Authors

  • Raghav Awasthi; Shreya Mishra; Charumathi Raghu; Auron Moises; Ashish Atreja; Dwarikanath Mahapatra; Nishant Singh; Ashish K. Khanna; Jacek B. Cywinski; Kamal Maheshwari; Francis A. Papay; Piyush Mathur