Token Probabilities to Mitigate Large Language Models Overconfidence in Answering Medical Questions: Quantitative Study.
Journal:
Journal of medical Internet research
Published Date:
Aug 29, 2025
Abstract
BACKGROUND: Chatbots have demonstrated promising capabilities in medicine, scoring passing grades for board examinations across various specialties. However, their tendency to express high levels of confidence in their responses, even when incorrect, poses a limitation to their utility in clinical settings.