Exploring the risks of over-reliance on AI in diagnostic pathology. What lessons can be learned to support the training of young pathologists?

Journal: PloS one
Published Date:

Abstract

The integration of Artificial Intelligence (AI) algorithms into pathology practice presents both opportunities and challenges. Although it can improve accuracy and inter-rater reliability, it is not infallible and can produce erroneous diagnoses, hence the need for pathologists to always check predictions. This critical judgment is particularly important when algorithm errors could lead to high-impact negative clinical outcomes, such as missing an invasive carcinoma. However, the influence of AI tools on pathologists' decision-making is not well explored. This study aims to evaluate the impact of a previously developed AI tool on the diagnostic accuracy and inter-rater reliability among pathologists, while assessing whether pathologists maintain independent judgment of AI predictions. Eight pathologists from different hospitals and with varying levels of experience, participated in the study. Each of them reviewed 115 slides of laryngeal biopsies, including benign epithelium, low-grade and high-grade dysplasia, and invasive squamous carcinomas. The study compared diagnostic outcomes with and without AI assistance. The reference labels were established by an expert's double-blind review. Results show that assisted pathologists had a higher accuracy for high-grade dysplasia, invasive carcinoma and improved inter-rater reliability. However, cases of over-reliance on AI have been observed, resulting in the omission of correctly diagnosed invasive carcinomas during the unassisted examination. The false predictions on these carcinoma slides were labeled with a low confidence score, which was not considered by the less experienced pathologists, showing the risk that they would follow the AI prediction without enough critical judgment or expertise. Our study emphasizes the potential over-reliance of pathologists on AI models and the potential harmful consequences, even with the advancement of powerful algorithms. The integration of confidence scores and the education of pathologists to use this tool could help to optimize the safe integration of AI into pathology practice.

Authors

  • Yaëlle Bellahsen-Harrar
    Department of Pathology, APHP, Hôpital Européen Georges-Pompidou, Paris, France.
  • Mélanie Lubrano
    Centre for Computational Biology (CBIO), Mines Paris, PSL University, Paris, France.
  • Charles Lépine
    Department of Pathology, University Hospital of Nantes, 9 Quai Moncousu cedex 01, 44093, Nantes, France.
  • Aurélie Beaufrère
    Assistance Publique-Hôpitaux de Paris, Beaujon University Hospital, Department of Pathology, F-92110, Clichy, France.
  • Claire Bocciarelli
    Department of Pathology, CHRU Brest, Brest, France.
  • Anaïs Brunet
    Department of Pathology, Hôpital Henri Mondor, APHP, Créteil, France.
  • Elise Decroix
    Department of Pathology, IUCT-Oncopole, Toulouse, France.
  • Franck Neil El-Sissy
    Department of Pathology, Hôpital Lariboisière, APHP, Paris, France.
  • Bettina Fabiani
    Tribun Health, Paris, France.
  • Aurélien Morini
    Faculté de médecine de Créteil, Université Paris Est Créteil, Créteil, France.
  • Cyprien Tilmant
  • Thomas Walter
  • Cécile Badoual
    Department of Pathology, APHP, Hôpital Européen Georges-Pompidou, Paris, France.