In defence of post-hoc explanations in medical AI
Journal:
arXiv
Published Date:
Apr 29, 2025
Abstract
Since the early days of the Explainable AI movement, post-hoc explanations
have been praised for their potential to improve user understanding, promote
trust, and reduce patient safety risks in black box medical AI systems.
Recently, however, critics have argued that the benefits of post-hoc
explanations are greatly exaggerated since they merely approximate, rather than
replicate, the actual reasoning processes that black box systems take to arrive
at their outputs. In this article, we aim to defend the value of post-hoc
explanations against this recent critique. We argue that even if post-hoc
explanations do not replicate the exact reasoning processes of black box
systems, they can still improve users' functional understanding of black box
systems, increase the accuracy of clinician-AI teams, and assist clinicians in
justifying their AI-informed decisions. While post-hoc explanations are not a
"silver bullet" solution to the black box problem in medical AI, we conclude
that they remain a useful strategy for addressing the black box problem in
medical AI.