Noninferiority of Artificial Intelligence-Assisted Analysis of Ki-67 and Estrogen/Progesterone Receptor in Breast Cancer Routine Diagnostics.

Journal: Modern pathology : an official journal of the United States and Canadian Academy of Pathology, Inc

PMID: 36931740

Abstract

Image analysis assistance with artificial intelligence (AI) has become one of the great promises over recent years in pathology, with many scientific studies being published each year. Nonetheless, and perhaps surprisingly, only few image AI systems are already in routine clinical use. A major reason for this is the missing validation of the robustness of many AI systems: beyond a narrow context, the large variability in digital images due to differences in preanalytical laboratory procedures, staining procedures, and scanners can be challenging for the subsequent image analysis. Resulting faulty AI analysis may bias the pathologist and contribute to incorrect diagnoses and, therefore, may lead to inappropriate therapy or prognosis. In this study, a pretrained AI assistance tool for the quantification of Ki-67, estrogen receptor (ER), and progesterone receptor (PR) in breast cancer was evaluated within a realistic study set representative of clinical routine on a total of 204 slides (72 Ki-67, 66 ER, and 66 PR slides). This represents the cohort with the largest image variance for AI tool evaluation to date, including 3 staining systems, 5 whole-slide scanners, and 1 microscope camera. These routine cases were collected without manual preselection and analyzed by 10 participant pathologists from 8 sites. Agreement rates for individual pathologists were found to be 87.6% for Ki-67 and 89.4% for ER/PR, respectively, between scoring with and without the assistance of the AI tool regarding clinical categories. Individual AI analysis results were confirmed by the majority of pathologists in 95.8% of Ki-67 cases and 93.2% of ER/PR cases. The statistical analysis provides evidence for high interobserver variance between pathologists (Krippendorff's α, 0.69) in conventional immunohistochemical quantification. Pathologist agreement increased slightly when using AI support (Krippendorff α, 0.72). Agreement rates of pathologist scores with and without AI assistance provide evidence for the reliability of immunohistochemical scoring with the support of the investigated AI tool under a large number of environmental variables that influence the quality of the diagnosed tissue images.

Authors

Niklas Abele

Friedrich-Alexander-Universität Erlangen-Nürnberg, Institut für Pathologie, Erlangen, Germany. Electronic address: niklas.abele@uk-erlangen.de.
Katharina Tiemann

Institute of Hematopathology Hamburg, Hamburg, Germany.
Till Krech

Institute of Pathology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany; Institute of Pathology, Clinical Center Osnabrueck, Osnabrueck, Germany.
Axel Wellmann

Institute of Pathology Celle, Celle, Germany.
Christian Schaaf

Department of Internal Medicine II, Klinikum rechts der Isar of the TU Munich, Munich, Germany.
Florian Länger

Institut für Pathologie, Medizinische Hochschule Hannover, Hannover, Germany.
Anja Peters

Institut für Pathologie, Städtisches Klinikum Lüneburg gGmbH, Lüneburg, Germany.
Andreas Donner

Zentrum für Pathologie, Zytologie und Molekularpathologie Neuss, Neuss, Germany.
Felix Keil

Institute of Pathology, University of Regensburg, Regensburg, Germany.
Khalid Daifalla

Mindpeak, Hamburg, Germany.
Marina Mackens

Institute of Hematopathology Hamburg, Hamburg, Germany.
Andreas Mamilos

Institute of Pathology, University of Regensburg, Regensburg, Germany.
Evgeny Minin

Institute of Pathology, Clinical Center Osnabrueck, Osnabrueck, Germany.
Michel Krümmelbein

Institute of Hematopathology Hamburg, Hamburg, Germany.
Linda Krause

Institute of Medical Biometry and Epidemiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
Maria Stark

Institute of Medical Biometry and Epidemiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
Antonia Zapf
Marc Päpper

Mindpeak, Hamburg, Germany.
Arndt Hartmann

Institute of Pathology, University Hospital of Friedrich-Alexander-University Erlangen-Nürnberg, Germany.
Tobias Lang

Mindpeak, Hamburg, Germany. tobias.lang@mindpeak.ai.

Keywords

Artificial Intelligence Breast Neoplasms Estrogens Female Humans Ki-67 Antigen Receptors, Estrogen Receptors, Progesterone Reproducibility of Results

External Resources

View on PubMed Access via DOI PubMed (36931740)

Noninferiority of Artificial Intelligence-Assisted Analysis of Ki-67 and Estrogen/Progesterone Receptor in Breast Cancer Routine Diagnostics.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals