Cytologic scoring of equine exercise-induced pulmonary hemorrhage: Performance of human experts and a deep learning-based algorithm.

Journal: Veterinary pathology
PMID:

Abstract

Exercise-induced pulmonary hemorrhage (EIPH) is a relevant respiratory disease in sport horses, which can be diagnosed by examination of bronchoalveolar lavage fluid (BALF) cells using the total hemosiderin score (THS). The aim of this study was to evaluate the diagnostic accuracy and reproducibility of annotators and to validate a deep learning-based algorithm for the THS. Digitized cytological specimens stained for iron were prepared from 52 equine BALF samples. Ten annotators produced a THS for each slide according to published methods. The reference methods for comparing annotator's and algorithmic performance included a ground truth dataset, the mean annotators' THSs, and chemical iron measurements. Results of the study showed that annotators had marked interobserver variability of the THS, which was mostly due to a systematic error between annotators in grading the intracytoplasmatic hemosiderin content of individual macrophages. Regarding overall measurement error between the annotators, 87.7% of the variance could be reduced by using standardized grades based on the ground truth. The algorithm was highly consistent with the ground truth in assigning hemosiderin grades. Compared with the ground truth THS, annotators had an accuracy of diagnosing EIPH (THS of < or ≥ 75) of 75.7%, whereas, the algorithm had an accuracy of 92.3% with no relevant differences in correlation with chemical iron measurements. The results show that deep learning-based algorithms are useful for improving reproducibility and routine applicability of the THS. For THS by experts, a diagnostic uncertainty interval of 40 to 110 is proposed. THSs within this interval have insufficient reproducibility regarding the EIPH diagnosis.

Authors

  • Christof A Bertram
    Institute of Veterinary Pathology, Freie Universität Berlin, Berlin, Germany.
  • Christian Marzahl
    Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany. c.marzahl@euroimmun.de.
  • Alexander Bartel
    Freie Universität Berlin, Berlin, Germany.
  • Jason Stayt
    VetPath Laboratory Services, Ascot, Western, Australia.
  • Federico Bonsembiante
    University of Padova, Legnaro, Italy.
  • Janet Beeler-Marfisi
    University of Guelph, Guelph, Ontario, Canada.
  • Ann K Barton
    Equine Clinic, Freie Universität Berlin, Berlin, Germany.
  • Ginevra Brocca
    University of Padova, Legnaro, Italy.
  • Maria E Gelain
    University of Padova, Legnaro, Italy.
  • Agnes Gläsel
    Justus-Liebig-Universität Giessen, Giessen, Germany.
  • Kelly du Preez
    University of Pretoria, Pretoria, South Africa.
  • Kristina Weiler
    Justus-Liebig-Universität Giessen, Giessen, Germany.
  • Christiane Weissenbacher-Lang
    University of Veterinary Medicine Vienna, Vienna, Austria.
  • Katharina Breininger
  • Marc Aubreville
    Pattern Recognition Lab, Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany. marc.aubreville@fau.de.
  • Andreas Maier
    Pattern Recognition Lab, University Erlangen-Nürnberg, Erlangen, Germany.
  • Robert Klopfleisch
    Institute of Veterinary Pathology, Freie Universität Berlin, Berlin, Germany.
  • Jenny Hill
    VetPath Laboratory Services, Ascot, Western, Australia.