Multilayer outperforms single-layer slide scanning in AI-based classification of whole slide images with low-burden acid-fast mycobacteria (AFB).

Journal: Computer methods and programs in biomedicine
Published Date:

Abstract

Manual screening of Ziehl-Neelsen (ZN)-stained slides that are negative or contain rare acid-fast mycobacteria (AFB) is labor-intensive and requires repetitive refocusing to visualize AFB candidates under the microscope. Whole slide image (WSI) scanners have enabled implementation of AI to classify digital ZN-stained slides as AFB+ or AFB-. By default, these scanners acquire a single-layer WSI. However, some scanners can acquire a multilayer WSI with a z-stack and an extended focus image layer embedded. We developed a parameterized WSI classification pipeline to assess whether multilayer imaging improves ZN-stained slide classification accuracy. A CNN built into the pipeline classified tiles in each image layer to form an AFB probability score heatmap. Features extracted from the heatmap were then entered into a WSI classifier. 46 AFB+ and 88 AFB- single-layer WSIs were used for the classifier training. 15 AFB+ (with rare microorganisms) and 5 AFB- multilayer WSIs comprised the test set. Parameters in the pipeline included: (a) a WSI representation: z-stack of image layers, middle image layer (a single image layer equivalent) or an extended focus image layer, (b) 4 methods of aggregating AFB probability scores across the z-stack, (c) 3 classifiers, (d) 3 AFB probability thresholds, and (e) 9 feature vector types extracted from the aggregated AFB probability heatmaps. Balanced accuracy (BACC) was used to measure the pipeline performance for all parameter combinations. Analysis of Covariance (ANCOVA) was used to statistically evaluate the effect of each parameter on the BACC. After adjusting for other factors, a significant effect of the WSI representation (p-value < 1.99E-76), classifier type (p-value < 1.73E-21), and AFB threshold (p-value = 0.003) was observed on the BACC. The feature type had no significant effect (p-value = 0.459) on the BACC. WSIs represented by the middle layer, extended focus layer and the z-stack followed by the weighted averaging of AFB probability scores were classified with the average BACC of 58.80%, 68.64%, and 77.28%, respectively. The multilayer WSIs represented by the z-stack with the weighted averaging of AFB probability scores were classified by a Random Forest classifier with the average BACC of 83.32%. Low classification accuracy of WSIs represented by the middle layer suggests that they contain fewer features permitting identification of AFB than the multilayer WSIs. Our results indicate that single-layer acquisition can introduce a bias (sampling error) into the WSI. This bias can be mitigated by the multilayer or the extended focus acquisitions.

Authors

  • Karolina Nurzynska
    Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Technology, Gliwice, Poland.
  • Dalin Li
    Cedars Sinai Medical Center, Inflammatory Bowel & Immunobiology Research Institute, Los Angeles, CA, USA.
  • Ann E Walts
    Department of Pathology & Laboratory Medicine, Cedars-Sinai Medical Center, Los Angeles, CA, United States of America.
  • Arkadiusz Gertych