Optimization of deep learning models for inference in low resource environments.

Journal: Computers in biology and medicine
Published Date:

Abstract

Artificial Intelligence (AI), and particularly deep learning (DL), has shown great promise to revolutionize healthcare. However, clinical translation is often hindered by demanding hardware requirements. In this study, we assess the effectiveness of optimization techniques for DL models in healthcare applications, targeting varying AI workloads across the domains of radiology, histopathology, and medical RGB imaging, while evaluating across hardware configurations. The assessed AI workloads focus on both segmentation and classification workloads, by virtue of brain extraction in Magnetic Resonance Imaging (MRI), colorectal cancer delineation in Hematoxylin & Eosin (H&E) stained digitized tissue sections, and diabetic foot ulcer classification in RGB images. We quantitatively evaluate model performance in terms of model runtime during inference (including speedup, latency, and memory usage) and model utility on unseen data. Our results demonstrate that optimization techniques can substantially improve model runtime, without compromising model utility. These findings suggest that optimization techniques can facilitate the clinical translation of AI models in low-resource environments, making them more practical for real-world healthcare applications even in underserved regions.

Authors

  • Siddhesh Thakur
    Division of Computational Pathology, Department of Pathology and Laboratory Medicine, Indiana University School of Medicine, Indianapolis, IN, USA.
  • Sarthak Pati
    Perelman School of Medicine, Philadelphia, PA, USA.
  • Junwen Wu
    Intel Corporation, Santa Clara, CA, USA. Electronic address: junwen.wu@intel.com.
  • Ravi Panchumarthy
    Intel Corporation, Santa Clara, CA, USA. Electronic address: ravi.panchumarthy@intel.com.
  • Deepthi Karkada
    Intel Corporation, Santa Clara, CA, USA. Electronic address: karkada.deepthi@intel.com.
  • Alexander Kozlov
    Department of Neuroscience, Karolinska Institutet, SE-17177 Stockholm, Sweden.
  • Vasily Shamporov
    Intel Corporation, Santa Clara, CA, USA. Electronic address: vasily.shamporov@intel.com.
  • Alexander Suslov
    Intel Corporation, Santa Clara, CA, USA. Electronic address: alexander.suslov@intel.com.
  • Daniil Lyakhov
    Intel Corporation, Santa Clara, CA, USA. Electronic address: daniil.lyakhov@intel.com.
  • Maksim Proshin
    Intel Corporation, Santa Clara, CA, USA. Electronic address: maksim.proshin@intel.com.
  • Prashant Shah
    Intel, Santa Clara, CA, USA.
  • Dimitrios Makris
    Department of Computer Science, School of Computer Science & Mathematics (CSM), Kingston University, London, UK. Electronic address: d.makris@kingston.ac.uk.
  • Spyridon Bakas
    Perelman School of Medicine, Philadelphia, PA, USA.