Identifying and training deep learning neural networks on biomedical-related datasets.

Journal: Briefings in bioinformatics
PMID:

Abstract

This manuscript describes the development of a resources module that is part of a learning platform named 'NIGMS Sandbox for Cloud-based Learning' https://github.com/NIGMS/NIGMS-Sandbox. The overall genesis of the Sandbox is described in the editorial NIGMS Sandbox at the beginning of this Supplement. This module delivers learning materials on implementing deep learning algorithms for biomedical image data in an interactive format that uses appropriate cloud resources for data access and analyses. Biomedical-related datasets are widely used in both research and clinical settings, but the ability for professionally trained clinicians and researchers to interpret datasets becomes difficult as the size and breadth of these datasets increases. Artificial intelligence, and specifically deep learning neural networks, have recently become an important tool in novel biomedical research. However, use is limited due to their computational requirements and confusion regarding different neural network architectures. The goal of this learning module is to introduce types of deep learning neural networks and cover practices that are commonly used in biomedical research. This module is subdivided into four submodules that cover classification, augmentation, segmentation and regression. Each complementary submodule was written on the Google Cloud Platform and contains detailed code and explanations, as well as quizzes and challenges to facilitate user training. Overall, the goal of this learning module is to enable users to identify and integrate the correct type of neural network with their data while highlighting the ease-of-use of cloud computing for implementing neural networks. This manuscript describes the development of a resource module that is part of a learning platform named ``NIGMS Sandbox for Cloud-based Learning'' https://github.com/NIGMS/NIGMS-Sandbox. The overall genesis of the Sandbox is described in the editorial NIGMS Sandbox [1] at the beginning of this Supplement. This module delivers learning materials on the analysis of bulk and single-cell ATAC-seq data in an interactive format that uses appropriate cloud resources for data access and analyses.

Authors

  • Alan E Woessner
    Department of Biomedical Engineering, University of Arkansas, Fayetteville, Arkansas, USA.
  • Usman Anjum
    Arkansas Integrative Metabolic Research Center, University of Arkansas, Fayetteville, AR.
  • Hadi Salman
    Arkansas Integrative Metabolic Research Center, University of Arkansas, Fayetteville, AR.
  • Jacob Lear
    Department of Computer Science and Computer Engineering, University of Arkansas, Fayetteville, AR.
  • Jeffrey T Turner
    Health Data and AI, Deloitte Consulting LLP, Arlington VA, USA.
  • Ross Campbell
    Health Data and AI, Deloitte Consulting LLP, Arlington VA, USA.
  • Laura Beaudry
    Google Cloud, Reston VA, USA.
  • Justin Zhan
    Arkansas Integrative Metabolic Research Center, University of Arkansas, Fayetteville, AR.
  • Lawrence E Cornett
    Department of Physiology and Cell Biology, University of Arkansas for Medical Sciences, Little Rock, AR.
  • Susan Gauch
    Department of Computer Science and Computer Engineering, University of Arkansas, Fayetteville, AR.
  • Kyle P Quinn
    Department of Biomedical Engineering, Tufts University, Medford, MA, 02155, USA; Department of Biomedical Engineering, University of Arkansas, Fayetteville, AR, 72701, USA.