Deep Learning for Reintegrating Biology.

Journal: Integrative and comparative biology
Published Date:

Abstract

The goal of this vision paper is to investigate the possible role that advanced machine learning techniques, especially deep learning (DL), could play in the reintegration of various biological disciplines. To achieve this goal, a series of operational, but admittedly very simplistic, conceptualizations have been introduced: Life has been taken as a multidimensional phenomenon that inhabits three physical dimensions (time, space, and scale) and biological research as establishing connection between different points in the domain of life. Each of these points hence denotes a position in time, space, and scale at which a life phenomenon of interest takes place. Using these conceptualizations, fragmentation of biology can be seen as the result of too few and especially too short-ranged connections. Reintegrating biology could then be accomplished by establishing more, longer ranged connections. DL methods appear to be very well suited for addressing this particular need at this particular time. Notwithstanding the numerous unsubstantiated claims regarding the capabilities of AI, DL networks represent a major advance in the ability to find complex relationships inside large data sets that would have not been accessible with traditional data analytic methods or to a human observer. In addition, ongoing advances in the automation of taking measurements from phenomena on all levels of biological organization continue to increase the number of large quantitative data sets that are available. These increasingly common data sets could serve as anchor points for making long-range connections by virtue of DL. However, connections within the domain of life are likely to be structured in a highly nonuniform fashion and hence it is necessary to develop methods, for example, theoretical, computational, and experimental, to determine linkage of biological data sets most likely to provide useful insights on a biological problem using DL. Finally, specific DL approaches and architectures should be developed to match the needs of reintegrating biology.

Authors

  • Rolf Müller
    Department of Microbial Natural Products, Helmholtz-Institute for Pharmaceutical Research Saarland, Helmholtz Centre for Infection Research and Pharmaceutical Biotechnology, Saarland University , 66123 Saarbrücken, Germany.
  • Jin-Ping Han
    IBM TJ Watson Research Center, Ossining, NY, USA.
  • Sriram Chandrasekaran
    Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI, USA.
  • Paul Bogdan
    Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, United States.