Correspondence between neuroevolution and gradient descent.

Journal: Nature communications
Published Date:

Abstract

We show analytically that training a neural network by conditioned stochastic mutation or neuroevolution of its weights is equivalent, in the limit of small mutations, to gradient descent on the loss function in the presence of Gaussian white noise. Averaged over independent realizations of the learning process, neuroevolution is equivalent to gradient descent on the loss function. We use numerical simulation to show that this correspondence can be observed for finite mutations, for shallow and deep neural networks. Our results provide a connection between two families of neural-network training methods that are usually considered to be fundamentally different.

Authors

  • Stephen Whitelam
    Molecular Foundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, Califronia 94720, USA.
  • Viktor Selin
    Department of Physics, University of Ottawa, Ottawa, ON, K1N 6N5, Canada.
  • Sang-Won Park
    Molecular Foundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA.
  • Isaac Tamblyn
    National Research Council of Canada Ottawa, Ontario K1N 5A2, Canada Vector Institute for Artificial Intelligence Toronto, Ontario M5G 1M1, Canada.