Emergence of integrated behaviors through direct optimization for homeostasis.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Homeostasis is a self-regulatory process, wherein an organism maintains a specific internal physiological state. Homeostatic reinforcement learning (RL) is a framework recently proposed in computational neuroscience to explain animal behavior. Homeostatic RL organizes the behaviors of autonomous embodied agents according to the demands of the internal dynamics of their bodies, coupled with the external environment. Thus, it provides a basis for real-world autonomous agents, such as robots, to continually acquire and learn integrated behaviors for survival. However, prior studies have generally explored problems pertaining to limited size, as the agent must handle observations of such coupled dynamics. To overcome this restriction, we developed an advanced method to realize scaled-up homeostatic RL using deep RL. Furthermore, several rewards for homeostasis have been proposed in the literature. We identified that the reward definition that uses the difference in drive function yields the best results. We created two benchmark environments for homeostasis and performed a behavioral analysis. The analysis showed that the trained agents in each environment changed their behavior based on their internal physiological states. Finally, we extended our method to address vision using deep convolutional neural networks. The analysis of a trained agent revealed that it has visual saliency rooted in the survival environment and internal representations resulting from multimodal input.

Authors

  • Naoto Yoshida
    Graduate School of Information Science and Technology, The University of Tokyo, Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan; International Research Center for Neurointelligence (WPI-IRCN), Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan. Electronic address: n-yoshida@isi.imi.i.u-tokyo.ac.jp.
  • Tatsuya Daikoku
    International Research Center for Neurointelligence (WPI-IRCN), Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan.
  • Yukie Nagai
    National Institute of Information and Communications Technology , Suita, Osaka 565-0871 , Japan.
  • Yasuo Kuniyoshi
    Next Generation Artificial Intelligence Research Center & School of Information Science and Technology, The University of Tokyo , 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656 , Japan.