Decentralized policy learning with partial observation and mechanical constraints for multiperson modeling.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Extracting the rules of real-world multi-agent behaviors is a current challenge in various scientific and engineering fields. Biological agents independently have limited observation and mechanical constraints; however, most of the conventional data-driven models ignore such assumptions, resulting in lack of biological plausibility and model interpretability for behavioral analyses. Here we propose sequential generative models with partial observation and mechanical constraints in a decentralized manner, which can model agents' cognition and body dynamics, and predict biologically plausible behaviors. We formulate this as a decentralized multi-agent imitation-learning problem, leveraging binary partial observation and decentralized policy models based on hierarchical variational recurrent neural networks with physical and biomechanical penalties. Using real-world basketball and soccer datasets, we show the effectiveness of our method in terms of the constraint violations, long-term trajectory prediction, and partial observation. Our approach can be used as a multi-agent simulator to generate realistic trajectories using real-world data.

Authors

  • Keisuke Fujii
    Graduate School of Informatics, Nagoya University, Nagoya, Aichi, Japan; Center for Advanced Intelligence Project, RIKEN, Osaka, Japan; PRESTO, Japan Science and Technology Agency, Tokyo, Japan. Electronic address: fujii@i.nagoya-u.ac.jp.
  • Naoya Takeishi
    Center for Advanced Intelligence Project, RIKEN, Osaka, Japan; Graduate School of Engineering, The University of Tokyo, Tokyo, Japan.
  • Yoshinobu Kawahara
    Graduate School of Information Science and Technology, Osaka University, 1-5 Yamadaoka, Suita, Osaka, Japan; Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihonbashi, Chuo-ku, Tokyo, Japan.
  • Kazuya Takeda
    Graduate School of Informatics, Nagoya University, Nagoya, Aichi, Japan.