A Human-Machine Agent Based on Active Reinforcement Learning for Target Classification in Wargame.

Journal: IEEE transactions on neural networks and learning systems
PMID:

Abstract

To meet the requirements of high accuracy and low cost of target classification in modern warfare, and lay the foundation for target threat assessment, the article proposes a human-machine agent for target classification based on active reinforcement learning (TCARL_H-M), inferring when to introduce human experience guidance for model and how to autonomously classify detected targets into predefined categories with equipment information. To simulate different levels of human guidance, we set up two modes for the model: the easier-to-obtain but low-value-type cues simulated by Mode 1 and the labor-intensive but high-value class labels simulated by Mode 2. In addition, to analyze the respective roles of human experience guidance and machine data learning in target classification tasks, the article proposes a machine-based learner (TCARL_M) with zero human participation and a human-based interventionist with full human guidance (TCARL_H). Finally, based on the simulation data from a wargame, we carried out performance evaluation and application analysis for the proposed models in terms of target prediction and target classification, respectively, and the obtained results demonstrate that TCARL_H-M can not only greatly save labor costs, but achieve more competitive classification accuracy compared with our TCARL_M, TCARL_H, a purely supervised model-long short-term memory network (LSTM), a classic active learning algorithm-Query By Committee (QBC), and the common active learning model-uncertainty sampling (Uncertainty).

Authors

  • Li Chen
    Department of Endocrinology and Metabolism, Qilu Hospital, Shandong University, Jinan, China.
  • Yulong Zhang
    State Key Laboratory of Industrial Control Technology, College of Control Science and Engineering, Zhejiang University, Hangzhou 310027, China.
  • YangHe Feng
    Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha, Hunan, China.
  • Longfei Zhang
    Collaborative Innovation Center of Assessment for Basic Education Quality, Beijing Normal University, No. 19, Xin Jie Kou Wai Street, Hai Dian District, Beijing, 100875, China.
  • Zhong Liu
    Science and Technology on Information Systems Engineering Laboratory, College of Information System and Management, National University of Defense Technology, Changsha, Hunan, China.