Leveraging spatial residual attention and temporal Markov networks for video action understanding.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

The effective use of temporal relationships while extracting fertile spatial features is the key to video action understanding. Video action understanding is a challenging visual task because it generally necessitates not only the features of individual key frames but also the contextual understanding of the entire video and the relationships among key frames. Temporal relationships pose a challenge to video action understanding. However, existing 3D convolutional neural network approaches are limited, with a great deal of redundant spatial and temporal information. In this paper, we present a novel two-stream approach that incorporates Spatial Residual Attention and Temporal Markov (SRATM) to learn complementary features to achieve stronger video action understanding performance. Specifically, the proposed SRATM consists of spatial residual attention and temporal Markov. Firstly, the spatial residual attention network captures effective spatial feature representation. Further, the temporal Markov network enhances the model by learning the temporal relationships via conducting probabilistic logic calculation among frames in a video. Finally, we conduct extensive experiments on four video action datasets, namely, Something-Something-V1, Something-Something-V2, Diving48, and Mini-Kinetics, show that the proposed SRATM method achieves competitive results.

Authors

  • Yangyang Xu
    Department of Urology, Beijing Tsinghua Changgung Hospital, School of Clinical Medicine, Tsinghua University, Beijing, China.
  • Zengmao Wang
    School of Computer Science, Wuhan University, Wuhan, 430072, China; Hubei LuoJia Laboratory, Wuhan, 430072, China; Artificial Intelligence Institute of Wuhan University, Wuhan, 430072, China; National Engineering Research Center for Multimedia Software, Wuhan, 430072, China. Electronic address: wangzengmao@whu.edu.cn.
  • Xiaoping Zhang
    China Academy of Chinese Medicine Sciences, Beijing, 100070, China.