Simple and Effective: Spatial Rescaling for Person Reidentification.

Journal: IEEE transactions on neural networks and learning systems
Published Date:

Abstract

Global average pooling (GAP) allows convolutional neural networks (CNNs) to localize discriminative information for recognition using only image-level labels. While GAP helps CNNs to attend to the most discriminative features of an object, e.g., head of a bird or one man's bag, it may suffer if that information is missing due to camera viewpoint changes and intraclass variations in some tasks. To circumvent this issue, we propose one new module to help CNNs to see more, namely, Spatial Rescaling (SpaRs) layer. It introduces spatial relations among the feature map activations back to the model, guiding the model to focus on a broad area in the feature map. With simple implementation, it can be inserted into CNNs of various architectures directly. SpaRs layer consistently improves the performance over the reidentification (re-ID) models. Besides, the new module based on different normalization methods also demonstrates the superiority of fine-grained and general image classification benchmarks. The visualization method shows the changes in activated regions when equipped with the SpaRs layer for better understanding. Our code is publicly available at https://github.com/HRanWang/Spatial-Re-Scaling.

Authors

  • Haoran Wang
    Department of Urology, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, 510700, China.
  • Licheng Jiao
  • Shuyuan Yang
    Key Lab of Intelligent Perception and Image Understanding of Ministry of Education, Xidian University, Xi'an, Shaanxi, 710071, China.
  • Lingling Li
    College of Biological Science and Engineering, Fuzhou University, No. 2 Xue Yuan Road, University Town, Fuzhou, Fujian 350108, China.
  • Zexin Wang