Learning to Match Anchors for Visual Object Detection.

Journal: IEEE transactions on pattern analysis and machine intelligence
Published Date:

Abstract

Modern CNN-based object detectors assign anchors for ground-truth objects under the restriction of object-anchor Intersection-over-Union (IoU). In this study, we propose a learning-to-match (LTM) method to break IoU restriction, allowing objects to match anchors in a flexible manner. LTM updates hand-crafted anchor assignment to "free" anchor matching by formulating detector training in the Maximum Likelihood Estimation (MLE) framework. During the training phase, LTM is implemented by converting the detection likelihood to anchor matching loss functions which are plug-and-play. Minimizing the matching loss functions drives learning and selecting features which best explain a class of objects with respect to both classification and localization. LTM is extended from anchor-based detectors to anchor-free detectors, validating the general applicability of learnable object-feature matching mechanism for visual object detection. Experiments on MS COCO dataset demonstrate that LTM detectors consistently outperform counterpart detectors with significant margins. The last but not the least, LTM requires negligible computational cost in both training and inference phases as it does not involve any additional architecture or parameter. Code has been made publicly available.

Authors

  • Xiaosong Zhang
    Data Mining Lab, University of Electronic Science and Technology of China, 611731 Chengdu, China; School of Computer Science and Engineering, University of Electronic Science and Technology of China, 611731 Chengdu, China.
  • Fang Wan
    INSA LYON, Université Lyon2, Université Claude Bernard Lyon1, Université Jean Monnet Saint-Etienne, DISP UR4570, France. Electronic address: 1140293340@qq.com.
  • Chang Liu
    Key Lab of Cell Differentiation and Apoptosis of Ministry of Education, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
  • Xiangyang Ji
    Department of Automation, Tsinghua University, Main building, Haidian District, Beijing 100084, People's Republic of China.
  • Qixiang Ye