Learning to Match Anchors for Visual Object Detection.

Journal: IEEE transactions on pattern analysis and machine intelligence

Published Date: May 5, 2022

Abstract

Modern CNN-based object detectors assign anchors for ground-truth objects under the restriction of object-anchor Intersection-over-Union (IoU). In this study, we propose a learning-to-match (LTM) method to break IoU restriction, allowing objects to match anchors in a flexible manner. LTM updates hand-crafted anchor assignment to "free" anchor matching by formulating detector training in the Maximum Likelihood Estimation (MLE) framework. During the training phase, LTM is implemented by converting the detection likelihood to anchor matching loss functions which are plug-and-play. Minimizing the matching loss functions drives learning and selecting features which best explain a class of objects with respect to both classification and localization. LTM is extended from anchor-based detectors to anchor-free detectors, validating the general applicability of learnable object-feature matching mechanism for visual object detection. Experiments on MS COCO dataset demonstrate that LTM detectors consistently outperform counterpart detectors with significant margins. The last but not the least, LTM requires negligible computational cost in both training and inference phases as it does not involve any additional architecture or parameter. Code has been made publicly available.

Authors

Xiaosong Zhang

Data Mining Lab, University of Electronic Science and Technology of China, 611731 Chengdu, China; School of Computer Science and Engineering, University of Electronic Science and Technology of China, 611731 Chengdu, China.
Fang Wan

INSA LYON, Université Lyon2, Université Claude Bernard Lyon1, Université Jean Monnet Saint-Etienne, DISP UR4570, France. Electronic address: 1140293340@qq.com.
Chang Liu

Key Lab of Cell Differentiation and Apoptosis of Ministry of Education, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Xiangyang Ji

Department of Automation, Tsinghua University, Main building, Haidian District, Beijing 100084, People's Republic of China.
Qixiang Ye

Keywords

Algorithms Neural Networks, Computer

External Resources

View on PubMed Access via DOI PubMed (33434120)

Learning to Match Anchors for Visual Object Detection.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

Learning to Match Anchors for Visual Object Detection.

Abstract

Authors

Keywords

External Resources

Stay Ahead of Medical AI

Popular Topics

Recent Journals