Fine-Grained Image Analysis With Deep Learning: A Survey.

Journal: IEEE transactions on pattern analysis and machine intelligence
PMID:

Abstract

Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer vision and pattern recognition, and underpins a diverse set of real-world applications. The task of FGIA targets analyzing visual objects from subordinate categories, e.g., species of birds or models of cars. The small inter-class and large intra-class variation inherent to fine-grained image analysis makes it a challenging problem. Capitalizing on advances in deep learning, in recent years we have witnessed remarkable progress in deep learning powered FGIA. In this paper we present a systematic survey of these advances, where we attempt to re-define and broaden the field of FGIA by consolidating two fundamental fine-grained research areas - fine-grained image recognition and fine-grained image retrieval. In addition, we also review other key issues of FGIA, such as publicly available benchmark datasets and related domain-specific applications. We conclude by highlighting several research directions and open problems which need further exploration from the community.

Authors

  • Xiu-Shen Wei
  • Yi-Zhe Song
  • Oisin Mac Aodha
  • Jianxin Wu
    Department of Biochemistry, Capital Institute of Pediatrics, Beijing 100020, China. Electronic address: jianxinwu_2000@163.com.
  • Yuxin Peng
  • Jinhui Tang
  • Jian Yang
    Drug Discovery and Development Research Group, College of Pharmacy and Nutrition, University of Saskatchewan, Saskatoon, SK, Canada.
  • Serge Belongie