KLSANet: Key local semantic alignment Network for few-shot image classification.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Few-shot image classification involves recognizing new classes with a limited number of labeled samples. Current local descriptor-based methods, while leveraging consistent low-level features across visible and invisible classes, face challenges including redundant adjacent information, irrelevant partial representation, and limited interpretability. This paper proposes KLSANet, a few-shot image classification approach based on key local semantic alignment network, which aligns key local semantics for accurate classification. Furthermore, we introduce a key local screening module to mitigate the influence of semantically irrelevant image parts on classification. KLSANet demonstrates superior performance on three benchmark datasets (CUB, Stanford Dogs, Stanford Cars), outperforming state-of-the-art methods in 1-shot and 5-shot settings with average improvements of 3.95% and 2.56% respectively. Visualization experiments demonstrate the interpretability of KLSANet predictions. Code is available at: https://github.com/ZitZhengWang/KLSANet.

Authors

  • Zhe Sun
    Centre for Atmospheric Science, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK.
  • Wang Zheng
    Department of Information Science and Engineering, Yanshan University, Hebei Street, Qinhuangdao, Hebei, China.
  • Pengfei Guo
    Department of Environmental Science and Engineering, College of Chemical Engineering, Beijing University of Chemical Technology, Beijing 100029, China.