Investigating the Bilateral Connections in Generative Zero-Shot Learning.

Journal: IEEE transactions on cybernetics
Published Date:

Abstract

Zero-shot learning (ZSL) is a pretty intriguing topic in the computer vision community since it handles novel instances and unseen categories. In a typical ZSL setting, there is a main visual space and an auxiliary semantic space. Most existing ZSL methods handle the problem by learning either a visual-to-semantic mapping or a semantic-to-visual mapping. In other words, they investigate a unilateral connection from one end to the other. However, the connection between the visual space and the semantic space are bilateral in reality, that is, the visual space depicts the semantic space; the semantic space, on the other hand, describes the visual space. In this article, therefore, we investigate the bilateral connections in ZSL and present a novel model, called Boomerang-GAN, by taking advantage of conditional generative adversarial networks (GANs). Specifically, we generate unseen visual samples from their category semantic embeddings by a conditional GAN. Different from the existing generative ZSL methods that only consider generating visual features from class descriptions, our method also considers that the generated visual features can be translated back to their corresponding semantic embeddings by introducing a multimodal cycle-consistent loss. Extensive experiments of both ZSL and generalized ZSL on five widely used datasets verify that our method is able to outperform previous state-of-the-art approaches in both recognition and segmentation tasks.

Authors

  • Jingjing Li
    Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, 430074, China.
  • Mengmeng Jing
    School of Computer Science and Engineering, University of Electronic Science and Technology of China, China.
  • Ke Lu
    University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing 100049, China. Electronic address: luk@ucas.ac.cn.
  • Lei Zhu
    School of Civil and Hydraulic Engineering, Ningxia University, Yinchuan, China.
  • Heng Tao Shen
    Center for Future Media, School of Computer Science & Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China; Sichuan Artificial Intelligence Research Institute, Yibin 644000, China. Electronic address: shenhengtao@uestc.edu.cn.