Semantic Mask Reconstruction and Category Semantic Learning for few-shot image generation.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Few-shot image generation aims at generating novel images for the unseen category when given K images from the same category. Despite significant advancements in existing few-shot image generation methods, great challenges remain regarding the quality and diversity of the generated images. This issue stems from the model's struggle to fully comprehend the semantic content of images and extract sufficiently semantic representations. To address these issues, we propose a semantic mask reconstruction (SMR) and category semantic learning (CSL) method for few-shot image generation. Specifically, SMR performs mask reconstruction in a high-level semantic space and designs a strategy for dynamically adjusting the mask ratio, which increases the difficulty of the generation tasks by gradually increasing the mask ratio to enhance the learning ability of the discriminator, thereby prompting the generator to learn more critical features relevant to the generation task. In addition, CSL introduces a triplet loss to optimize the distance between the generated image, its corresponding input image, and input images of other categories. This encourages the generative model to discern subtle differences between categories, thereby achieving more fine-grained generation and improving the fidelity of generated images. Both SMR and CSL can function as plug-and-play modules. Extensive experimental results across three standard datasets demonstrate that the SMR-CSL outperforms other methods in terms of the quality and diversity of the generated images. Furthermore, the results of downstream classification experiments verify that the images generated by the proposed method can effectively assist downstream classification tasks.

Authors

  • Ting Xiao
    Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.
  • Yunjie Cai
    Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai, 200237, China. Electronic address: y30231031@mail.ecust.edu.cn.
  • Jiaoyan Guan
    Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai, 200237, China. Electronic address: y80210029@mail.ecust.edu.cn.
  • Zhe Wang
    Department of Pathology, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen 518033, China.