Memorizing Structure-Texture Correspondence for Image Anomaly Detection.

Journal: IEEE transactions on neural networks and learning systems
Published Date:

Abstract

This work focuses on image anomaly detection by leveraging only normal images in the training phase. Most previous methods tackle anomaly detection by reconstructing the input images with an autoencoder (AE)-based model, and an underlying assumption is that the reconstruction errors for the normal images are small, and those for the abnormal images are large. However, these AE-based methods, sometimes, even reconstruct the anomalies well; consequently, they are less sensitive to anomalies. To conquer this issue, we propose to reconstruct the image by leveraging the structure-texture correspondence. Specifically, we observe that, usually, for normal images, the texture can be inferred from its corresponding structure (e.g., the blood vessels in the fundus image and the structured anatomy in optical coherence tomography image), while it is hard to infer the texture from a destroyed structure for the abnormal images. Therefore, a structure-texture correspondence memory (STCM) module is proposed to reconstruct image texture from its structure, where a memory mechanism is used to characterize the mapping from the normal structure to its corresponding normal texture. As the correspondence between destroyed structure and texture cannot be characterized by the memory, the abnormal images would have a larger reconstruction error, facilitating anomaly detection. In this work, we utilize two kinds of complementary structures (i.e., the semantic structure with human-labeled category information and the low-level structure with abundant details), which are extracted by two structure extractors. The reconstructions from the two kinds of structures are fused together by a learned attention weight to get the final reconstructed image. We further feed the reconstructed image into the two aforementioned structure extractors to extract structures. On the one hand, constraining the consistency between the structures extracted from the original input and that from the reconstructed image would regularize the network training; on the other hand, the error between the structures extracted from the original input and that from the reconstructed image can also be used as a supplement measurement to identify the anomaly. Extensive experiments validate the effectiveness of our method for image anomaly detection on both industrial inspection images and medical images.

Authors

  • Kang Zhou
    School of Traffic and Transportation, Beijing Jiaotong University, Beijing 100044, China.
  • Jing Li
    Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin, China.
  • Yuting Xiao
  • Jianlong Yang
    Cixi Institute of Biomedical Engineering, Ningbo Institute of Materials Technology and Engineering, Chinese Academy of Sciences, Ningbo, Zhejiang, China.
  • Jun Cheng
    School of Electrical and Information Technology, Yunnan Minzu University, Kunming, Yunnan 650500, PR China. Electronic address: jcheng6819@126.com.
  • Wen Liu
    Department of Dermatology, Air Force General Hospital, PLA Beijing 100142, China.
  • Weixin Luo
  • Jiang Liu
    Department of Pharmacy, The Fourth Hospital of Hebei Medical University Shijiazhuang 050000, Hebei, China.
  • Shenghua Gao