RepCo: Replenish sample views with better consistency for contrastive learning.

Journal: Neural networks : the official journal of the International Neural Network Society

Published Date: Sep 11, 2023

Abstract

Contrastive learning methods aim to learn shared representations by minimizing distances between positive pairs, and maximizing distances between negative pairs in the embedding space. To achieve better performance of contrastive learning, one of the key problems is to design appropriate sample pairs. In most previous works, random cropping on the input image is utilized to obtain two views as positive pairs. However, such strategies lead to suboptimal performance since the sampled crops may have inconsistent semantic information, which consequently degrades the quality of contrastive views. To address this limitation, we explore to replenish sample views with better consistency of the image and propose a novel self-supervised learning (SSL) framework RepCo. Instead of searching for semantically consistent patches between two different views, we select patches on the same image as the replenishment of positive/negative pairs, encourage patches that are similar but come from different positions as positive pairs, and force patches that are dissimilar but come from adjacent positions to have different representations, i.e. construct negative pairs to enrich the learned representations. Our method effectively generates high-quality contrastive views, explores the untapped semantic consistency on images, and provides more informative representations for downstream tasks. Experiments on adequate downstream tasks have shown that, our approach achieves ＋2.1 AP (COCO pre-trained) and ＋1.6 AP (ImageNet pre-trained) gains on Pascal VOC object detection, ＋2.3 mIoU gains on Cityscapes semantic segmentation, respectively.

Authors

Xinyu Lei
Longjun Liu

College of Artificial Intelligence, Xi'an Jiaotong University, Xi'an, Shaanxi, China. Electronic address: liulongjun@xjtu.edu.cn.
Yi Zhang

Department of Thyroid Surgery, China-Japan Union Hospital of Jilin University, Jilin University, Changchun, China.
Puhang Jia

National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center of Visual Information and Applications, Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, Xi'an, 710049, Shaanxi, China.
Haonan Zhang

Electronic Information School, Wuhan University, Wuhan 430064, China.
Nanning Zheng

Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, Xi'an, 710049, China.

Keywords

Learning Neural Networks, Computer Semantics

External Resources

View on PubMed Access via DOI PubMed (37757725)

RepCo: Replenish sample views with better consistency for contrastive learning.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals