Enhancing consistency and mitigating bias: A data replay approach for incremental learning.

Journal: Neural networks : the official journal of the International Neural Network Society

Abstract

Deep learning systems are prone to catastrophic forgetting when learning from a sequence of tasks, because data from previous tasks is unavailable when a new task is learned. To address this, some methods replay data from previous tasks while learning the new task, typically using extra memory to store the replay data. In practice, however, storing old data is often infeasible because of memory constraints and data privacy issues. Data-free replay methods instead invert samples from the classification model. While effective, these methods suffer from inconsistencies between the inverted data and the real training data, an issue overlooked in recent work. We therefore propose to measure data consistency quantitatively under some simplifying assumptions. This measurement gives us the insight to develop a novel loss function that reduces the inconsistency. Specifically, the loss minimizes the KL divergence between the distributions of inverted and real data under a tied multivariate Gaussian assumption, and it is simple to implement in continual learning. Additionally, we observe that the weight norms of old classes decrease continually as learning progresses. We analyze the reasons and propose a regularization term that balances the class weights, making old-class samples more distinguishable. Combining these components, we introduce Consistency-enhanced data replay with a Debiased classifier for class incremental learning (CwD). Extensive experiments on CIFAR-100, Tiny-ImageNet, and ImageNet100 show that CwD consistently outperforms previous approaches.
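
The KL term mentioned in the abstract can be illustrated with a small sketch. It is not the authors' implementation; it only uses the standard closed form for two Gaussians that share one covariance matrix, KL(N(mu_p, Sigma) || N(mu_q, Sigma)) = 0.5 (mu_p - mu_q)^T Sigma^{-1} (mu_p - mu_q). The function names `solve` and `tied_gaussian_kl`, and the choice of plain Python lists for the feature means, are assumptions made for illustration.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination (partial pivoting)."""
    n = len(b)
    # Work on an augmented copy so the caller's matrix is not modified.
    m = [list(row) + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("covariance matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def tied_gaussian_kl(mu_p, mu_q, cov):
    """KL divergence between N(mu_p, cov) and N(mu_q, cov) with a shared covariance.

    Hypothetical helper: mu_p could be a class mean estimated from inverted
    samples and mu_q the mean of the real data for the same class.
    """
    diff = [p - q for p, q in zip(mu_p, mu_q)]
    z = solve(cov, diff)  # z = cov^{-1} diff, computed without an explicit inverse
    return 0.5 * sum(d * v for d, v in zip(diff, z))


if __name__ == "__main__":
    shared_cov = [[2.0, 1.0], [1.0, 2.0]]
    print(tied_gaussian_kl([1.0, 1.0], [0.0, 0.0], shared_cov))
```

Because the covariance is tied, the log-determinant and trace terms of the general Gaussian KL cancel, so the divergence reduces to half the squared Mahalanobis distance between the two means. Under this assumption the quantity is symmetric in the two distributions and is zero exactly when the means coincide.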

Authors

  • Chenyang Wang
    Burning Rock Biotech, Guangzhou, China.
  • Junjun Jiang
    Guangxi Key Laboratory of AIDS Prevention and Treatment & Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, School of Public Health, Guangxi Medical University, Nanning 530021, Guangxi, China.
  • Xingyu Hu
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China. Electronic address: huxingyu@hit.edu.cn.
  • Xianming Liu
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China. Electronic address: csxm@hit.edu.cn.
  • Xiangyang Ji
    Department of Automation, Tsinghua University, Main building, Haidian District, Beijing 100084, People's Republic of China.