Return of the normal distribution: Flexible deep continual learning with variational auto-encoders.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Learning continually from sequentially arriving data has been a long standing challenge in machine learning. An emergent body of deep learning literature suggests various solutions, through introduction of significant simplifications to the problem statement. As a consequence of a growing focus on particular tasks and their respective benchmark assumptions, these efforts are thus becoming increasingly tailored to specific settings. Whereas approaches that leverage Variational Bayesian techniques seem to provide a more general perspective of key continual learning mechanisms, they however entail their own caveats. Inspired by prior theoretical work on solving the prevalent mismatch between prior and aggregate posterior in deep generative models, we return to a generic variational auto-encoder based formulation and investigate its utility for continual learning. Specifically, we propose to adapt a two-stage training framework towards a context conditioned variant for continual learning, where we then formulate mechanisms to alleviate catastrophic forgetting through choices of generative rehearsal or well-motivated extraction of data exemplar subsets. Although the proposed generic two-stage variational auto-encoder is not tailored towards a particular task and allows for flexible amounts of supervision, we empirically demonstrate it to surpass task-tailored methods in both supervised classification, as well as unsupervised representation learning.

Authors

  • Yongwon Hong
    Department of Computer Science, Yonsei University, Seoul, Republic of Korea. Electronic address: yhong@yonsei.ac.kr.
  • Martin Mundt
    Department of Computer Science, TU Darmstadt and Hessian Center for Artificial Intelligence (hessian.AI), Darmstadt, Germany. Electronic address: martin.mundt@tu-darmstadt.de.
  • Sungho Park
    Department of Transportation System Engineering, Ajou University, Suwon, Republic of Korea.
  • Yungjung Uh
    Applied Information Engineering, Yonsei University, Seoul, Republic of Korea. Electronic address: yj.uh@yonsei.ac.kr.
  • Hyeran Byun
    Department of Computer Science, Yonsei University, Seoul, Republic of Korea. Electronic address: hrbyun@yonsei.ac.kr.