Return of the normal distribution: Flexible deep continual learning with variational auto-encoders.

Journal: Neural Networks: The Official Journal of the International Neural Network Society

Published Date: Oct 1, 2022

Abstract

Learning continually from sequentially arriving data has been a long-standing challenge in machine learning. An emergent body of deep learning literature proposes various solutions by introducing significant simplifications to the problem statement. As a consequence of a growing focus on particular tasks and their respective benchmark assumptions, these efforts are becoming increasingly tailored to specific settings. Approaches that leverage variational Bayesian techniques seem to provide a more general perspective on key continual learning mechanisms, but they entail their own caveats. Inspired by prior theoretical work on resolving the prevalent mismatch between prior and aggregate posterior in deep generative models, we return to a generic variational auto-encoder based formulation and investigate its utility for continual learning. Specifically, we propose to adapt a two-stage training framework into a context-conditioned variant for continual learning, and we formulate mechanisms that alleviate catastrophic forgetting through either generative rehearsal or well-motivated extraction of data exemplar subsets. Although the proposed generic two-stage variational auto-encoder is not tailored to a particular task and allows for flexible amounts of supervision, we empirically demonstrate that it surpasses task-tailored methods in both supervised classification and unsupervised representation learning.
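
For orientation, the mismatch between prior and aggregate posterior mentioned in the abstract can be stated with the standard variational auto-encoder objective. The block below is the generic textbook formulation, not necessarily the paper's notation or its context-conditioned two-stage objective; the symbols (encoder q_phi, decoder p_theta, standard normal prior p(z)) are assumptions introduced here for illustration.

```latex
% Standard VAE evidence lower bound (textbook form; notation assumed, not taken from the paper)
\begin{align}
\log p_\theta(x) &\ge \mathbb{E}_{q_\phi(z \mid x)}\big[\log p_\theta(x \mid z)\big]
  - \mathrm{KL}\big(q_\phi(z \mid x)\,\|\,p(z)\big), \qquad p(z) = \mathcal{N}(0, I) \\
% Aggregate posterior: the encoder distribution averaged over the data distribution
q_\phi(z) &= \mathbb{E}_{p_{\mathrm{data}}(x)}\big[q_\phi(z \mid x)\big]
\end{align}
```

A mismatch is present when the aggregate posterior q_phi(z) differs from the prior p(z) after training. Latent codes sampled from p(z) can then fall in regions the decoder has rarely seen, which would plausibly degrade generated samples and, with them, any generative rehearsal that relies on those samples.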

Authors

Yongwon Hong

Department of Computer Science, Yonsei University, Seoul, Republic of Korea. Electronic address: yhong@yonsei.ac.kr.

Martin Mundt

Department of Computer Science, TU Darmstadt and Hessian Center for Artificial Intelligence (hessian.AI), Darmstadt, Germany. Electronic address: martin.mundt@tu-darmstadt.de.

Sungho Park

Department of Transportation System Engineering, Ajou University, Suwon, Republic of Korea.

Yungjung Uh

Applied Information Engineering, Yonsei University, Seoul, Republic of Korea. Electronic address: yj.uh@yonsei.ac.kr.

Hyeran Byun

Department of Computer Science, Yonsei University, Seoul, Republic of Korea. Electronic address: hrbyun@yonsei.ac.kr.

Keywords

Adaptation, Physiological; Bayes Theorem; Machine Learning; Normal Distribution

External Resources

PubMed ID: 35944369
