A wholistic view of continual learning with deep neural networks: Forgotten lessons and the bridge to active and open world learning.

Journal: Neural Networks: the official journal of the International Neural Network Society
Published Date:

Abstract

Current deep learning methods are regarded as favorable if they empirically perform well on dedicated test sets. This mentality is seamlessly reflected in the resurfacing area of continual learning, where consecutively arriving data is investigated. The core challenge is framed as protecting previously acquired representations from being catastrophically forgotten. However, individual methods are still compared in isolation from the real world, by monitoring accumulated benchmark test set performance. The closed world assumption remains predominant, i.e., models are evaluated on data that is guaranteed to originate from the same distribution as the training data. This poses a massive challenge, as neural networks are well known to provide overconfident false predictions on unknown and corrupted instances. In this work, we critically survey the literature and argue that notable lessons from open set recognition (identifying unknown examples outside of the observed set) and the adjacent field of active learning (querying data to maximize the expected performance gain) are frequently overlooked in the deep learning era. Hence, we propose a consolidated view to bridge continual learning, active learning, and open set recognition in deep neural networks. Finally, the established synergies are supported empirically, showing joint improvement in alleviating catastrophic forgetting, querying data, and selecting task orders, while exhibiting robust open world application.

Authors

  • Martin Mundt
    Department of Computer Science, TU Darmstadt and Hessian Center for Artificial Intelligence (hessian.AI), Darmstadt, Germany. Electronic address: martin.mundt@tu-darmstadt.de.
  • Yongwon Hong
    Department of Computer Science, Yonsei University, Seoul, Republic of Korea. Electronic address: yhong@yonsei.ac.kr.
  • Iuliia Pliushch
    Department of Computer Science, Goethe University, Theodor-W.-Adorno-Platz 1, 60323 Frankfurt, Germany. Electronic address: pliushch@em.uni-frankfurt.de.
  • Visvanathan Ramesh
    Department of Computer Science, Goethe University, Theodor-W.-Adorno-Platz 1, 60323 Frankfurt, Germany. Electronic address: vramesh@em.uni-frankfurt.de.