Continual learning in the presence of repetition.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Continual learning (CL) provides a framework for training models in ever-evolving environments. Although re-occurrence of previously seen objects or tasks is common in real-world problems, the concept of repetition in the data stream is not often considered in standard benchmarks for CL. Unlike with the rehearsal mechanism in buffer-based strategies, where sample repetition is controlled by the strategy, repetition in the data stream naturally stems from the environment. This report provides a summary of the CLVision challenge at CVPR 2023, which focused on the topic of repetition in class-incremental learning. The report initially outlines the challenge objective and then describes three solutions proposed by finalist teams that aim to effectively exploit the repetition in the stream to learn continually. The experimental results from the challenge highlight the effectiveness of ensemble-based solutions that employ multiple versions of similar modules, each trained on different but overlapping subsets of classes. This report underscores the transformative potential of taking a different perspective in CL by employing repetition in the data stream to foster innovative strategy design.

Authors

  • Hamed Hemati
    Institute for Computer Science, University of St. Gallen, Rosenbergstrasse 30, St. Gallen, 9000, Switzerland. Electronic address: hemati.hmd@gmail.com.
  • Lorenzo Pellegrini
    Department of Computer Science, University of Bologna, Via dell'Università 50, Cesena, 47521, Italy.
  • Xiaotian Duan
    The University of Chicago, 5801 S Ellis Ave, Chicago, 60637, United States; Argonne National Laboratory, 9700 S Cass Ave, Lemont, 60439, United States.
  • Zixuan Zhao
    School of Public Health, Shandong Second Medical University, Weifang, Shandong, China.
  • Fangfang Xia
    Computing, Environment, and Life Sciences Directorate, Argonne National Laboratory, Argonne, Illinois, USA.
  • Marc Masana
    Graz University of Technology, Rechbauerstraße 12, Graz, 8010, Austria; TU Graz - SAL Dependable Embedded Systems Lab, Silicon Austria Labs, Graz, 8010, Austria.
  • Benedikt Tscheschner
    Graz University of Technology, Rechbauerstraße 12, Graz, 8010, Austria; Know-Center GmbH, Sandgasse 36/4, Graz, 8010, Austria.
  • Eduardo Veas
    Graz University of Technology, Rechbauerstraße 12, Graz, 8010, Austria; Know-Center GmbH, Sandgasse 36/4, Graz, 8010, Austria.
  • Yuxiang Zheng
    Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China.
  • Shiji Zhao
    MIIT Key Laboratory of Pattern Analysis and Machine Intelligence, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China.
  • Shao-Yuan Li
    MIIT Key Laboratory of Pattern Analysis and Machine Intelligence, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China.
  • Sheng-Jun Huang
  • Vincenzo Lomonaco
    Department of Computer Science and Engineering, University of Bologna, Italy. Electronic address: vincenzo.lomonaco@unibo.it.
  • Gido M van de Ven
    Department of Electrical Engineering, KU Leuven, Kasteelpark Arenberg 10, Leuven, 3001, Belgium. Electronic address: gido.vandeven@kuleuven.be.