Adaptive Orthogonal Projection for Batch and Online Continual Learning

Yiduo Guo; Wenpeng Hu; Dongyan Zhao; Bing Liu

2022 AAAI AAAI 2022

Adaptive Orthogonal Projection for Batch and Online Continual Learning

Abstract

Abstract Catastrophic forgetting is a key obstacle to continual learning. One of the state-of-the-art approaches is orthogonal projection. The idea of this approach is to learn each task by updating the network parameters or weights only in the direction orthogonal to the subspace spanned by all previous task inputs. This ensures no interference with tasks that have been learned. The system OWM that uses the idea performs very well against other state-of-the-art systems. In this paper, we first discuss an issue that we discovered in the mathematical derivation of this approach and then propose a novel method, called AOP (Adaptive Orthogonal Projection), to resolve it, which results in significant accuracy gains in empirical evaluations in both the batch and online continual learning settings without saving any previous training data as in replay-based methods.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

🧭 Keyword Pioneer — parameter isolation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yiduo Guo , Wenpeng Hu , Dongyan Zhao , Bing Liu

Topics

Machine Learning > Learning Types > Continual Learning Machine Learning > Optimization & Theory > Optimization Machine Learning > Learning Paradigms > Continual Learning Deep Learning > Learning Types > Representation Learning Deep Learning > Learning Types > Continual Learning

Keywords

representation learning continual learning catastrophic forgetting online learning orthogonal projection batch learning parameter isolation replay-free learning

Download PDF

Related papers

Dynamic Spatial Propagation Network for Depth Completion 2022

FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition 2022

Memory-Guided Semantic Learning Network for Temporal Sentence Grounding 2022

AnchorFace: Boosting TAR@FAR for Practical Face Recognition 2022

Parallel and High-Fidelity Text-to-Lip Generation 2022