Task-Free Dynamic Sparse Vision Transformer for Continual Learning

Abstract

Vision Transformers (ViTs) represent self-attention-based network backbones shown to be efficient in many individual tasks, but which have not been explored in Task-Free Continual Learning (TFCL) so far. Most existing ViT-based approaches for Continual Learning (CL) are relying on task information. In this study, we explore the advantages of the ViT in a more challenging CL scenario where the task boundaries are unavailable during training. To address this learning paradigm, we propose the Task-Free Dynamic Sparse Vision Transformer (TFDSViT), which can dynamically build new sparse experts, where each expert leverages sparsity to allocate the model's capacity for capturing different information categories over time. To avoid forgetting and ensure efficiency in reusing the previously learned knowledge in subsequent learning, we propose a new dynamic dual attention mechanism consisting of the Sparse Attention (SA') and Knowledge Transfer Attention (KTA) modules. The SA' refrains from updating some previously learned attention blocks for preserving prior knowledge. The KTA uses and regulates the information flow of all previously learned experts for learning new patterns. The proposed dual attention mechanism can simultaneously relieve forgetting and promote knowledge transfer for a dynamic expansion model in a task-free manner. We also propose an energy-based dynamic expansion mechanism using the energy as a measure of novelty for the incoming samples which provides appropriate expansion signals leading to a compact network architecture for TFDSViT. Extensive empirical studies demonstrate the effectiveness of TFDSViT. The code and supplementary material (SM) are available at https://github.com/dtuzi123/TFDSViT.

Metadata

Item Type:	Proceedings Paper
Authors/Creators:	Ye, Fei Bors, Adrian Gheorghe https://orcid.org/0000-0001-7838-0021
Copyright, Publisher and Additional Information:	© 2024, Association for the Advancement of Artificial Intelligence. This is an author-produced version of the published paper. Uploaded in accordance with the University’s Research Publications and Open Access policy.
Dates:	Accepted: 1 December 2023 Published: 24 March 2024
Institution:	The University of York
Academic Units:	The University of York > Faculty of Sciences (York) > Computer Science (York)
Depositing User:	Pure (York)
Date Deposited:	20 Dec 2024 12:20
Last Modified:	20 Sep 2025 02:45
Published Version:	https://doi.org/10.1609/aaai.v38i15.29581
Status:	Published
Publisher:	AAAI Press
Identification Number:	10.1609/aaai.v38i15.29581
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:221038

Download

Accepted Version

Filename: AAAI_AI24a.pdf

Description: AAAI_AI24a

Licence: CC-BY 2.5

CLICK TO DOWNLOAD

CORE (COnnecting REpositories)

Task-Free Dynamic Sparse Vision Transformer for Continual Learning

Abstract

Metadata

Download

Accepted Version

Export

Statistics