Online Task-Free Continual Learning via Expansible Vision Transformer

Abstract

Vision Transformers (ViTs) have lately shown remarkable data representation capabilities leading to state-of-the-art results in several vision and language learning tasks. Given its powerful representation ability, some recent studies have explored the ViT in continual learning by employing the dynamic expansion mechanism. However, these methods rely on the task information and therefore can not deal with a more realistic scenario, namely the Task-Agnostic Continual Learning (TACL). Unlike these ViT-based continual learning methods, this paper addresses TACL by proposing the Lifelong Expansible Vision Transformer (LEViT) model, which dynamically increases the model’s capacity to deal with changes in the underlying probability distribution of the data representations learnt during continual learning. LEViT is implemented by an ensemble of transformers, each enabled with a multi-head attention mechanism and a linear classifier. We propose a new dynamic expansion mechanism which incrementally increases the capacity of LEViT without requiring task labels, by evaluating the statistical similarity between the joint distribution modeled by all previously learned components and the probabilistic representation of incoming data samples. The proposed expansion mechanism ensures the diversity of learnt knowledge by the components of LEViT. In addition, we introduce the Dynamic Knowledge Fusion (DKF) approach, aiming to explore the ViT feature representation ability for knowledge transfer. Specifically, we view all previously learnt components as an evolved knowledge base which provides prior knowledge for future learning. The proposed LEViT, when compared to the existing ViT-based methods, does not require any task information and can reuse previously learned representations to promote future task learning.

Metadata

Item Type:	Article
Authors/Creators:	Ye, Fei Bors, Adrian Gheorghe https://orcid.org/0000-0001-7838-0021
Copyright, Publisher and Additional Information:	This is an author-produced version of the published paper. Uploaded in accordance with the University’s Research Publications and Open Access policy.
Dates:	Accepted: 16 April 2025 Published (online): 3 June 2025 Published: 1 January 2026
Institution:	The University of York
Academic Units:	The University of York > Faculty of Sciences (York) > Computer Science (York)
Date Deposited:	09 Jun 2025 15:40
Last Modified:	02 Mar 2026 00:10
Published Version:	https://doi.org/10.1016/j.patcog.2025.111730
Status:	Published
Refereed:	Yes
Identification Number:	10.1016/j.patcog.2025.111730
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:227640

Download

Accepted Version

Filename: LEViT-PR25.pdf

Description: LEViT-PR25

Licence: CC-BY 2.5

CLICK TO DOWNLOAD

CORE (COnnecting REpositories)

Online Task-Free Continual Learning via Expansible Vision Transformer

Abstract

Metadata

Download

Accepted Version

Export

Statistics