Kurpukdee, Nattapong and Bors, Adrian Gheorghe orcid.org/0000-0001-7838-0021 (2024) Temporal Transformer Encoder for Video Class Incremental Learning. In: IEEE International Conference on Image Processing (ICIP). IEEE , Abu Dhabi, UAE , pp. 1295-1301.
Abstract
Current video classification approaches suffer from catastrophic forgetting when they are retrained on new databases. Continual learning aims to enable a classification system with learning from a succession of tasks without forgetting. In this paper we propose to use a transformer-based video class incremental learning model. During a succession of learning steps, at each training time, the transformer is used to extract characteristic spatio-temporal features from videos corresponding to a set of classes. When new video classification tasks become available, we train new classifier modules with the transformer-extracted features, gradually building a mixture model. The proposed methodology enables continual class learning in videos without being required to consider the learning of an initial set of classes, leading to low computation and memory requirements. The proposed model is evaluated on standard action recognition datasets including UCF101 and HMDB51, which are split into sets of classes, to be learnt sequentially. Our proposed method significantly outperforms the baselines on all datasets. Index Terms-Continual video classification, video transformer, video class incremental learning.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | ©2024 IEEE. This is an author-produced version of the published paper. Uploaded in accordance with the University’s Research Publications and Open Access policy. |
Dates: |
|
Institution: | The University of York |
Academic Units: | The University of York > Faculty of Sciences (York) > Computer Science (York) |
Depositing User: | Pure (York) |
Date Deposited: | 20 Dec 2024 12:00 |
Last Modified: | 22 Mar 2025 00:15 |
Published Version: | https://doi.org/10.1109/ICIP51287.2024.10647854 |
Status: | Published |
Publisher: | IEEE |
Identification Number: | 10.1109/ICIP51287.2024.10647854 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:221070 |
Download
Filename: ICIP2024.pdf
Description: Temporal Transformer Encoder for Video Class Incremental Learning
Licence: CC-BY 2.5