Mai, K.T., Davies, T. orcid.org/0000-0002-9677-2579 and Griffin, L.D.
(2024)
Understanding the limitations of self-supervised learning for tabular anomaly detection.
Pattern Analysis and Applications, 27.
61.
ISSN 1433-7541
Abstract
While self-supervised learning has improved anomaly detection in computer vision and natural language processing, it is unclear whether tabular data can benefit from it. This paper explores the limitations of self-supervision for tabular anomaly detection. We conduct several experiments spanning various pretext tasks on 26 benchmark datasets to understand why this is the case. Our results confirm representations derived from self-supervision do not improve tabular anomaly detection performance compared to using the raw representations of the data. We show this is due to neural networks introducing irrelevant features, which reduces the effectiveness of anomaly detectors. However, we demonstrate that using a subspace of the neural network’s representation can recover performance.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © The Author(s) 2024. This is an open access article under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. |
Keywords: | anomaly detection, deep learning, self-supervised learning, tabular data |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Education, Social Sciences and Law (Leeds) > School of Law (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 26 Jan 2024 16:51 |
Last Modified: | 21 Oct 2024 14:26 |
Status: | Published |
Publisher: | Springer |
Identification Number: | 10.1007/s10044-023-01208-1 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:208310 |