Balancing feature alignment and uniformity for few-shot classification

Abstract

In Few-Shot Learning (FSL), the objective is to correctly recognize new samples from novel classes with only a few available samples per class. Existing methods in FSL primarily focus on learning transferable knowledge from base classes by maximizing the information between feature representations and their corresponding labels. However, this approach may suffer from the “supervision collapse" issue, which arises due to a bias towards the base classes. In this paper, we propose a solution to address this issue by preserving the intrinsic structure of the data and enabling the learning of a generalized model for the novel classes. Following the InfoMax principle, our approach maximizes two types of mutual information (MI): between the samples and their feature representations, and between the feature representations and their class labels. This allows us to strike a balance between discrimination (capturing class-specific information) and generalization (capturing common characteristics across different classes) in the feature representations. To achieve this, we adopt a unified framework that perturbs the feature embedding space using two low-bias estimators. The first estimator maximizes the MI between a pair of intra-class samples, while the second estimator maximizes the MI between a sample and its augmented views. This framework effectively combines knowledge distillation between class-wise pairs and enlarges the diversity in feature representations. By conducting extensive experiments on popular FSL benchmarks, our proposed approach achieves comparable performances with state-of-the-art competitors. For example, we achieved an accuracy of 69.53% on the miniImageNet dataset and 77.06% on the CIFAR-FS dataset for the 5-way 1-shot task.

Metadata

Item Type:	Article
Authors/Creators:	Yu, Y. Zhang, D. Ji, Z. Li, X. Han, J. Zhang, Z.
Copyright, Publisher and Additional Information:	© 2023 The Authors. Except as otherwise noted, this author-accepted version of a journal article published in IEEE Transactions on Image Processing is made available via the University of Sheffield Research Publications and Copyright Policy under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/
Keywords:	Training; Task analysis; Feature extraction; Adaptation models; Smoothing methods; Data models; Mutual information
Dates:	Accepted: 21 October 2023 Published (online): 3 November 2023 Published: 3 November 2023
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User:	Symplectic Sheffield
Date Deposited:	15 Nov 2023 09:54
Last Modified:	23 Jul 2025 14:03
Status:	Published
Publisher:	Institute of Electrical and Electronics Engineers
Refereed:	Yes
Identification Number:	10.1109/TIP.2023.3328475
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:205361

Download

Accepted Version

Filename: yunlong_TIP23.pdf

Licence: CC-BY 4.0

CLICK TO DOWNLOAD

CORE (COnnecting REpositories)

Balancing feature alignment and uniformity for few-shot classification

Abstract

Metadata

Download

Accepted Version

Export

Statistics