Alhazmi, M. and Altahhan, A. orcid.org/0000-0003-1133-7744 (2025) Achieving 3D Attention via Triplet Squeeze and Excitation Block. In: 2025 International Joint Conference on Neural Networks (IJCNN). 2025 International Joint Conference on Neural Networks (IJCNN), 30 Jun - 05 Jul 2025, Rome, Italy. IEEE. ISBN: 979-8-3315-1043-5. ISSN: 2161-4393. EISSN: 2161-4407.
Abstract
The emergence of ConvNeXt and its variants has reaffirmed the conceptual and structural suitability of CNN-based models for vision tasks, re-establishing them as key players in image classification in general, and in facial expression recognition (FER) in particular. In this paper, we propose a new set of models that build on these advancements by incorporating a new set of attention mechanisms that combines Triplet attention with Squeeze-and-Excitation (TripSE) in four different variants. We demonstrate the effectiveness of these variants by applying them to the ResNet18, DenseNet and ConvNext architectures to validate their versatility and impact. Our study shows that incorporating a TripSE block in these CNN models boosts their performances, particularly for the ConvNeXt architecture, indicating its utility. We evaluate the proposed mechanisms and associated models across four datasets, namely CIFAR100, ImageNet, FER2013 and AffectNet datasets, where ConvNext with TripSE achieves state-of-the-art results with an accuracy of 78.27% on the popular FER2013 dataset, a new feat for this dataset.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
| Keywords: | Computer vision, Attention mechanisms, Three-dimensional displays, Accuracy, Image recognition, Face recognition, Computational modeling, Neural networks, Computer architecture, Image classification |
| Dates: |
|
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
| Date Deposited: | 28 Jan 2026 14:19 |
| Last Modified: | 28 Jan 2026 15:30 |
| Published Version: | https://ieeexplore.ieee.org/document/11229332 |
| Status: | Published |
| Publisher: | IEEE |
| Identification Number: | 10.1109/ijcnn64981.2025.11229332 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:237081 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)