Chen, Jinhui, Lue, Zhaojie, Zhang, Zhihong et al. (4 more authors) (2019) Polar Transformation on Image Features for Orientation-Invariant Representations. IEEE Transactions on Multimedia. pp. 300-313. ISSN 1520-9210
Abstract
The choice of image feature representation plays a crucial role in the analysis of visual information. Although many robust feature representation models have been proposed to improve performance on different visual tasks, most existing feature representations (e.g., handcrafted features or Convolutional Neural Networks (CNNs)) have a relatively limited capacity to capture highly orientation-invariant (rotation/reversal) features. The net consequence is suboptimal performance on visual tasks. To address these problems, this study adopts a novel transformational approach, which investigates the potential of using polar feature representations. At the low level, our representation consists of a histogram of oriented gradients, which is binned using annular spatial bin-type cells applied to the polar gradient. This gives gradient binning invariance for feature extraction, so the descriptors have significantly enhanced orientation-invariant capabilities. The proposed feature representation, termed orientation-invariant histograms of oriented gradients (Oi-HOG), is capable of accurate facial expression recognition (FER). In the context of the CNN architecture, we propose two polar convolution operations, referred to as Full Polar Convolution (FPolarConv) and Local Polar Convolution (LPolarConv), and use these to develop polar architectures for orientation-invariant CNN representations. Experimental results on a set of challenging benchmark image datasets show that the proposed orientation-invariant image representation, based on polar models for both handcrafted and deep learning features, is competitive with state-of-the-art methods while remaining compact.
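The idea of binning oriented gradients over annular cells in a polar frame can be illustrated with a small sketch. This is not the authors' Oi-HOG implementation: the function name `annular_polar_hog`, the parameters `n_rings` and `n_bins`, the choice of the image center as the pole, and the use of the unsigned angle between each gradient and the local radial direction are all assumptions made for illustration.

```python
import numpy as np

def annular_polar_hog(image, n_rings=4, n_bins=8):
    """Toy descriptor: gradient orientation measured against the radial
    direction, accumulated per concentric ring (hypothetical sketch)."""
    img = np.asarray(image, dtype=np.float64)
    h, w = img.shape

    # Finite-difference gradients along rows (y) and columns (x).
    gy, gx = np.gradient(img)
    magnitude = np.hypot(gx, gy)

    # Pixel offsets from the image center, which serves as the pole.
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    yy, xx = np.mgrid[0:h, 0:w]
    dy, dx = yy - cy, xx - cx
    radius = np.hypot(dx, dy)

    # Unsigned angle in [0, pi] between the gradient and the radial
    # direction. Rotating or mirroring the image about the center moves
    # positions and gradients together, so this angle is unchanged.
    dot = dx * gx + dy * gy
    cross = dx * gy - dy * gx
    rel = np.arctan2(np.abs(cross), dot)

    # Keep pixels inside the largest centered disc; skip the pole itself,
    # where the radial direction is undefined.
    r_max = min(cy, cx)
    inside = (radius > 0) & (radius <= r_max)

    # Annular (ring) index and orientation bin for every pixel.
    ring = np.minimum((radius / r_max * n_rings).astype(int), n_rings - 1)
    obin = np.minimum((rel / np.pi * n_bins).astype(int), n_bins - 1)

    # Magnitude-weighted voting into a (ring, orientation) histogram.
    hist = np.zeros((n_rings, n_bins))
    np.add.at(hist, (ring[inside], obin[inside]), magnitude[inside])

    norm = np.linalg.norm(hist)
    return (hist / norm).ravel() if norm > 0 else hist.ravel()

rng = np.random.default_rng(0)
img = rng.random((16, 16))
d = annular_polar_hog(img)
same_after_rotation = np.allclose(d, annular_polar_hog(np.rot90(img)))
same_after_mirror = np.allclose(d, annular_polar_hog(np.fliplr(img)))
```

In this sketch each ring sums votes over all angular positions, so a 90-degree rotation or a mirror flip of the pixel grid only reorders the contributions within a ring. Arbitrary rotation angles would additionally involve interpolation, which the sketch does not address.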
Metadata
Item Type: | Article |
---|---|
Authors/Creators: | Chen, Jinhui; Lue, Zhaojie; Zhang, Zhihong; et al. (4 more authors) |
Copyright, Publisher and Additional Information: | This is an author-produced version of the published paper. Uploaded in accordance with the publisher’s self-archiving policy. Further copying may not be permitted; contact the publisher for details |
Keywords: | CNN, Convolution, Feature extraction, HOG, Histograms, Image representation, Robustness, Rotation-invariant and reversal-invariant representation, Task analysis, Visualization |
Dates: | Published: 2019 |
Institution: | The University of York |
Academic Units: | The University of York > Faculty of Sciences (York) > Computer Science (York) |
Depositing User: | Pure (York) |
Date Deposited: | 27 Jun 2018 11:00 |
Last Modified: | 25 Oct 2024 23:58 |
Published Version: | https://doi.org/10.1109/TMM.2018.2856121 |
Status: | Published |
Refereed: | Yes |
Identification Number: | 10.1109/TMM.2018.2856121 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:132227 |