Rajab, M. and Wang, D. orcid.org/0000-0003-0068-1005 (2020) Practical challenges and recommendations of filter methods for feature selection. Journal of Information & Knowledge Management, 19 (01). 2040019. ISSN 0219-6492
Abstract
Feature selection, the process of identifying relevant features to be incorporated into a proposed model, is one of the significant steps of the learning process. It removes noise from the data to increase the learning performance while reducing the computational complexity. The literature review indicated that most previous studies had focused on improving the overall classifier performance or reducing costs associated with training time during building of the classifiers. However, in this era of big data, there is an urgent need to deal with more complex issues that makes feature selection, especially using filter-based methods, more challenging; this in terms of dimensionality, data structures, data format, domain experts’ availability, data sparsity, and result discrepancies, among others. Filter methods identify the informative features of a given dataset to establish various predictive models using mathematical models. This paper takes a new route in an attempt to pinpoint recent practical challenges associated with filter methods and discusses potential areas of development to yield better performance. Several practical recommendations, based on recent studies, are made to overcome the identified challenges and make the feature selection process simpler and more efficient.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2020 World Scientic Publishing Co. This is an author-produced version of a paper subsequently published in Journal of Information and Knowledge Management. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | Feature selection; filter methods; machine learning; data imbalance; ranking methods |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Medicine, Dentistry and Health (Sheffield) > Department of Neuroscience (Sheffield) |
Funding Information: | Funder Grant number ACADEMY OF MEDICAL SCIENCES SBF004\1052 |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 08 Jan 2021 16:02 |
Last Modified: | 23 Mar 2021 01:38 |
Status: | Published |
Publisher: | World Scientific Pub Co Pte Lt |
Refereed: | Yes |
Identification Number: | 10.1142/s0219649220400195 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:169458 |