Sun, B. and Wei, H.-L. orcid.org/0000-0002-4704-7346 (2026) A novel ensemble machine learning approach for interpretable modeling, feature extraction and selection with applications to medical and biomedical signals and data. Concurrency and Computation: Practice and Experience, 38 (8). e70697. ISSN: 1532-0626
Abstract
Feature extraction and selection are crucial in biomedical data analysis to address high dimensionality, reduce computational complexity, and enhance model interpretability. However, traditional methods often focus on individual feature importance, overlooking complex inter-feature relationships, especially when processing and modeling dynamic and time-series data. In this study, we propose a novel framework that integrates Feature Co-occurrence Networks (FCN) with global importance scoring via the PageRank algorithm, which is built on a parametric Nonlinear AutoRegressive with eXogenous inputs (NARX) model structure to better capture temporal dependencies in sequential data. The proposed NARX-FCN-PageRank approach combines the strengths of multiple feature selection strategies while leveraging network analysis to identify stable and representative feature subsets. Extensive evaluations across diverse biomedical datasets, including both static and dynamic scenarios, demonstrate that our method effectively reduces feature dimensionality without compromising predictive performance. Moreover, the network visualizations provide valuable insights into the interdependencies and centrality of selected features, supporting model interpretability and enhancing trustworthiness. The NARX-FCN-PageRank framework thus offers a versatile and interpretable solution for feature selection in biomedical data analysis, with the potential to facilitate more efficient and reliable modeling in clinical and medical research applications.
Metadata
| Item Type: | Article |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © 2026 The Author(s). Concurrency and Computation: Practice and Experience published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
| Keywords: | biomedical signals; feature co-occurrence network; feature selection; medical data; model interpretability; NARX; PageRank |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Automatic Control and Systems Engineering (Sheffield) The University of Sheffield > Faculty of Engineering (Sheffield) > School of Electrical and Electronic Engineering |
| Date Deposited: | 21 Apr 2026 08:38 |
| Last Modified: | 21 Apr 2026 08:38 |
| Status: | Published |
| Publisher: | Wiley |
| Refereed: | Yes |
| Identification Number: | 10.1002/cpe.70697 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:240268 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)