Li, S., Liu, K. and Yang, P. orcid.org/0000-0002-8553-7127 (2025) An interpretable deep learning approach for Alzheimer's disease diagnosis using gene expression data. IEEE/ACM Transactions on Computational Biology and Bioinformatics. ISSN 1545-5963
Abstract
With the global ageing population, the diagnosis of Alzheimer’s disease (AD) has become an urgent public health priority. Gene expression techniques offer the advantages of being less invasive and cost-effective, but their high dimensionality and small sample sizes make them prone to the curse of dimensionality in AD diagnosis. This study proposes a novel interpretable deep learning approach to address these challenges. We introduce a shallow sparse autoencoder for dimensionality reduction and combine it with XGBoost for classification, achieving an Area Under the Receiver Operating Characteristic curve (AUROC) of up to 95.13%. Additionally, we develop a fast, low cost feature selection algorithm that dynamically adjusts feature elimination to enhance model efficiency. Comprehensive cross- dataset evaluation demonstrates the model’s strong generalisation performance on the public datasets: Alzheimer’s Disease Neuroimaging Initiative (ADNI), AddNeuroMed1 (ANM1), and ANM2. Our method also provides biological interpretability through enrichment analysis, offering insights into the mechanisms underlying AD and potential therapeutic targets. This makes our approach a promising tool for early, accurate diagnosis and clinical application.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2025 The Author(s). Except as otherwise noted, this author-accepted version of a journal article published in IEEE Transactions on Computational Biology and Bioinformatics is made available via the University of Sheffield Research Publications and Copyright Policy under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ |
Keywords: | feature selection; dimensionality reduction; en23 richment analysis; gene expression; alzheimer’s disease; deep 24 learning |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 14 May 2025 13:43 |
Last Modified: | 16 May 2025 11:29 |
Status: | Published online |
Publisher: | Association for Computing Machinery (ACM) |
Refereed: | Yes |
Identification Number: | 10.1109/TCBBIO.2025.3568711 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:226357 |