Sultana, N. orcid.org/0009-0000-6971-6021 and Ferdushi, K.F. orcid.org/0000-0003-4393-9491 (2026) Predictors of undiagnosed diabetes in the Bangladeshi female population: a propensity score–weighted machine learning analysis of BDHS biomarker data. Journal of Diabetes Research. ISSN: 2314-6745
Abstract
Background In Bangladesh, diabetes mellitus has become a substantial public health burden, imposing a strain on the population both economically and clinically. Robust methods for identifying high-risk populations using national data are urgently needed due to the increasing prevalence of metabolic noncommunicable illnesses throughout South Asia. Using data from the most recent wave of a reliable national survey, this study was aimed at creating and testing a diabetes prediction model for adult female Bangladeshis.
Methods The 2022 Bangladesh Demographic and Health Survey (BDHS) biomarker data were examined, where key factors of undiagnosed diabetes were identified using stacked ensemble machine learning (ML) algorithms together with explainable AI (XAI) approaches after adjusting for selection bias using a propensity score model. A nomogram, a user-friendly clinical tool for risk assessment, was created. The area under the receiver operating characteristic (AUROC) curve was used to assess the model′s performance.
Results Of the 18,547 weighted female participants (unweighted n = 7833), the prevalence of undiagnosed diabetes was 6.26% (95% CI: 5.52%–7.00%). Significant predictors identified included age (mean 42 ± 16 years vs. 38 ± 16 years; p < 0.001), BMI (mean 24.7 ± 4.6 kg/m2; p < 0.001), and hypertension (mean 120/78 mmHg; p = 0.001). ML models demonstrated high predictive accuracy (AUROC > 0.80), and a simplified clinical nomogram was developed to provide personalized risk scores. XAI (SHAP) analysis emphasized nonlinear influences, particularly in urban residents, who accounted for 40% of undiagnosed cases, and the richest quintile, which bore a disproportionate burden of 33% (p < 0.001) compared to just 13% in the poorest quintile.
Conclusion For early diabetes screening in Bangladesh, the established predictive model and nomogram provide an evidence-based method. These technologies can help healthcare providers and policymakers tailor interventions toward high-risk groups by using nationally representative data. This could potentially lessen the sustained health and financial effects of the diabetes epidemic.
Metadata
| Item Type: | Article |
|---|---|
| Authors/Creators: |
|
| Editors: |
|
| Copyright, Publisher and Additional Information: | © 2026 Nahid Sultana and Kanis Fatama Ferdushi. Journal of Diabetes Research published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. https://creativecommons.org/licenses/by/4.0/ |
| Keywords: | 2022 BDHS; diabetes mellitus; explainable AI (XAI); nomogram; propensity score; public health |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Medicine, Dentistry and Health (Sheffield) > School of Health and Related Research (Sheffield) > ScHARR - Sheffield Centre for Health and Related Research |
| Date Deposited: | 01 Jul 2026 15:38 |
| Last Modified: | 01 Jul 2026 15:38 |
| Status: | Published online |
| Publisher: | Wiley |
| Refereed: | Yes |
| Identification Number: | 10.1155/jdr/2162121 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:242803 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)