Lawrence, N.R. orcid.org/0000-0002-7560-0268, Bacila, I., Tonge, J. et al. (6 more authors) (2025) Risk of bias in machine learning and statistical models to predict height or weight: a systematic review in fetal and paediatric medicine. Diagnostic and Prognostic Research, 9 (1). 32. ISSN: 2397-7523
Abstract
Background
Prediction of suboptimal growth allows early intervention that can improve outcomes for developing fetus’ as well as infants and children. We investigate the risk of bias in statistical or machine learning models to predict the height or weight of a fetus, infant or child under 20 years of age to inform the current standard of research and provide insight into why equations developed over 30 years ago are still recommended for use by national professional bodies.
Methods
We systematically searched MEDLINE and EMBASE for peer reviewed original research studies published in 2022. We included studies if they developed or validated a multivariable model to predict height or weight of an individual using two or more variables, excluding studies assessing imaging or using genetics or metabolomics information. Risk of bias was assessed for all prediction models and analyses using the Prediction model Risk Of Bias ASsessment Tool (PROBAST).
Results
Sixty-four studies were included, in which we assessed the development of 180 models and validation of 61 models. Sample size was only considered in 10% of developed models and 13% of validated models. Despite height and weight being continuous variables, 77% of models developed predicted a dichotomised outcome variable.
Registration
The review was registered on PROSPERO (ID: CRD42023421146), the International prospective register of systematic reviews on 26/4/2023.
Metadata
| Item Type: | Article |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © The Author(s) 2025. Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
| Keywords: | Growth; Height; Machine learning; Obstetrics; Paediatrics; Prediction modelling; Risk of bias; Statistical modelling; Systematic review; Weight |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Management School (Sheffield) The University of Sheffield > Faculty of Medicine, Dentistry and Health (Sheffield) > School of Medicine and Population Health |
| Funding Information: | Funder Grant number NIHR Academy NIHR302559 |
| Date Deposited: | 23 Dec 2025 16:07 |
| Last Modified: | 23 Dec 2025 16:07 |
| Status: | Published |
| Publisher: | Springer Science and Business Media LLC |
| Refereed: | Yes |
| Identification Number: | 10.1186/s41512-025-00215-6 |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:235905 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)