Willson, J., Woollam, R.C., Thompson, H. orcid.org/0000-0002-0493-1131 et al. (1 more author) (2026) Effect of Sampling Dataset Size and Distribution on Machine Learning-Based Surrogate Modelling of CO₂ Corrosion. In: Proceedings of the CONFERENCE 2026. AMPP Annual Conference + Expo 2026, 15-19 Mar 2026, Houston, Texas. . Association for Materials Protection and Performance (AMPP). Article no: C2026-00149.
Abstract
Predicting CO₂ corrosion accurately using high-fidelity mechanistic models requires specialist modelling skills and is computationally demanding, both factors limiting their practical application. This paper explores the potential of using machine learning surrogates to provide corrosion predictions accurately and efficiently. Optimized models, including gradient boosted decision trees (GBDTs), deep neural networks (DNNs) and hybrid stacked ensembles, achieved high predictive accuracy (MAPE < 5%) and drastically reduced computation time. Detailed error and data requirement analyses revealed crucial insights. A strong error correlation with low Reynolds number suggests the surrogate replicates source model limitations, potentially acting as a diagnostic tool for the underlying simulation. Furthermore, data requirement analysis highlighted the benefits of optimizing the sampling strategy, with a boundary-focused dataset based on a beta sampling distribution achieving performance plateaus using significantly fewer data points than a uniform sampling strategy (~17,000 compared to ~100,000). The work demonstrates that machine learning surrogates provide significant speed-up and valuable diagnostic capabilities.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Keywords: | Machine Learning, Carbon Dioxide, Corrosion Rate, Modelling, Simulation |
| Dates: |
|
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Mechanical Engineering (Leeds) |
| Date Deposited: | 18 May 2026 07:48 |
| Last Modified: | 18 May 2026 07:48 |
| Published Version: | https://content.ampp.org/ampp/proceedings-abstract... |
| Status: | Published |
| Publisher: | Association for Materials Protection and Performance (AMPP) |
| Identification Number: | 10.5006/c2026-00149 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:241102 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)