Chen, W., Shi, C., Ding, J. et al. (2 more authors) (2025) Data-driven sparse learning of three-dimensional subsurface properties incorporating random field theory. Engineering Geology, 349. 107972. ISSN 0013-7952
Abstract
Geotechnical engineers rely on accurate soil property information for engineering analyses. However, it is challenging for spatial learning of soil attributes because in-situ geotechnical testing is typically performed sparsely at discrete locations, and soil properties also exhibit inherent spatial variability. Traditional geostatistical methods for predicting spatial properties at these unsampled locations exhibit high computational complexity and require pre-determination of hyper-parameters, while pure data-driven methods fail to integrate geotechnical knowledge. In this study, a hybrid and parameter-free framework that uses random field theory and machine learning is proposed to model 3D subsurface field with reduced computational complexity. The framework constructs site-specific basis functions for characterizing the spatial variations of soil properties by decomposing a correlation matrix through principal component analysis. To further reduce the computational complexity involved in processing high-dimensional correlation matrices, a sparse sampling strategy is adopted to map correlation matrix onto lower-rank principal component space. A series of synthetic random field examples are generated to illustrate the impact of scale of fluctuation and autocorrelation functions on the accuracy and sensitivity of subsurface modeling. The performance of the proposed method is further validated using both synthetic cases and two real case histories. It is demonstrated that the proposed method generally achieves higher R<sup>2</sup> and lower root mean square error (RMSE) and mean absolute percentage error (MAPE) compared to state-of-the-art methods, such as Kriging and Bayesian compressive sensing. Moreover, the proposed method facilitates the explicit quantification of uncertainty associated with the subsurface models, providing valuable insights for engineering design and analysis. The data and code used in this study are available at https://github.com/Data-Driven-RFT/Sparse-Learning.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | This is an author produced version of an article accepted for publication in Engineering Geology made available under the terms of the Creative Commons Attribution License (CC-BY), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. |
Keywords: | Geotechnical spatial variability; Machine learning aided geotechnics; Random field theory; Geotechnical site investigation; Principal component analysis |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Civil Engineering (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 18 Jun 2025 10:12 |
Last Modified: | 18 Jun 2025 10:12 |
Status: | Published |
Publisher: | Elsevier |
Identification Number: | 10.1016/j.enggeo.2025.107972 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:227950 |