Benjamin Matthews , Diego Allaix , Simon Wijte , Marcel Vullings
{"title":"推进无损混凝土抗压强度估计:大规模数据集和机器学习框架","authors":"Benjamin Matthews , Diego Allaix , Simon Wijte , Marcel Vullings","doi":"10.1016/j.ndteint.2025.103549","DOIUrl":null,"url":null,"abstract":"<div><div>Non-destructive test (NDT) methods provide an indirect assessment of the compressive strength of in-situ concrete structures. While traditional static models effectively capture the behaviour of small-scale localised datasets, their accuracy diminishes when applied to larger, aggregated datasets, where increased variability in NDT measurements introduces greater uncertainty in predicting concrete compressive strength. This paper presents three exhaustive, largest-to-date NDT databases on the ultrasonic pulse velocity (UPV), rebound hammer (RH), and SonReb methods, comprising 16,531 test results from 115 studies. First, existing empirical models are evaluated against global dataset trends. New relationships are fitted to reflect the global behaviour of each NDT method, highlighting their innate limitations in capturing large-scale variability. A comprehensive three-phase machine learning (ML) program is then introduced, studying the effects of incomplete features with varying levels of missing data on model performance. Seven diverse ML models are included in Phase 1, while Phase 2 assesses different imputation strategies. Phase 3 integrates the top-performers with a Tree-Structured Parzen estimator (TPE) optimisation algorithm to refine hyperparameters and maximise performance. Across all phases, CatBoost regression emerged as the most robust predictive model due to the high proportion of categorical variables included within the databases. The TPE-CatBoost models achieved final R<sup>2</sup> values of 0.928, 0.896, and 0.947 for UPV, RH, and SonReb, respectively. Finally, a Django-based web application was deployed on a cloud server (<span><span>https://recreate-ndt.onrender.com/</span><svg><path></path></svg></span>), allowing practitioners to generate real-time compressive strength predictions for new NDT results. These novel datasets and ML tools can power future innovation through more advanced data-driven modelling.</div></div>","PeriodicalId":18868,"journal":{"name":"Ndt & E International","volume":"158 ","pages":"Article 103549"},"PeriodicalIF":4.5000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Advancing non-destructive concrete compressive strength estimation: Large-Scale datasets and machine learning framework\",\"authors\":\"Benjamin Matthews , Diego Allaix , Simon Wijte , Marcel Vullings\",\"doi\":\"10.1016/j.ndteint.2025.103549\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Non-destructive test (NDT) methods provide an indirect assessment of the compressive strength of in-situ concrete structures. While traditional static models effectively capture the behaviour of small-scale localised datasets, their accuracy diminishes when applied to larger, aggregated datasets, where increased variability in NDT measurements introduces greater uncertainty in predicting concrete compressive strength. This paper presents three exhaustive, largest-to-date NDT databases on the ultrasonic pulse velocity (UPV), rebound hammer (RH), and SonReb methods, comprising 16,531 test results from 115 studies. First, existing empirical models are evaluated against global dataset trends. New relationships are fitted to reflect the global behaviour of each NDT method, highlighting their innate limitations in capturing large-scale variability. A comprehensive three-phase machine learning (ML) program is then introduced, studying the effects of incomplete features with varying levels of missing data on model performance. Seven diverse ML models are included in Phase 1, while Phase 2 assesses different imputation strategies. Phase 3 integrates the top-performers with a Tree-Structured Parzen estimator (TPE) optimisation algorithm to refine hyperparameters and maximise performance. Across all phases, CatBoost regression emerged as the most robust predictive model due to the high proportion of categorical variables included within the databases. The TPE-CatBoost models achieved final R<sup>2</sup> values of 0.928, 0.896, and 0.947 for UPV, RH, and SonReb, respectively. Finally, a Django-based web application was deployed on a cloud server (<span><span>https://recreate-ndt.onrender.com/</span><svg><path></path></svg></span>), allowing practitioners to generate real-time compressive strength predictions for new NDT results. These novel datasets and ML tools can power future innovation through more advanced data-driven modelling.</div></div>\",\"PeriodicalId\":18868,\"journal\":{\"name\":\"Ndt & E International\",\"volume\":\"158 \",\"pages\":\"Article 103549\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2025-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ndt & E International\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0963869525002300\",\"RegionNum\":2,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATERIALS SCIENCE, CHARACTERIZATION & TESTING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ndt & E International","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0963869525002300","RegionNum":2,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, CHARACTERIZATION & TESTING","Score":null,"Total":0}
Non-destructive test (NDT) methods provide an indirect assessment of the compressive strength of in-situ concrete structures. While traditional static models effectively capture the behaviour of small-scale localised datasets, their accuracy diminishes when applied to larger, aggregated datasets, where increased variability in NDT measurements introduces greater uncertainty in predicting concrete compressive strength. This paper presents three exhaustive, largest-to-date NDT databases on the ultrasonic pulse velocity (UPV), rebound hammer (RH), and SonReb methods, comprising 16,531 test results from 115 studies. First, existing empirical models are evaluated against global dataset trends. New relationships are fitted to reflect the global behaviour of each NDT method, highlighting their innate limitations in capturing large-scale variability. A comprehensive three-phase machine learning (ML) program is then introduced, studying the effects of incomplete features with varying levels of missing data on model performance. Seven diverse ML models are included in Phase 1, while Phase 2 assesses different imputation strategies. Phase 3 integrates the top-performers with a Tree-Structured Parzen estimator (TPE) optimisation algorithm to refine hyperparameters and maximise performance. Across all phases, CatBoost regression emerged as the most robust predictive model due to the high proportion of categorical variables included within the databases. The TPE-CatBoost models achieved final R2 values of 0.928, 0.896, and 0.947 for UPV, RH, and SonReb, respectively. Finally, a Django-based web application was deployed on a cloud server (https://recreate-ndt.onrender.com/), allowing practitioners to generate real-time compressive strength predictions for new NDT results. These novel datasets and ML tools can power future innovation through more advanced data-driven modelling.
期刊介绍:
NDT&E international publishes peer-reviewed results of original research and development in all categories of the fields of nondestructive testing and evaluation including ultrasonics, electromagnetics, radiography, optical and thermal methods. In addition to traditional NDE topics, the emerging technology area of inspection of civil structures and materials is also emphasized. The journal publishes original papers on research and development of new inspection techniques and methods, as well as on novel and innovative applications of established methods. Papers on NDE sensors and their applications both for inspection and process control, as well as papers describing novel NDE systems for structural health monitoring and their performance in industrial settings are also considered. Other regular features include international news, new equipment and a calendar of forthcoming worldwide meetings. This journal is listed in Current Contents.