Woldeamanuel Minwuye Mesfin , Gun Kim , Hyeong-Ki Kim
{"title":"Concrete section segmentation with advanced deep learning models and refined labeling approaches","authors":"Woldeamanuel Minwuye Mesfin , Gun Kim , Hyeong-Ki Kim","doi":"10.1016/j.eswa.2025.127697","DOIUrl":null,"url":null,"abstract":"<div><div>This study investigates deep learning models for semantic and instance segmentation of construction site images, focusing on concrete sections. A labeling scheme was developed for the image dataset, extending to multiple classes to achieve more detailed segmentation. ResNet-50, Xception, Inception ResNet V2, and Mask R-CNN models were trained and evaluated using key metrics, including accuracy, intersection over union, boundary F1 score, and receiver operating characteristic curves. The influence of hyperparameters, particularly the learning rate, was thoroughly analyzed. The results demonstrated that the optimal learning rate is 0.01, which led to a performance improvement within the studied range. Additionally, the Xception model consistently outperformed the others across most classes, delivering robust accuracy and reliability,with an accuracy value of 94%, an IoU of 88%, and a BFS of 73%. Furthermore, this study examines the impact of practical factors, including brightness, blur, camera rotation, perspective distortion, and illumination, on segmentation performance. The findings reveal that segmentation accuracy declines significantly under extreme conditions, highlighting the necessity of data augmentation and high-quality image acquisition to improve model resilience.”</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"281 ","pages":"Article 127697"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425013193","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
This study investigates deep learning models for semantic and instance segmentation of construction site images, focusing on concrete sections. A labeling scheme was developed for the image dataset, extending to multiple classes to achieve more detailed segmentation. ResNet-50, Xception, Inception ResNet V2, and Mask R-CNN models were trained and evaluated using key metrics, including accuracy, intersection over union, boundary F1 score, and receiver operating characteristic curves. The influence of hyperparameters, particularly the learning rate, was thoroughly analyzed. The results demonstrated that the optimal learning rate is 0.01, which led to a performance improvement within the studied range. Additionally, the Xception model consistently outperformed the others across most classes, delivering robust accuracy and reliability,with an accuracy value of 94%, an IoU of 88%, and a BFS of 73%. Furthermore, this study examines the impact of practical factors, including brightness, blur, camera rotation, perspective distortion, and illumination, on segmentation performance. The findings reveal that segmentation accuracy declines significantly under extreme conditions, highlighting the necessity of data augmentation and high-quality image acquisition to improve model resilience.”
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.