Lizhi Long , Jingjing Guo , Honghu Chu , Songyue Wang , Shaopeng Xu , Lu Deng
{"title":"Binocular vision-based pose monitoring technique for assembly alignment of precast concrete components","authors":"Lizhi Long , Jingjing Guo , Honghu Chu , Songyue Wang , Shaopeng Xu , Lu Deng","doi":"10.1016/j.aei.2025.103205","DOIUrl":null,"url":null,"abstract":"<div><div>Vision-based pose monitoring for the assembly alignment of precast concrete components at long distances is often hindered by the small size of elliptical targets and cluttered construction backgrounds. To address the challenges in small ellipse identification and coordinate calculation, this study proposes a binocular vision-based pose monitoring technique for precast concrete components. First, an improved ellipse target detection algorithm is developed, integrating Omni-Dimensional Dynamic Convolution (ODConv) and a Coordinate Attention (CA) mechanism to enhance small target recognition under complex backgrounds. ODConv dynamically adjusts convolutional kernel shapes to capture multi-scale features while the CA mechanism embeds precise location information into channel attention. Additionally, The Normalized Wasserstein Distance (NWD) is employed to improve non-maximum suppression by considering both spatial distance and shape similarity, which is particularly beneficial for small and densely distributed targets. Second, an enhanced ellipse center extraction algorithm is introduced, utilizing image magnification and contrast enhancement for more accurate pixel coordinate extraction. The experiments demonstrated the superior performance of the proposed method with a precision of 97.6 %, a recall of 97.8 %, a mean average precision ([email protected]:0.95) of 70.5 %, and an inference speed of 35.8 FPS. This balance between accuracy and efficiency ensures the feasibility of real-time applications. Furthermore, field tests conducted at a hoisting site for precast concrete columns achieved positioning errors in the x and y directions within 5 mm over a monitoring range of 10 m, which confirmed the method’s practical reliability.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103205"},"PeriodicalIF":8.0000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625000989","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Vision-based pose monitoring for the assembly alignment of precast concrete components at long distances is often hindered by the small size of elliptical targets and cluttered construction backgrounds. To address the challenges in small ellipse identification and coordinate calculation, this study proposes a binocular vision-based pose monitoring technique for precast concrete components. First, an improved ellipse target detection algorithm is developed, integrating Omni-Dimensional Dynamic Convolution (ODConv) and a Coordinate Attention (CA) mechanism to enhance small target recognition under complex backgrounds. ODConv dynamically adjusts convolutional kernel shapes to capture multi-scale features while the CA mechanism embeds precise location information into channel attention. Additionally, The Normalized Wasserstein Distance (NWD) is employed to improve non-maximum suppression by considering both spatial distance and shape similarity, which is particularly beneficial for small and densely distributed targets. Second, an enhanced ellipse center extraction algorithm is introduced, utilizing image magnification and contrast enhancement for more accurate pixel coordinate extraction. The experiments demonstrated the superior performance of the proposed method with a precision of 97.6 %, a recall of 97.8 %, a mean average precision ([email protected]:0.95) of 70.5 %, and an inference speed of 35.8 FPS. This balance between accuracy and efficiency ensures the feasibility of real-time applications. Furthermore, field tests conducted at a hoisting site for precast concrete columns achieved positioning errors in the x and y directions within 5 mm over a monitoring range of 10 m, which confirmed the method’s practical reliability.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.