自主发射龙门:用于预制混凝土梁实时姿态估计的改进单目视觉方法

IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY
Weili Fang , Guanghui Geng , Gan Zhang , Peter E.D. Love
{"title":"自主发射龙门:用于预制混凝土梁实时姿态估计的改进单目视觉方法","authors":"Weili Fang ,&nbsp;Guanghui Geng ,&nbsp;Gan Zhang ,&nbsp;Peter E.D. Love","doi":"10.1016/j.autcon.2025.106534","DOIUrl":null,"url":null,"abstract":"<div><div>The absence of accurate and real-time 6-DoF pose data for precast concrete girders renders launching gantry operations predominantly manual, thereby impeding further automation. Such limitations pose a critical question: <em>How can we accurately and robustly estimate the pose of precast concrete girders in real-time during launching gantry operations?</em> To address that question, our paper proposes a monocular vision-based approach to estimate the 6-DoF pose of the precast concrete girder in launching gantry operations. The approach detects the ChArUco board regions using the YOLOv11n model, applies GAN-based image deblurring. The 6-DoF pose is then estimated using a Perspective-n-Point solver and transformed to the gantry coordinate system. Field tests demonstrate robust performance, achieving a mean reprojection error of 0.113 pixels and a processing latency of 60 ms per frame. The results validate the approach's robustness and real-time performance, highlighting monocular vision as a cost-effective alternative to LiDAR–IMU fusion for large-scale automation in construction.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"180 ","pages":"Article 106534"},"PeriodicalIF":11.5000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Autonomous launching gantry: Improved monocular vision approach for real-time pose estimation of precast concrete girders\",\"authors\":\"Weili Fang ,&nbsp;Guanghui Geng ,&nbsp;Gan Zhang ,&nbsp;Peter E.D. Love\",\"doi\":\"10.1016/j.autcon.2025.106534\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The absence of accurate and real-time 6-DoF pose data for precast concrete girders renders launching gantry operations predominantly manual, thereby impeding further automation. Such limitations pose a critical question: <em>How can we accurately and robustly estimate the pose of precast concrete girders in real-time during launching gantry operations?</em> To address that question, our paper proposes a monocular vision-based approach to estimate the 6-DoF pose of the precast concrete girder in launching gantry operations. The approach detects the ChArUco board regions using the YOLOv11n model, applies GAN-based image deblurring. The 6-DoF pose is then estimated using a Perspective-n-Point solver and transformed to the gantry coordinate system. Field tests demonstrate robust performance, achieving a mean reprojection error of 0.113 pixels and a processing latency of 60 ms per frame. The results validate the approach's robustness and real-time performance, highlighting monocular vision as a cost-effective alternative to LiDAR–IMU fusion for large-scale automation in construction.</div></div>\",\"PeriodicalId\":8660,\"journal\":{\"name\":\"Automation in Construction\",\"volume\":\"180 \",\"pages\":\"Article 106534\"},\"PeriodicalIF\":11.5000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Automation in Construction\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0926580525005746\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automation in Construction","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926580525005746","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

由于缺乏准确和实时的预制混凝土大梁的6自由度姿态数据,使得启动龙门操作主要是手动的,从而阻碍了进一步的自动化。这些限制提出了一个关键的问题:我们如何才能准确和稳健地估计预制混凝土梁的姿态,实时在发射龙门架操作?为了解决这个问题,本文提出了一种基于单目视觉的方法来估计预制混凝土梁在启动龙门作业中的六自由度姿态。该方法使用YOLOv11n模型检测ChArUco板区域,应用基于gan的图像去模糊。然后使用Perspective-n-Point求解器估计6-DoF姿态,并将其转换为龙门坐标系。现场测试显示了强大的性能,实现了0.113像素的平均重投影误差和每帧60毫秒的处理延迟。结果验证了该方法的鲁棒性和实时性,突出了单目视觉作为激光雷达- imu融合的一种经济有效的替代方案,适用于大规模自动化施工。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Autonomous launching gantry: Improved monocular vision approach for real-time pose estimation of precast concrete girders
The absence of accurate and real-time 6-DoF pose data for precast concrete girders renders launching gantry operations predominantly manual, thereby impeding further automation. Such limitations pose a critical question: How can we accurately and robustly estimate the pose of precast concrete girders in real-time during launching gantry operations? To address that question, our paper proposes a monocular vision-based approach to estimate the 6-DoF pose of the precast concrete girder in launching gantry operations. The approach detects the ChArUco board regions using the YOLOv11n model, applies GAN-based image deblurring. The 6-DoF pose is then estimated using a Perspective-n-Point solver and transformed to the gantry coordinate system. Field tests demonstrate robust performance, achieving a mean reprojection error of 0.113 pixels and a processing latency of 60 ms per frame. The results validate the approach's robustness and real-time performance, highlighting monocular vision as a cost-effective alternative to LiDAR–IMU fusion for large-scale automation in construction.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Automation in Construction
Automation in Construction 工程技术-工程:土木
CiteScore
19.20
自引率
16.50%
发文量
563
审稿时长
8.5 months
期刊介绍: Automation in Construction is an international journal that focuses on publishing original research papers related to the use of Information Technologies in various aspects of the construction industry. The journal covers topics such as design, engineering, construction technologies, and the maintenance and management of constructed facilities. The scope of Automation in Construction is extensive and covers all stages of the construction life cycle. This includes initial planning and design, construction of the facility, operation and maintenance, as well as the eventual dismantling and recycling of buildings and engineering structures.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信