Dynamic gesture recognition during human–robot interaction in autonomous earthmoving machinery used for construction

IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Shiwei Guan, Jiajun Wang, Xiaoling Wang, Chen Ding, Hongyang Liang, Qi Wei
{"title":"Dynamic gesture recognition during human–robot interaction in autonomous earthmoving machinery used for construction","authors":"Shiwei Guan,&nbsp;Jiajun Wang,&nbsp;Xiaoling Wang,&nbsp;Chen Ding,&nbsp;Hongyang Liang,&nbsp;Qi Wei","doi":"10.1016/j.aei.2025.103315","DOIUrl":null,"url":null,"abstract":"<div><div>Effective interaction between operators and autonomous earthmoving machinery can accurately convey the rich engineering experience of operators to machines, ensuring efficient human–robot collaboration in construction. In this study, we propose a pipeline for dynamic gesture interaction between authorised operators and autonomous earthmoving machinery. Initially, the autonomous earthmoving machinery preprocessed the video stream using video restoration algorithms if it operated under harsh environmental conditions. Subsequently, the machinery used a safety helmet colour detection algorithm based on YOLOv8 to determine whether an operator has the authorisation to interact with it by recognising the colour of the safety helmet worn by the operator, thereby preventing incorrect operations of the machinery from unauthorised operators. Finally, the autonomous earthmoving machinery utilised the proposed video swin transformer with Adapt multilayer perceptron (AdaptViSwT) dynamic gesture recognition algorithm to recognise dynamic gesture instructions provided by authorised operators and execute the corresponding operations, enabling human–robot collaboration under complex construction conditions. To train the proposed AdaptViSwT effectively, we established a dynamic gesture interaction dataset comprising 6,502 videos that contained nine commonly used instructions for commanding earthmoving machinery. The experiments verified that, on construction-site datasets, the proposed pipeline achieved 91.2% accuracy in detecting authorised worker. In dynamic gesture recognition, it achieved 98.32% accuracy and 98.44% F1-score. These results effectively ensure the safety and reliability of human-robot collaborative construction.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103315"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625002083","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Effective interaction between operators and autonomous earthmoving machinery can accurately convey the rich engineering experience of operators to machines, ensuring efficient human–robot collaboration in construction. In this study, we propose a pipeline for dynamic gesture interaction between authorised operators and autonomous earthmoving machinery. Initially, the autonomous earthmoving machinery preprocessed the video stream using video restoration algorithms if it operated under harsh environmental conditions. Subsequently, the machinery used a safety helmet colour detection algorithm based on YOLOv8 to determine whether an operator has the authorisation to interact with it by recognising the colour of the safety helmet worn by the operator, thereby preventing incorrect operations of the machinery from unauthorised operators. Finally, the autonomous earthmoving machinery utilised the proposed video swin transformer with Adapt multilayer perceptron (AdaptViSwT) dynamic gesture recognition algorithm to recognise dynamic gesture instructions provided by authorised operators and execute the corresponding operations, enabling human–robot collaboration under complex construction conditions. To train the proposed AdaptViSwT effectively, we established a dynamic gesture interaction dataset comprising 6,502 videos that contained nine commonly used instructions for commanding earthmoving machinery. The experiments verified that, on construction-site datasets, the proposed pipeline achieved 91.2% accuracy in detecting authorised worker. In dynamic gesture recognition, it achieved 98.32% accuracy and 98.44% F1-score. These results effectively ensure the safety and reliability of human-robot collaborative construction.
工程用自主土方机械人机交互中的动态手势识别
操作人员与自主土方机械之间的有效交互,可以将操作人员丰富的工程经验准确地传递给机器,确保高效的人机协作施工。在这项研究中,我们提出了一个管道,用于授权操作员和自主土方机械之间的动态手势交互。最初,如果自主土方机械在恶劣环境条件下运行,则使用视频恢复算法对视频流进行预处理。随后,机器使用基于YOLOv8的安全帽颜色检测算法,通过识别操作人员所戴安全帽的颜色来确定操作人员是否有权与机器进行交互,从而防止未经授权的操作人员对机器进行错误操作。最后,自主土方机械利用本文提出的带有Adapt多层感知器(AdaptViSwT)动态手势识别算法的视频旋转变压器,识别授权操作员提供的动态手势指令,并执行相应的操作,实现复杂施工条件下的人机协作。为了有效地训练所提出的AdaptViSwT,我们建立了一个包含6502个视频的动态手势交互数据集,其中包含9个常用的土方机械指令。实验证实,在施工现场数据集上,该管道检测授权工人的准确率达到91.2%。在动态手势识别方面,准确率达到98.32%,f1得分达到98.44%。这些结果有效地保证了人机协同施工的安全性和可靠性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advanced Engineering Informatics
Advanced Engineering Informatics 工程技术-工程:综合
CiteScore
12.40
自引率
18.20%
发文量
292
审稿时长
45 days
期刊介绍: Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信