基于图卷积网络的制造装配任务骨架动作识别

IF 14.2 1区工程技术 Q1 ENGINEERING, INDUSTRIAL

Journal of Manufacturing Systems Pub Date : 2025-06-30 DOI:10.1016/j.jmsy.2025.06.019

Maryam Soleymani , Mahdi Bonyani , Chao Wang

{"title":"基于图卷积网络的制造装配任务骨架动作识别","authors":"Maryam Soleymani , Mahdi Bonyani , Chao Wang","doi":"10.1016/j.jmsy.2025.06.019","DOIUrl":null,"url":null,"abstract":"<div><div>In modern manufacturing, human participation in assembly processes is essential, despite advancements in automation. However, accurately recognizing human actions in these environments presents challenges due to complex spatial–temporal dependencies and dynamic joint relationships. Graph Convolution Networks (GCNs) are utilized widely for action recognition, but they have poor accuracy for modeling long-range node correlations. Also, current GCNs have limitations in extracting various features due to utilizing the same pattern extraction for all frames. To overcome these issues, this study presents a novel approach to skeleton-based action recognition for manufacturing tasks using a Dual-Attention Graph Convolution Network (DAGCN). The proposed model integrates a Parallel Attention-Graph Mixer (PAGM) and Temporal–Spatial Attention Integrator (TSAI), enhancing the capture of both global and local joint relations and addressing the dynamic nature of skeletal joint relationships. Extensive evaluations on benchmark datasets, including HA4M that specifically designed for assembly tasks, NTU RGB+D, Northwestern-UCLA, and NTU RGB+D120, reveal the superior performance of DAGCN over state-of-the-art methods in terms of accuracy and computational efficiency. Experimental results demonstrate that DAGCN outperforms state-of-the-art methods, achieving a Top-1 accuracy of 89.0% on the HA4M dataset. The results validate DAGCN’s effectiveness in recognizing fine-grained human actions in industrial settings, contributing to improved efficiency and safety in human–robot collaboration. The proposed model offers a scalable and computationally efficient solution for intelligent assembly monitoring and automation in smart manufacturing systems.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"82 ","pages":"Pages 362-375"},"PeriodicalIF":14.2000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Skeleton-based action recognition for manufacturing assembly task through graph convolution network\",\"authors\":\"Maryam Soleymani , Mahdi Bonyani , Chao Wang\",\"doi\":\"10.1016/j.jmsy.2025.06.019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In modern manufacturing, human participation in assembly processes is essential, despite advancements in automation. However, accurately recognizing human actions in these environments presents challenges due to complex spatial–temporal dependencies and dynamic joint relationships. Graph Convolution Networks (GCNs) are utilized widely for action recognition, but they have poor accuracy for modeling long-range node correlations. Also, current GCNs have limitations in extracting various features due to utilizing the same pattern extraction for all frames. To overcome these issues, this study presents a novel approach to skeleton-based action recognition for manufacturing tasks using a Dual-Attention Graph Convolution Network (DAGCN). The proposed model integrates a Parallel Attention-Graph Mixer (PAGM) and Temporal–Spatial Attention Integrator (TSAI), enhancing the capture of both global and local joint relations and addressing the dynamic nature of skeletal joint relationships. Extensive evaluations on benchmark datasets, including HA4M that specifically designed for assembly tasks, NTU RGB+D, Northwestern-UCLA, and NTU RGB+D120, reveal the superior performance of DAGCN over state-of-the-art methods in terms of accuracy and computational efficiency. Experimental results demonstrate that DAGCN outperforms state-of-the-art methods, achieving a Top-1 accuracy of 89.0% on the HA4M dataset. The results validate DAGCN’s effectiveness in recognizing fine-grained human actions in industrial settings, contributing to improved efficiency and safety in human–robot collaboration. The proposed model offers a scalable and computationally efficient solution for intelligent assembly monitoring and automation in smart manufacturing systems.</div></div>\",\"PeriodicalId\":16227,\"journal\":{\"name\":\"Journal of Manufacturing Systems\",\"volume\":\"82 \",\"pages\":\"Pages 362-375\"},\"PeriodicalIF\":14.2000,\"publicationDate\":\"2025-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Manufacturing Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0278612525001700\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, INDUSTRIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0278612525001700","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}

引用次数: 0

摘要

在现代制造业中，尽管自动化程度有所提高，但人类参与装配过程是必不可少的。然而，由于复杂的时空依赖性和动态联合关系，在这些环境中准确识别人类行为面临挑战。图卷积网络（GCNs）广泛应用于动作识别，但在建模远程节点关联时精度较差。此外，由于对所有帧使用相同的模式提取，当前的GCNs在提取各种特征方面存在局限性。为了克服这些问题，本研究提出了一种基于骨架的制造任务动作识别的新方法，该方法使用双注意图卷积网络（DAGCN）。该模型集成了并行注意图混合器（PAGM）和时空注意积分器（TSAI），增强了对全局和局部关节关系的捕获，并解决了骨骼关节关系的动态性。对基准数据集的广泛评估，包括专门为装配任务设计的HA4M， NTU RGB+D，西北加州大学洛杉矶分校和NTU RGB+D120，揭示了DAGCN在准确性和计算效率方面优于最先进的方法。实验结果表明，DAGCN优于最先进的方法，在HA4M数据集上实现了89.0%的Top-1准确率。结果验证了DAGCN在识别工业环境中细粒度人类行为方面的有效性，有助于提高人机协作的效率和安全性。该模型为智能制造系统中的智能装配监控和自动化提供了一种可扩展且计算效率高的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Skeleton-based action recognition for manufacturing assembly task through graph convolution network

In modern manufacturing, human participation in assembly processes is essential, despite advancements in automation. However, accurately recognizing human actions in these environments presents challenges due to complex spatial–temporal dependencies and dynamic joint relationships. Graph Convolution Networks (GCNs) are utilized widely for action recognition, but they have poor accuracy for modeling long-range node correlations. Also, current GCNs have limitations in extracting various features due to utilizing the same pattern extraction for all frames. To overcome these issues, this study presents a novel approach to skeleton-based action recognition for manufacturing tasks using a Dual-Attention Graph Convolution Network (DAGCN). The proposed model integrates a Parallel Attention-Graph Mixer (PAGM) and Temporal–Spatial Attention Integrator (TSAI), enhancing the capture of both global and local joint relations and addressing the dynamic nature of skeletal joint relationships. Extensive evaluations on benchmark datasets, including HA4M that specifically designed for assembly tasks, NTU RGB+D, Northwestern-UCLA, and NTU RGB+D120, reveal the superior performance of DAGCN over state-of-the-art methods in terms of accuracy and computational efficiency. Experimental results demonstrate that DAGCN outperforms state-of-the-art methods, achieving a Top-1 accuracy of 89.0% on the HA4M dataset. The results validate DAGCN’s effectiveness in recognizing fine-grained human actions in industrial settings, contributing to improved efficiency and safety in human–robot collaboration. The proposed model offers a scalable and computationally efficient solution for intelligent assembly monitoring and automation in smart manufacturing systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Manufacturing Systems 工程技术-工程：工业

CiteScore

23.30

自引率

13.20%

发文量

216

审稿时长

25 days

期刊介绍： The Journal of Manufacturing Systems is dedicated to showcasing cutting-edge fundamental and applied research in manufacturing at the systems level. Encompassing products, equipment, people, information, control, and support functions, manufacturing systems play a pivotal role in the economical and competitive development, production, delivery, and total lifecycle of products, meeting market and societal needs. With a commitment to publishing archival scholarly literature, the journal strives to advance the state of the art in manufacturing systems and foster innovation in crafting efficient, robust, and sustainable manufacturing systems. The focus extends from equipment-level considerations to the broader scope of the extended enterprise. The Journal welcomes research addressing challenges across various scales, including nano, micro, and macro-scale manufacturing, and spanning diverse sectors such as aerospace, automotive, energy, and medical device manufacturing.