{"title":"基于图卷积网络的制造装配任务骨架动作识别","authors":"Maryam Soleymani , Mahdi Bonyani , Chao Wang","doi":"10.1016/j.jmsy.2025.06.019","DOIUrl":null,"url":null,"abstract":"<div><div>In modern manufacturing, human participation in assembly processes is essential, despite advancements in automation. However, accurately recognizing human actions in these environments presents challenges due to complex spatial–temporal dependencies and dynamic joint relationships. Graph Convolution Networks (GCNs) are utilized widely for action recognition, but they have poor accuracy for modeling long-range node correlations. Also, current GCNs have limitations in extracting various features due to utilizing the same pattern extraction for all frames. To overcome these issues, this study presents a novel approach to skeleton-based action recognition for manufacturing tasks using a Dual-Attention Graph Convolution Network (DAGCN). The proposed model integrates a Parallel Attention-Graph Mixer (PAGM) and Temporal–Spatial Attention Integrator (TSAI), enhancing the capture of both global and local joint relations and addressing the dynamic nature of skeletal joint relationships. Extensive evaluations on benchmark datasets, including HA4M that specifically designed for assembly tasks, NTU RGB+D, Northwestern-UCLA, and NTU RGB+D120, reveal the superior performance of DAGCN over state-of-the-art methods in terms of accuracy and computational efficiency. Experimental results demonstrate that DAGCN outperforms state-of-the-art methods, achieving a Top-1 accuracy of 89.0% on the HA4M dataset. The results validate DAGCN’s effectiveness in recognizing fine-grained human actions in industrial settings, contributing to improved efficiency and safety in human–robot collaboration. The proposed model offers a scalable and computationally efficient solution for intelligent assembly monitoring and automation in smart manufacturing systems.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"82 ","pages":"Pages 362-375"},"PeriodicalIF":14.2000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Skeleton-based action recognition for manufacturing assembly task through graph convolution network\",\"authors\":\"Maryam Soleymani , Mahdi Bonyani , Chao Wang\",\"doi\":\"10.1016/j.jmsy.2025.06.019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In modern manufacturing, human participation in assembly processes is essential, despite advancements in automation. However, accurately recognizing human actions in these environments presents challenges due to complex spatial–temporal dependencies and dynamic joint relationships. Graph Convolution Networks (GCNs) are utilized widely for action recognition, but they have poor accuracy for modeling long-range node correlations. Also, current GCNs have limitations in extracting various features due to utilizing the same pattern extraction for all frames. To overcome these issues, this study presents a novel approach to skeleton-based action recognition for manufacturing tasks using a Dual-Attention Graph Convolution Network (DAGCN). The proposed model integrates a Parallel Attention-Graph Mixer (PAGM) and Temporal–Spatial Attention Integrator (TSAI), enhancing the capture of both global and local joint relations and addressing the dynamic nature of skeletal joint relationships. Extensive evaluations on benchmark datasets, including HA4M that specifically designed for assembly tasks, NTU RGB+D, Northwestern-UCLA, and NTU RGB+D120, reveal the superior performance of DAGCN over state-of-the-art methods in terms of accuracy and computational efficiency. Experimental results demonstrate that DAGCN outperforms state-of-the-art methods, achieving a Top-1 accuracy of 89.0% on the HA4M dataset. The results validate DAGCN’s effectiveness in recognizing fine-grained human actions in industrial settings, contributing to improved efficiency and safety in human–robot collaboration. The proposed model offers a scalable and computationally efficient solution for intelligent assembly monitoring and automation in smart manufacturing systems.</div></div>\",\"PeriodicalId\":16227,\"journal\":{\"name\":\"Journal of Manufacturing Systems\",\"volume\":\"82 \",\"pages\":\"Pages 362-375\"},\"PeriodicalIF\":14.2000,\"publicationDate\":\"2025-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Manufacturing Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0278612525001700\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, INDUSTRIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0278612525001700","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
Skeleton-based action recognition for manufacturing assembly task through graph convolution network
In modern manufacturing, human participation in assembly processes is essential, despite advancements in automation. However, accurately recognizing human actions in these environments presents challenges due to complex spatial–temporal dependencies and dynamic joint relationships. Graph Convolution Networks (GCNs) are utilized widely for action recognition, but they have poor accuracy for modeling long-range node correlations. Also, current GCNs have limitations in extracting various features due to utilizing the same pattern extraction for all frames. To overcome these issues, this study presents a novel approach to skeleton-based action recognition for manufacturing tasks using a Dual-Attention Graph Convolution Network (DAGCN). The proposed model integrates a Parallel Attention-Graph Mixer (PAGM) and Temporal–Spatial Attention Integrator (TSAI), enhancing the capture of both global and local joint relations and addressing the dynamic nature of skeletal joint relationships. Extensive evaluations on benchmark datasets, including HA4M that specifically designed for assembly tasks, NTU RGB+D, Northwestern-UCLA, and NTU RGB+D120, reveal the superior performance of DAGCN over state-of-the-art methods in terms of accuracy and computational efficiency. Experimental results demonstrate that DAGCN outperforms state-of-the-art methods, achieving a Top-1 accuracy of 89.0% on the HA4M dataset. The results validate DAGCN’s effectiveness in recognizing fine-grained human actions in industrial settings, contributing to improved efficiency and safety in human–robot collaboration. The proposed model offers a scalable and computationally efficient solution for intelligent assembly monitoring and automation in smart manufacturing systems.
期刊介绍:
The Journal of Manufacturing Systems is dedicated to showcasing cutting-edge fundamental and applied research in manufacturing at the systems level. Encompassing products, equipment, people, information, control, and support functions, manufacturing systems play a pivotal role in the economical and competitive development, production, delivery, and total lifecycle of products, meeting market and societal needs.
With a commitment to publishing archival scholarly literature, the journal strives to advance the state of the art in manufacturing systems and foster innovation in crafting efficient, robust, and sustainable manufacturing systems. The focus extends from equipment-level considerations to the broader scope of the extended enterprise. The Journal welcomes research addressing challenges across various scales, including nano, micro, and macro-scale manufacturing, and spanning diverse sectors such as aerospace, automotive, energy, and medical device manufacturing.