Ye Ma, Dunbing Tang, Haihua Zhu, Qixiang Cai, Zequn Zhang, Liping Wang, Changchun Liu
{"title":"面向工业5.0的探测ar辅助无缝HRC装配:多模态相互认知和llm驱动的知识推理","authors":"Ye Ma, Dunbing Tang, Haihua Zhu, Qixiang Cai, Zequn Zhang, Liping Wang, Changchun Liu","doi":"10.1016/j.rcim.2025.103112","DOIUrl":null,"url":null,"abstract":"<div><div>The advancement of Industry 5.0 fosters human-centric manufacturing, aiming to enhance the well-being and needs of operators. Current research on Human-robot Collaboration (HRC) in the assembly scene is progressively evolving toward intelligent manufacturing with a human-centric focus, particularly in safety and interaction. However, in existing HRC assembly environments, the disjointed relationship between humans and robots presents challenges in handling complex manufacturing tasks. Cobots often struggle to accurately comprehend human actions and assembly contexts, while operators lack real-time insights into the current assembly scene. This mismatch increases the cognitive and physical burden on operators, ultimately reducing assembly efficiency and quality. To address this issue and achieve seamless HRC assembly, this paper proposes an Augmented Reality (AR) assisted HRC assembly method that integrates multimodal mutual cognition with Large Language Model (LLM) driven knowledge reasoning. Firstly, a Transformer-based multimodal fusion perception method for HRC scenarios is proposed to overcome the limitations of current cobots in understanding and responding to human commands and environmental changes in real-time. Based on this, a human-centric mutual cognition and safety framework for HRC is proposed to mitigate interaction risks and promote intelligent coexistence between operators and cobots. Secondly, an LLM-driven assembly knowledge reasoning system is proposed. By constructing an assembly craft information model with semantic associations and dynamic updating capabilities, the system facilitates decision support for HRC tasks and provides optimized assembly plan recommendations. This approach effectively leverages the respective advantages of human operators and cobots in executing assembly tasks. In addition, an AR-based HRC assembly assistance system is designed to enhance visual guidance during the assembly process while integrating the functionalities above. Finally, practical validation is conducted in real-world assembly tasks. Experimental results demonstrate that the proposed method offers efficient, intelligent, and visualized support for HRC assembly tasks, significantly reducing the cognitive load on operators while improving assembly efficiency and product consistency.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"97 ","pages":"Article 103112"},"PeriodicalIF":11.4000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Probing AR-assisted seamless HRC assembly for industry 5.0: Multi-modal mutual cognition and LLM-driven knowledge reasoning\",\"authors\":\"Ye Ma, Dunbing Tang, Haihua Zhu, Qixiang Cai, Zequn Zhang, Liping Wang, Changchun Liu\",\"doi\":\"10.1016/j.rcim.2025.103112\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The advancement of Industry 5.0 fosters human-centric manufacturing, aiming to enhance the well-being and needs of operators. Current research on Human-robot Collaboration (HRC) in the assembly scene is progressively evolving toward intelligent manufacturing with a human-centric focus, particularly in safety and interaction. However, in existing HRC assembly environments, the disjointed relationship between humans and robots presents challenges in handling complex manufacturing tasks. Cobots often struggle to accurately comprehend human actions and assembly contexts, while operators lack real-time insights into the current assembly scene. This mismatch increases the cognitive and physical burden on operators, ultimately reducing assembly efficiency and quality. To address this issue and achieve seamless HRC assembly, this paper proposes an Augmented Reality (AR) assisted HRC assembly method that integrates multimodal mutual cognition with Large Language Model (LLM) driven knowledge reasoning. Firstly, a Transformer-based multimodal fusion perception method for HRC scenarios is proposed to overcome the limitations of current cobots in understanding and responding to human commands and environmental changes in real-time. Based on this, a human-centric mutual cognition and safety framework for HRC is proposed to mitigate interaction risks and promote intelligent coexistence between operators and cobots. Secondly, an LLM-driven assembly knowledge reasoning system is proposed. By constructing an assembly craft information model with semantic associations and dynamic updating capabilities, the system facilitates decision support for HRC tasks and provides optimized assembly plan recommendations. This approach effectively leverages the respective advantages of human operators and cobots in executing assembly tasks. In addition, an AR-based HRC assembly assistance system is designed to enhance visual guidance during the assembly process while integrating the functionalities above. Finally, practical validation is conducted in real-world assembly tasks. Experimental results demonstrate that the proposed method offers efficient, intelligent, and visualized support for HRC assembly tasks, significantly reducing the cognitive load on operators while improving assembly efficiency and product consistency.</div></div>\",\"PeriodicalId\":21452,\"journal\":{\"name\":\"Robotics and Computer-integrated Manufacturing\",\"volume\":\"97 \",\"pages\":\"Article 103112\"},\"PeriodicalIF\":11.4000,\"publicationDate\":\"2025-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Robotics and Computer-integrated Manufacturing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0736584525001668\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Computer-integrated Manufacturing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0736584525001668","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Probing AR-assisted seamless HRC assembly for industry 5.0: Multi-modal mutual cognition and LLM-driven knowledge reasoning
The advancement of Industry 5.0 fosters human-centric manufacturing, aiming to enhance the well-being and needs of operators. Current research on Human-robot Collaboration (HRC) in the assembly scene is progressively evolving toward intelligent manufacturing with a human-centric focus, particularly in safety and interaction. However, in existing HRC assembly environments, the disjointed relationship between humans and robots presents challenges in handling complex manufacturing tasks. Cobots often struggle to accurately comprehend human actions and assembly contexts, while operators lack real-time insights into the current assembly scene. This mismatch increases the cognitive and physical burden on operators, ultimately reducing assembly efficiency and quality. To address this issue and achieve seamless HRC assembly, this paper proposes an Augmented Reality (AR) assisted HRC assembly method that integrates multimodal mutual cognition with Large Language Model (LLM) driven knowledge reasoning. Firstly, a Transformer-based multimodal fusion perception method for HRC scenarios is proposed to overcome the limitations of current cobots in understanding and responding to human commands and environmental changes in real-time. Based on this, a human-centric mutual cognition and safety framework for HRC is proposed to mitigate interaction risks and promote intelligent coexistence between operators and cobots. Secondly, an LLM-driven assembly knowledge reasoning system is proposed. By constructing an assembly craft information model with semantic associations and dynamic updating capabilities, the system facilitates decision support for HRC tasks and provides optimized assembly plan recommendations. This approach effectively leverages the respective advantages of human operators and cobots in executing assembly tasks. In addition, an AR-based HRC assembly assistance system is designed to enhance visual guidance during the assembly process while integrating the functionalities above. Finally, practical validation is conducted in real-world assembly tasks. Experimental results demonstrate that the proposed method offers efficient, intelligent, and visualized support for HRC assembly tasks, significantly reducing the cognitive load on operators while improving assembly efficiency and product consistency.
期刊介绍:
The journal, Robotics and Computer-Integrated Manufacturing, focuses on sharing research applications that contribute to the development of new or enhanced robotics, manufacturing technologies, and innovative manufacturing strategies that are relevant to industry. Papers that combine theory and experimental validation are preferred, while review papers on current robotics and manufacturing issues are also considered. However, papers on traditional machining processes, modeling and simulation, supply chain management, and resource optimization are generally not within the scope of the journal, as there are more appropriate journals for these topics. Similarly, papers that are overly theoretical or mathematical will be directed to other suitable journals. The journal welcomes original papers in areas such as industrial robotics, human-robot collaboration in manufacturing, cloud-based manufacturing, cyber-physical production systems, big data analytics in manufacturing, smart mechatronics, machine learning, adaptive and sustainable manufacturing, and other fields involving unique manufacturing technologies.