Pengfei Ding , Jie Zhang , Peng Zhang , Hongsen Li , Dexian Wang
{"title":"CCM-FCC: LLM-powered cognition-centered AI agent framework for proactive human-robot collaboration","authors":"Pengfei Ding , Jie Zhang , Peng Zhang , Hongsen Li , Dexian Wang","doi":"10.1016/j.rcim.2025.103145","DOIUrl":null,"url":null,"abstract":"<div><div>Proactive human-robot collaboration (PHRC) primarily relies on predefined rule-based integration of perception, analysis and decision-making into a unified framework, which limits its autonomy and interactivity in dynamic scenarios such as disassembly and assembly. Although AI agent equipped with memory and interaction functions exhibits enhanced adaptability, their task-specific designs result in a lack of holistic cognition, thereby limiting their generalization capability. This paper proposes a Large Language Model (LLM)-powered cognition-centered AI agent framework, which addresses these challenges through the “Cognitive Core Management–Functional Cluster Collaboration” (CCM-FCC) paradigm. Specifically, to enhance the generalization capability of the AI agent, we developed a semantic Chain-of-Thought (CoT) prompt learning-driven cognitive core for predicting key task factors. The semantic CoT prompt learning, which couples task semantics with reasoning logic, empowers the pre-trained LLM to improve the key factors prediction. Subsequently, to ensure centralized management of the cognitive core, we designed a dual-dimensional feature-constrained functional activation module. It extracts task semantic cues from the key factors and autonomously activates functional modules within the AI agent, constrained by task complexity and operator state. Furthermore, a task-semantic-driven functional cluster collaboration module is proposed to generate the optimal collaboration strategy. Finally, a deep reinforcement learning model is constructed to enable the robot to proactively collaborate with the operator for PHRC. The experiments on HRC tasks demonstrates the effectiveness of the proposed method.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"98 ","pages":"Article 103145"},"PeriodicalIF":11.4000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Computer-integrated Manufacturing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0736584525001991","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Proactive human-robot collaboration (PHRC) primarily relies on predefined rule-based integration of perception, analysis and decision-making into a unified framework, which limits its autonomy and interactivity in dynamic scenarios such as disassembly and assembly. Although AI agent equipped with memory and interaction functions exhibits enhanced adaptability, their task-specific designs result in a lack of holistic cognition, thereby limiting their generalization capability. This paper proposes a Large Language Model (LLM)-powered cognition-centered AI agent framework, which addresses these challenges through the “Cognitive Core Management–Functional Cluster Collaboration” (CCM-FCC) paradigm. Specifically, to enhance the generalization capability of the AI agent, we developed a semantic Chain-of-Thought (CoT) prompt learning-driven cognitive core for predicting key task factors. The semantic CoT prompt learning, which couples task semantics with reasoning logic, empowers the pre-trained LLM to improve the key factors prediction. Subsequently, to ensure centralized management of the cognitive core, we designed a dual-dimensional feature-constrained functional activation module. It extracts task semantic cues from the key factors and autonomously activates functional modules within the AI agent, constrained by task complexity and operator state. Furthermore, a task-semantic-driven functional cluster collaboration module is proposed to generate the optimal collaboration strategy. Finally, a deep reinforcement learning model is constructed to enable the robot to proactively collaborate with the operator for PHRC. The experiments on HRC tasks demonstrates the effectiveness of the proposed method.
期刊介绍:
The journal, Robotics and Computer-Integrated Manufacturing, focuses on sharing research applications that contribute to the development of new or enhanced robotics, manufacturing technologies, and innovative manufacturing strategies that are relevant to industry. Papers that combine theory and experimental validation are preferred, while review papers on current robotics and manufacturing issues are also considered. However, papers on traditional machining processes, modeling and simulation, supply chain management, and resource optimization are generally not within the scope of the journal, as there are more appropriate journals for these topics. Similarly, papers that are overly theoretical or mathematical will be directed to other suitable journals. The journal welcomes original papers in areas such as industrial robotics, human-robot collaboration in manufacturing, cloud-based manufacturing, cyber-physical production systems, big data analytics in manufacturing, smart mechatronics, machine learning, adaptive and sustainable manufacturing, and other fields involving unique manufacturing technologies.