{"title":"Large and small-scale models’ fusion-driven proactive robotic manipulation control for human-robot collaborative assembly in industry 5.0","authors":"Dongxu Ma , Chao Zhang , Qingfeng Xu , Guanghui Zhou","doi":"10.1016/j.rcim.2025.103078","DOIUrl":null,"url":null,"abstract":"<div><div>Human-robot collaborative (HRC) assembly has been popular by combining human creativity and dexterity with robotic precision for higher assembly efficiency and resilience in industry 5.0. Nevertheless, current HRC assembly systems rely predefined codes, limiting robot adaptability to dynamic and unstructured assembly environments. To bridge the gap, this paper proposes a novel proactive robotic manipulation control method for HRC assembly, which fully utilizes large-scale model (LSM) in cognitive computing and reasoning for dynamic robotic control path planning, and small-scale models (SSMs) in efficiently computing for dynamic robotic control demand perception and control constraints verification. Specifically, LSM, namely ChatGPT 4o, is deployed on the cloud to proactively generate robotic control constraints according to the robotic control demand derived from SSMs on the edge. Here, two kinds of SSMs are developed, including robotic control demands perception model and robotic control constraints verification model. For robotic control demands perception, an ensemble encoder model is proposed for ongoing human assembly action detection, on which a vision model and fine-tuned assembly instruction generation model are designed for assembly manipulation keypoints image and robot control instruction generation, serving as the input for LSM. For robotic control constraints verification, a digital twin model is used to verify the control constraints derived from LSM, where verified constraints are used for robotic control during assembly process. Finally, the feasibility and effectiveness of the proposed approach are demonstrated through experiments on an HRC assembly process, where over 99 % accuracy for human assembly action detection and 80 % task execution accuracy are conducted.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"97 ","pages":"Article 103078"},"PeriodicalIF":11.4000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Computer-integrated Manufacturing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0736584525001322","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Human-robot collaborative (HRC) assembly has been popular by combining human creativity and dexterity with robotic precision for higher assembly efficiency and resilience in industry 5.0. Nevertheless, current HRC assembly systems rely predefined codes, limiting robot adaptability to dynamic and unstructured assembly environments. To bridge the gap, this paper proposes a novel proactive robotic manipulation control method for HRC assembly, which fully utilizes large-scale model (LSM) in cognitive computing and reasoning for dynamic robotic control path planning, and small-scale models (SSMs) in efficiently computing for dynamic robotic control demand perception and control constraints verification. Specifically, LSM, namely ChatGPT 4o, is deployed on the cloud to proactively generate robotic control constraints according to the robotic control demand derived from SSMs on the edge. Here, two kinds of SSMs are developed, including robotic control demands perception model and robotic control constraints verification model. For robotic control demands perception, an ensemble encoder model is proposed for ongoing human assembly action detection, on which a vision model and fine-tuned assembly instruction generation model are designed for assembly manipulation keypoints image and robot control instruction generation, serving as the input for LSM. For robotic control constraints verification, a digital twin model is used to verify the control constraints derived from LSM, where verified constraints are used for robotic control during assembly process. Finally, the feasibility and effectiveness of the proposed approach are demonstrated through experiments on an HRC assembly process, where over 99 % accuracy for human assembly action detection and 80 % task execution accuracy are conducted.
期刊介绍:
The journal, Robotics and Computer-Integrated Manufacturing, focuses on sharing research applications that contribute to the development of new or enhanced robotics, manufacturing technologies, and innovative manufacturing strategies that are relevant to industry. Papers that combine theory and experimental validation are preferred, while review papers on current robotics and manufacturing issues are also considered. However, papers on traditional machining processes, modeling and simulation, supply chain management, and resource optimization are generally not within the scope of the journal, as there are more appropriate journals for these topics. Similarly, papers that are overly theoretical or mathematical will be directed to other suitable journals. The journal welcomes original papers in areas such as industrial robotics, human-robot collaboration in manufacturing, cloud-based manufacturing, cyber-physical production systems, big data analytics in manufacturing, smart mechatronics, machine learning, adaptive and sustainable manufacturing, and other fields involving unique manufacturing technologies.