Yixuan Liang, Ze Wang, Yunan Wang, Jichuan Yu, Jizhou Yan, Shize Lin, Zhao Jin, Jiuru Lu, Chuxiong Hu
{"title":"基于隐藏动作空间启发式软行为者评价的机器人长短期安全控制框架","authors":"Yixuan Liang, Ze Wang, Yunan Wang, Jichuan Yu, Jizhou Yan, Shize Lin, Zhao Jin, Jiuru Lu, Chuxiong Hu","doi":"10.1016/j.rcim.2025.103107","DOIUrl":null,"url":null,"abstract":"<div><div>Real-time safety control for robots in dynamic environments is a critical and challenging problem in robotics. With the advent of intelligent manufacturing, the demand for advanced safety control technologies in robotics has steadily increased. Robot planning typically involves global and local approaches, but both face limitations in real-time safety control in dynamic environments with unknown obstacles. Recent hybrid frameworks have shown progress, but challenges persist, including limited perception capabilities and poor coordination between global and local components. To address these challenges, this work proposes a novel long short term safety control framework leveraging reinforcement learning for decision-making. Perception and planning are decoupled into long-term and short-term components, with long-term perception utilizing unsupervised clustering DBSCAN for structured environment information and short-term perception enhancing efficiency through prior knowledge. Long-term planning provides reference trajectories based on static environments, while short-term planning adjusts these trajectories in real time for local safety using control barrier functions. Based on hidden action heuristic soft actor–critic and curriculum learning, the decision-making mechanism ensures safety during obstacles or attacks and maximizes robot efficiency without compromising safety. Experiments are conducted with 10,000 randomized obstacle collision scenarios, and our framework is compared with four methods, including SAC and manually designed trajectory adjustment. The results demonstrate that our approach outperforms these methods in both safety performance and operational efficiency. Finally, the system is successfully implemented in a physical environment, showcasing its practical potential for real-world applications.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"98 ","pages":"Article 103107"},"PeriodicalIF":11.4000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Long short term robot safe control framework based on hidden action space heuristic soft actor–critic\",\"authors\":\"Yixuan Liang, Ze Wang, Yunan Wang, Jichuan Yu, Jizhou Yan, Shize Lin, Zhao Jin, Jiuru Lu, Chuxiong Hu\",\"doi\":\"10.1016/j.rcim.2025.103107\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Real-time safety control for robots in dynamic environments is a critical and challenging problem in robotics. With the advent of intelligent manufacturing, the demand for advanced safety control technologies in robotics has steadily increased. Robot planning typically involves global and local approaches, but both face limitations in real-time safety control in dynamic environments with unknown obstacles. Recent hybrid frameworks have shown progress, but challenges persist, including limited perception capabilities and poor coordination between global and local components. To address these challenges, this work proposes a novel long short term safety control framework leveraging reinforcement learning for decision-making. Perception and planning are decoupled into long-term and short-term components, with long-term perception utilizing unsupervised clustering DBSCAN for structured environment information and short-term perception enhancing efficiency through prior knowledge. Long-term planning provides reference trajectories based on static environments, while short-term planning adjusts these trajectories in real time for local safety using control barrier functions. Based on hidden action heuristic soft actor–critic and curriculum learning, the decision-making mechanism ensures safety during obstacles or attacks and maximizes robot efficiency without compromising safety. Experiments are conducted with 10,000 randomized obstacle collision scenarios, and our framework is compared with four methods, including SAC and manually designed trajectory adjustment. The results demonstrate that our approach outperforms these methods in both safety performance and operational efficiency. Finally, the system is successfully implemented in a physical environment, showcasing its practical potential for real-world applications.</div></div>\",\"PeriodicalId\":21452,\"journal\":{\"name\":\"Robotics and Computer-integrated Manufacturing\",\"volume\":\"98 \",\"pages\":\"Article 103107\"},\"PeriodicalIF\":11.4000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Robotics and Computer-integrated Manufacturing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0736584525001619\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Computer-integrated Manufacturing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0736584525001619","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Long short term robot safe control framework based on hidden action space heuristic soft actor–critic
Real-time safety control for robots in dynamic environments is a critical and challenging problem in robotics. With the advent of intelligent manufacturing, the demand for advanced safety control technologies in robotics has steadily increased. Robot planning typically involves global and local approaches, but both face limitations in real-time safety control in dynamic environments with unknown obstacles. Recent hybrid frameworks have shown progress, but challenges persist, including limited perception capabilities and poor coordination between global and local components. To address these challenges, this work proposes a novel long short term safety control framework leveraging reinforcement learning for decision-making. Perception and planning are decoupled into long-term and short-term components, with long-term perception utilizing unsupervised clustering DBSCAN for structured environment information and short-term perception enhancing efficiency through prior knowledge. Long-term planning provides reference trajectories based on static environments, while short-term planning adjusts these trajectories in real time for local safety using control barrier functions. Based on hidden action heuristic soft actor–critic and curriculum learning, the decision-making mechanism ensures safety during obstacles or attacks and maximizes robot efficiency without compromising safety. Experiments are conducted with 10,000 randomized obstacle collision scenarios, and our framework is compared with four methods, including SAC and manually designed trajectory adjustment. The results demonstrate that our approach outperforms these methods in both safety performance and operational efficiency. Finally, the system is successfully implemented in a physical environment, showcasing its practical potential for real-world applications.
期刊介绍:
The journal, Robotics and Computer-Integrated Manufacturing, focuses on sharing research applications that contribute to the development of new or enhanced robotics, manufacturing technologies, and innovative manufacturing strategies that are relevant to industry. Papers that combine theory and experimental validation are preferred, while review papers on current robotics and manufacturing issues are also considered. However, papers on traditional machining processes, modeling and simulation, supply chain management, and resource optimization are generally not within the scope of the journal, as there are more appropriate journals for these topics. Similarly, papers that are overly theoretical or mathematical will be directed to other suitable journals. The journal welcomes original papers in areas such as industrial robotics, human-robot collaboration in manufacturing, cloud-based manufacturing, cyber-physical production systems, big data analytics in manufacturing, smart mechatronics, machine learning, adaptive and sustainable manufacturing, and other fields involving unique manufacturing technologies.