{"title":"集成强化学习和虚拟夹具的更安全自动机器人手术","authors":"Ke Fan;Ziyang Chen","doi":"10.1109/LRA.2025.3559826","DOIUrl":null,"url":null,"abstract":"A primary concern in robotic automation is safety, especially in surgical scenarios. In this letter, we propose a virtual fixture (VF) based safe reinforcement learning framework to ensure safety constraints. The framework ensures that the agent, particularly multi-joint robotic manipulator agents, acts within the hard constraints. In the training phase, VF confines the exploration of the agent within a safe operational space. The core idea is that once the agent violates the VF, it will be pushed back to the safe region. Then, the safe action corrected by the VF is collected and forms a safe experience used for subsequent policy optimization, which we refer to as safety experience reshaping (SER). Subsequently, we design a visual module to detect safety constraints to construct the VF and transfer the trained policy to the real robot. We compare our framework to 5 state-of-the-art RL methods and a nonlearning-based method. Results show that our framework gets a lower rate of constraint violations and better performance in task success. Furthermore, in addition to the static constraint tasks, we also designed two tasks involving dynamic constraints, highlighting the superiority of our method in handling dynamic constraints. The videos of our physical experiment can be found in the following links (Lymph node removal, Human-robot collaboration 1, Human-robot collaboration 2).","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"10 6","pages":"5265-5272"},"PeriodicalIF":4.6000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrating Reinforcement Learning and Virtual Fixtures for Safer Automatic Robotic Surgery\",\"authors\":\"Ke Fan;Ziyang Chen\",\"doi\":\"10.1109/LRA.2025.3559826\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A primary concern in robotic automation is safety, especially in surgical scenarios. In this letter, we propose a virtual fixture (VF) based safe reinforcement learning framework to ensure safety constraints. The framework ensures that the agent, particularly multi-joint robotic manipulator agents, acts within the hard constraints. In the training phase, VF confines the exploration of the agent within a safe operational space. The core idea is that once the agent violates the VF, it will be pushed back to the safe region. Then, the safe action corrected by the VF is collected and forms a safe experience used for subsequent policy optimization, which we refer to as safety experience reshaping (SER). Subsequently, we design a visual module to detect safety constraints to construct the VF and transfer the trained policy to the real robot. We compare our framework to 5 state-of-the-art RL methods and a nonlearning-based method. Results show that our framework gets a lower rate of constraint violations and better performance in task success. Furthermore, in addition to the static constraint tasks, we also designed two tasks involving dynamic constraints, highlighting the superiority of our method in handling dynamic constraints. The videos of our physical experiment can be found in the following links (Lymph node removal, Human-robot collaboration 1, Human-robot collaboration 2).\",\"PeriodicalId\":13241,\"journal\":{\"name\":\"IEEE Robotics and Automation Letters\",\"volume\":\"10 6\",\"pages\":\"5265-5272\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Robotics and Automation Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10964172/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ROBOTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Robotics and Automation Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10964172/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}
Integrating Reinforcement Learning and Virtual Fixtures for Safer Automatic Robotic Surgery
A primary concern in robotic automation is safety, especially in surgical scenarios. In this letter, we propose a virtual fixture (VF) based safe reinforcement learning framework to ensure safety constraints. The framework ensures that the agent, particularly multi-joint robotic manipulator agents, acts within the hard constraints. In the training phase, VF confines the exploration of the agent within a safe operational space. The core idea is that once the agent violates the VF, it will be pushed back to the safe region. Then, the safe action corrected by the VF is collected and forms a safe experience used for subsequent policy optimization, which we refer to as safety experience reshaping (SER). Subsequently, we design a visual module to detect safety constraints to construct the VF and transfer the trained policy to the real robot. We compare our framework to 5 state-of-the-art RL methods and a nonlearning-based method. Results show that our framework gets a lower rate of constraint violations and better performance in task success. Furthermore, in addition to the static constraint tasks, we also designed two tasks involving dynamic constraints, highlighting the superiority of our method in handling dynamic constraints. The videos of our physical experiment can be found in the following links (Lymph node removal, Human-robot collaboration 1, Human-robot collaboration 2).
期刊介绍:
The scope of this journal is to publish peer-reviewed articles that provide a timely and concise account of innovative research ideas and application results, reporting significant theoretical findings and application case studies in areas of robotics and automation.