可变形多线性物体双臂机器人操作的GNN拓扑表示学习

IF 6.4 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

IEEE Transactions on Automation Science and Engineering Pub Date : 2025-04-18 DOI:10.1109/TASE.2025.3562231

Alessio Caporali;Kevin Galassi;Riccardo Zanella;Gianluca Palli

{"title":"可变形多线性物体双臂机器人操作的GNN拓扑表示学习","authors":"Alessio Caporali;Kevin Galassi;Riccardo Zanella;Gianluca Palli","doi":"10.1109/TASE.2025.3562231","DOIUrl":null,"url":null,"abstract":"Deformable Multi-Linear Objects (DMLOs), or Branched Deformable Linear Objects (BDLOs), are flexible objects that possess a linear structure similar to DLOs but also feature branching or bifurcation points where the object’s path diverges into multiple sections. The representation of complex DMLOs, such as wiring harnesses, poses significant challenges in various applications, including robotic systems’ perception and manipulation planning. This paper proposes an approach to address the robust and efficient estimation of a topological representation for DMLOs leveraging a graph-based description of the scene obtained via graph neural networks. Starting from a binary mask of the scene, graph nodes are sampled along the objects’ estimated centerlines. Then, a data-driven pipeline is employed to learn the assignment of graph edges between nodes and to characterize the node’s type based on their local topology and orientation. Finally, by utilizing the learned information, a solver combines the predictions and generates a coherent representation of the objects in the scene. The approach is experimentally evaluated using a test set of complex real-world DMLOs. Within an offline evaluation, the proposed approach achieves a Dice score exceeding 90% in predicting graph edges. Similarly, the identification accuracy of branch and intersection points in the graph topology is above 90%. Additionally, the method demonstrates efficient performance, achieving a runtime of over 20 FPS. In an online assessment employing a dual-arm robotic setup, the approach is successfully applied to disentangle three automotive wiring harnesses, demonstrating the effectiveness of the proposed approach in a real-world scenario. Note to Practitioners—This research is motivated by the lack of effective robotic perception and manipulation solutions for deformable multi-linear objects such as wiring harnesses. These objects play a crucial role in several manufacturing industries. However, their intricate branching structures, inherent deformable nature, and limited distinctive visual characteristics present formidable challenges when automating tasks that involve them. The proposed approach addresses several of these challenges: 1) It maintains robustness in the presence of noise and disruptions in the input image mask due to its graph-based structure; 2) It remains unaffected by the existence of large connectors, clips, fixtures, and other objects along the branch sections of the object; 3) It avoids making any assumptions about the structure of the object; 4) It is capable of simultaneously identifying branch-points and resolving intersection-points among different branch sections of these objects. The experiments have verified the effectiveness of this solution when applied to real-world deformable multi-linear objects found in automotive settings. This underscores the potential to integrate this solution into industrial scenarios that involve the assembly and routing of such objects.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"14738-14751"},"PeriodicalIF":6.4000,"publicationDate":"2025-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10970007","citationCount":"0","resultStr":"{\"title\":\"GNN Topology Representation Learning for Deformable Multi-Linear Objects Dual-Arm Robotic Manipulation\",\"authors\":\"Alessio Caporali;Kevin Galassi;Riccardo Zanella;Gianluca Palli\",\"doi\":\"10.1109/TASE.2025.3562231\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deformable Multi-Linear Objects (DMLOs), or Branched Deformable Linear Objects (BDLOs), are flexible objects that possess a linear structure similar to DLOs but also feature branching or bifurcation points where the object’s path diverges into multiple sections. The representation of complex DMLOs, such as wiring harnesses, poses significant challenges in various applications, including robotic systems’ perception and manipulation planning. This paper proposes an approach to address the robust and efficient estimation of a topological representation for DMLOs leveraging a graph-based description of the scene obtained via graph neural networks. Starting from a binary mask of the scene, graph nodes are sampled along the objects’ estimated centerlines. Then, a data-driven pipeline is employed to learn the assignment of graph edges between nodes and to characterize the node’s type based on their local topology and orientation. Finally, by utilizing the learned information, a solver combines the predictions and generates a coherent representation of the objects in the scene. The approach is experimentally evaluated using a test set of complex real-world DMLOs. Within an offline evaluation, the proposed approach achieves a Dice score exceeding 90% in predicting graph edges. Similarly, the identification accuracy of branch and intersection points in the graph topology is above 90%. Additionally, the method demonstrates efficient performance, achieving a runtime of over 20 FPS. In an online assessment employing a dual-arm robotic setup, the approach is successfully applied to disentangle three automotive wiring harnesses, demonstrating the effectiveness of the proposed approach in a real-world scenario. Note to Practitioners—This research is motivated by the lack of effective robotic perception and manipulation solutions for deformable multi-linear objects such as wiring harnesses. These objects play a crucial role in several manufacturing industries. However, their intricate branching structures, inherent deformable nature, and limited distinctive visual characteristics present formidable challenges when automating tasks that involve them. The proposed approach addresses several of these challenges: 1) It maintains robustness in the presence of noise and disruptions in the input image mask due to its graph-based structure; 2) It remains unaffected by the existence of large connectors, clips, fixtures, and other objects along the branch sections of the object; 3) It avoids making any assumptions about the structure of the object; 4) It is capable of simultaneously identifying branch-points and resolving intersection-points among different branch sections of these objects. The experiments have verified the effectiveness of this solution when applied to real-world deformable multi-linear objects found in automotive settings. This underscores the potential to integrate this solution into industrial scenarios that involve the assembly and routing of such objects.\",\"PeriodicalId\":51060,\"journal\":{\"name\":\"IEEE Transactions on Automation Science and Engineering\",\"volume\":\"22 \",\"pages\":\"14738-14751\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2025-04-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10970007\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Automation Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10970007/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10970007/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

可变形多线性对象（DMLOs）或分支可变形线性对象（BDLOs）是具有与DLOs相似的线性结构的柔性对象，但也具有分支或分岔点，其中对象的路径发散为多个部分。复杂dmlo的表示，如线束，在各种应用中提出了重大挑战，包括机器人系统的感知和操作规划。本文提出了一种利用通过图神经网络获得的基于图的场景描述来解决DMLOs拓扑表示的鲁棒性和有效性估计的方法。从场景的二进制掩码开始，沿着物体估计的中心线对图节点进行采样。然后，采用数据驱动管道学习节点间图边的分配，并根据节点的局部拓扑和方向来表征节点的类型。最后，通过利用学习到的信息，求解器结合预测并生成场景中物体的连贯表示。使用复杂的现实世界DMLOs测试集对该方法进行了实验评估。在离线评估中，该方法在预测图边方面达到了超过90%的Dice得分。同样，图拓扑中分支点和交点的识别精度在90%以上。此外，该方法显示了高效的性能，实现了超过20 FPS的运行时。在采用双臂机器人装置的在线评估中，该方法成功地应用于解开三根汽车线束，证明了该方法在现实场景中的有效性。从业人员注意事项：这项研究的动机是缺乏有效的机器人感知和操作解决方案，用于可变形的多线性物体，如线束。这些物品在几个制造业中起着至关重要的作用。然而，它们复杂的分支结构、固有的可变形特性和有限的独特视觉特征，在自动化涉及它们的任务时带来了巨大的挑战。所提出的方法解决了其中的几个挑战：1)由于其基于图的结构，它在输入图像掩膜中存在噪声和中断时保持鲁棒性；2)不受沿物体分支部分存在的大型连接器、夹子、夹具和其他物体的影响；3)避免对对象的结构作任何假设；4)能够同时识别分支点和求解这些对象的不同分支截面之间的相交点。实验验证了该解决方案在应用于汽车设置中发现的实际可变形多线性物体时的有效性。这强调了将此解决方案集成到涉及此类对象的组装和路由的工业场景中的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

GNN Topology Representation Learning for Deformable Multi-Linear Objects Dual-Arm Robotic Manipulation

Deformable Multi-Linear Objects (DMLOs), or Branched Deformable Linear Objects (BDLOs), are flexible objects that possess a linear structure similar to DLOs but also feature branching or bifurcation points where the object’s path diverges into multiple sections. The representation of complex DMLOs, such as wiring harnesses, poses significant challenges in various applications, including robotic systems’ perception and manipulation planning. This paper proposes an approach to address the robust and efficient estimation of a topological representation for DMLOs leveraging a graph-based description of the scene obtained via graph neural networks. Starting from a binary mask of the scene, graph nodes are sampled along the objects’ estimated centerlines. Then, a data-driven pipeline is employed to learn the assignment of graph edges between nodes and to characterize the node’s type based on their local topology and orientation. Finally, by utilizing the learned information, a solver combines the predictions and generates a coherent representation of the objects in the scene. The approach is experimentally evaluated using a test set of complex real-world DMLOs. Within an offline evaluation, the proposed approach achieves a Dice score exceeding 90% in predicting graph edges. Similarly, the identification accuracy of branch and intersection points in the graph topology is above 90%. Additionally, the method demonstrates efficient performance, achieving a runtime of over 20 FPS. In an online assessment employing a dual-arm robotic setup, the approach is successfully applied to disentangle three automotive wiring harnesses, demonstrating the effectiveness of the proposed approach in a real-world scenario. Note to Practitioners—This research is motivated by the lack of effective robotic perception and manipulation solutions for deformable multi-linear objects such as wiring harnesses. These objects play a crucial role in several manufacturing industries. However, their intricate branching structures, inherent deformable nature, and limited distinctive visual characteristics present formidable challenges when automating tasks that involve them. The proposed approach addresses several of these challenges: 1) It maintains robustness in the presence of noise and disruptions in the input image mask due to its graph-based structure; 2) It remains unaffected by the existence of large connectors, clips, fixtures, and other objects along the branch sections of the object; 3) It avoids making any assumptions about the structure of the object; 4) It is capable of simultaneously identifying branch-points and resolving intersection-points among different branch sections of these objects. The experiments have verified the effectiveness of this solution when applied to real-world deformable multi-linear objects found in automotive settings. This underscores the potential to integrate this solution into industrial scenarios that involve the assembly and routing of such objects.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统

CiteScore

12.50

自引率

14.30%

发文量

404

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.