通过基于实时学习的形状检测实现机器人抓取圆柱形和立方体物体

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Huixu Dong;Jiadong Zhou;Haoyong Yu
{"title":"通过基于实时学习的形状检测实现机器人抓取圆柱形和立方体物体","authors":"Huixu Dong;Jiadong Zhou;Haoyong Yu","doi":"10.1109/TASE.2024.3510592","DOIUrl":null,"url":null,"abstract":"Robots grasping objects are critical capabilities in warehouse environments and industrial settings. A robotic grasp generally occurs in a scenario where it is unfeasible for a worker to efficiently complete a tedious task, such as picking food and drink cans (cylinder-shaped) and packaging boxes (cube-shaped). It is worth noting that the tops of cylinders and cubes can be represented by ellipses and rectangles in the two-dimensional (2D) space, respectively. Therefore, a robot can grasp cylinder-shaped and cube-shaped objects by ellipse and rectangle detection. However, it faces the challenge of how to accurately detect cylindrical and cubic objects in real-time for robot grasping. To tackle the above research problem, we propose a grasping system that enables a robot to grasp cylinder-shaped and cube-shaped objects in static and dynamic environments by the proposed ellipse and rectangle detector. An end-to-end learning model is constructed to first incorporate a one-stage detection backbone and then, accommodate the proposed adaptive multi-branch multi-scale net with a designed iterative feature pyramid network, local inception net, and multi-receptive-field feature fusion net to generate object detection recommendations. Employing depth information, the coordinates of detected objects are converted to the 3D space via sampling a series of registered depths and pixels on objects from the live video stream. Comparisons with recent detection methods on the same dataset indicate that the proposed ellipse and rectangle detectors present better performance. Abundant grasping experiments are conducted to illustrate that a robot, empowered by the proposed detector, has the capability of grasping cylindrical and cubic objects in dynamic scenarios. (Video on YouTube, <uri>https://youtu.be/KK1OtW6GvL0</uri>). Note to Practitioners—This paper is motivated by the problem of how to enable a robot to grasp objects with the basic geometric primitives-ellipses and rectangles in static and dynamic scenarios. Our target is to provide a potential solution for flexible industrial settings in operating moving cylinder-shaped and cube-shaped objects (food and drink cans and packaging boxes) in dynamic scenarios such as conveyors of production lines and logistics lines. We constructed a supervised learning model that can accurately and quickly detect ellipses and rectangles. Through the verification of the comparisons with recent methods and robotic grasping experiments, the behavior of the proposed method can be used in practical applications. In the future, we will deploy this robotic grasping system based on the proposed perception method to grasp food and drink cans and packaging boxes from moving conveyors on production lines and logistics lines.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"9681-9697"},"PeriodicalIF":6.4000,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robotic Grasps of Cylindrical and Cubic Objects via Real-Time Learning-Based Shape Detection\",\"authors\":\"Huixu Dong;Jiadong Zhou;Haoyong Yu\",\"doi\":\"10.1109/TASE.2024.3510592\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Robots grasping objects are critical capabilities in warehouse environments and industrial settings. A robotic grasp generally occurs in a scenario where it is unfeasible for a worker to efficiently complete a tedious task, such as picking food and drink cans (cylinder-shaped) and packaging boxes (cube-shaped). It is worth noting that the tops of cylinders and cubes can be represented by ellipses and rectangles in the two-dimensional (2D) space, respectively. Therefore, a robot can grasp cylinder-shaped and cube-shaped objects by ellipse and rectangle detection. However, it faces the challenge of how to accurately detect cylindrical and cubic objects in real-time for robot grasping. To tackle the above research problem, we propose a grasping system that enables a robot to grasp cylinder-shaped and cube-shaped objects in static and dynamic environments by the proposed ellipse and rectangle detector. An end-to-end learning model is constructed to first incorporate a one-stage detection backbone and then, accommodate the proposed adaptive multi-branch multi-scale net with a designed iterative feature pyramid network, local inception net, and multi-receptive-field feature fusion net to generate object detection recommendations. Employing depth information, the coordinates of detected objects are converted to the 3D space via sampling a series of registered depths and pixels on objects from the live video stream. Comparisons with recent detection methods on the same dataset indicate that the proposed ellipse and rectangle detectors present better performance. Abundant grasping experiments are conducted to illustrate that a robot, empowered by the proposed detector, has the capability of grasping cylindrical and cubic objects in dynamic scenarios. (Video on YouTube, <uri>https://youtu.be/KK1OtW6GvL0</uri>). Note to Practitioners—This paper is motivated by the problem of how to enable a robot to grasp objects with the basic geometric primitives-ellipses and rectangles in static and dynamic scenarios. Our target is to provide a potential solution for flexible industrial settings in operating moving cylinder-shaped and cube-shaped objects (food and drink cans and packaging boxes) in dynamic scenarios such as conveyors of production lines and logistics lines. We constructed a supervised learning model that can accurately and quickly detect ellipses and rectangles. Through the verification of the comparisons with recent methods and robotic grasping experiments, the behavior of the proposed method can be used in practical applications. In the future, we will deploy this robotic grasping system based on the proposed perception method to grasp food and drink cans and packaging boxes from moving conveyors on production lines and logistics lines.\",\"PeriodicalId\":51060,\"journal\":{\"name\":\"IEEE Transactions on Automation Science and Engineering\",\"volume\":\"22 \",\"pages\":\"9681-9697\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2024-12-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Automation Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10818984/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10818984/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

机器人抓取物体是仓库环境和工业环境中的关键能力。机器人抓取通常发生在工人无法有效完成繁琐任务的情况下,例如挑选食品和饮料罐(圆柱形)和包装盒(立方体)。值得注意的是,圆柱体和立方体的顶部可以分别用二维(2D)空间中的椭圆和矩形表示。因此,机器人可以通过椭圆和矩形检测来抓取圆柱形和立方体物体。然而,如何实时准确地检测出机器人抓取的圆柱形和立方体物体,是当前机器人抓取技术面临的挑战。为了解决上述研究问题,我们提出了一种抓取系统,通过所提出的椭圆和矩形探测器,使机器人能够在静态和动态环境中抓取圆柱体和立方体物体。构建端到端学习模型,首先结合单阶段检测骨干,然后通过设计迭代特征金字塔网络、局部初始网络和多接收场特征融合网络来适应所提出的自适应多分支多尺度网络,从而生成目标检测建议。利用深度信息,通过对实时视频流中物体的一系列注册深度和像素进行采样,将检测到的物体的坐标转换为3D空间。在同一数据集上与现有检测方法的比较表明,本文提出的椭圆和矩形检测器具有更好的性能。大量的抓取实验表明,在该检测器的支持下,机器人能够在动态场景下抓取圆柱形和立方体物体。(视频来自YouTube, https://youtu.be/KK1OtW6GvL0)。从业人员注意事项:本文的动机是如何使机器人能够在静态和动态场景中抓取具有基本几何基元的物体-椭圆和矩形。我们的目标是为灵活的工业环境提供一个潜在的解决方案,在动态场景中操作移动的圆柱形和立方体物体(食品和饮料罐和包装盒),如生产线和物流线的输送机。我们构建了一个监督学习模型,可以准确快速地检测椭圆和矩形。通过与现有方法的比较和机器人抓取实验验证,所提方法的性能可用于实际应用。在未来,我们将基于所提出的感知方法部署这种机器人抓取系统,从生产线和物流线上的移动输送机上抓取食品和饮料罐和包装盒。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Robotic Grasps of Cylindrical and Cubic Objects via Real-Time Learning-Based Shape Detection
Robots grasping objects are critical capabilities in warehouse environments and industrial settings. A robotic grasp generally occurs in a scenario where it is unfeasible for a worker to efficiently complete a tedious task, such as picking food and drink cans (cylinder-shaped) and packaging boxes (cube-shaped). It is worth noting that the tops of cylinders and cubes can be represented by ellipses and rectangles in the two-dimensional (2D) space, respectively. Therefore, a robot can grasp cylinder-shaped and cube-shaped objects by ellipse and rectangle detection. However, it faces the challenge of how to accurately detect cylindrical and cubic objects in real-time for robot grasping. To tackle the above research problem, we propose a grasping system that enables a robot to grasp cylinder-shaped and cube-shaped objects in static and dynamic environments by the proposed ellipse and rectangle detector. An end-to-end learning model is constructed to first incorporate a one-stage detection backbone and then, accommodate the proposed adaptive multi-branch multi-scale net with a designed iterative feature pyramid network, local inception net, and multi-receptive-field feature fusion net to generate object detection recommendations. Employing depth information, the coordinates of detected objects are converted to the 3D space via sampling a series of registered depths and pixels on objects from the live video stream. Comparisons with recent detection methods on the same dataset indicate that the proposed ellipse and rectangle detectors present better performance. Abundant grasping experiments are conducted to illustrate that a robot, empowered by the proposed detector, has the capability of grasping cylindrical and cubic objects in dynamic scenarios. (Video on YouTube, https://youtu.be/KK1OtW6GvL0). Note to Practitioners—This paper is motivated by the problem of how to enable a robot to grasp objects with the basic geometric primitives-ellipses and rectangles in static and dynamic scenarios. Our target is to provide a potential solution for flexible industrial settings in operating moving cylinder-shaped and cube-shaped objects (food and drink cans and packaging boxes) in dynamic scenarios such as conveyors of production lines and logistics lines. We constructed a supervised learning model that can accurately and quickly detect ellipses and rectangles. Through the verification of the comparisons with recent methods and robotic grasping experiments, the behavior of the proposed method can be used in practical applications. In the future, we will deploy this robotic grasping system based on the proposed perception method to grasp food and drink cans and packaging boxes from moving conveyors on production lines and logistics lines.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信