ODAR: A Lightweight Object Detection Framework for Autonomous Driving Robots
Le Hoang Duong, T. T. Huynh, Minh Tam Pham, Gwangzeen Ko, Jung Ick Moon, Jun Jo, N. Q. Hung
2021 Digital Image Computing: Techniques and Applications (DICTA), November 2021. DOI: 10.1109/DICTA52665.2021.9647256
Object detection has become an essential problem in recent years and is widely applied in daily life, including video surveillance, self-driving robots, and automatic payment. The rapid development of deep learning models allows object detectors to run in real time with high accuracy. However, such sophisticated models often require powerful computing infrastructure, such as high-end graphics processing units (GPUs). This requirement poses a severe problem for small, power-efficient embedded artificial intelligence (AI) systems like the Jetson Nano, which are constrained in both memory and raw computing power. In this work, we address this challenge by proposing a lightweight object detection framework specialized for Internet of Things (IoT) devices with low-power processors such as the Jetson Nano. To detect objects of different sizes, our framework employs a residual CNN-based backbone as the feature extractor. We then design a multi-layer model that combines features at different levels of granularity before using the fused features to locate and classify objects. We also apply augmentation techniques to improve the framework's robustness to adversarial factors. Extensive experiments on real devices in multiple scenarios, such as autonomous cars and wireless robot recharging systems, show that our technique achieves results nearly on par with the state-of-the-art YOLOv5 while requiring only one-fourth of the computational power.
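To make the described architecture concrete, the sketch below shows one way a detector of this kind can be assembled: a residual CNN backbone, a top-down module that fuses features at several levels of granularity, and a lightweight shared head that predicts boxes and classes at each scale. This is a minimal illustration under stated assumptions, not the authors' implementation; all names (ODARDetector, ResidualBlock), channel widths, and the anchor-based head layout are hypothetical choices for the example.

```python
# Minimal sketch (assumed design, not the paper's code): residual backbone,
# FPN-style multi-scale feature fusion, and a shared detection head.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ResidualBlock(nn.Module):
    """3x3 residual block; stride=2 halves the spatial resolution."""
    def __init__(self, in_ch, out_ch, stride=1):
        super().__init__()
        self.conv1 = nn.Conv2d(in_ch, out_ch, 3, stride, 1, bias=False)
        self.bn1 = nn.BatchNorm2d(out_ch)
        self.conv2 = nn.Conv2d(out_ch, out_ch, 3, 1, 1, bias=False)
        self.bn2 = nn.BatchNorm2d(out_ch)
        self.skip = (nn.Identity() if stride == 1 and in_ch == out_ch
                     else nn.Conv2d(in_ch, out_ch, 1, stride, bias=False))

    def forward(self, x):
        out = F.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return F.relu(out + self.skip(x))


class ODARDetector(nn.Module):
    """Residual backbone -> top-down multi-scale fusion -> shared detection head."""
    def __init__(self, num_classes=20, num_anchors=3):
        super().__init__()
        self.stem = nn.Sequential(nn.Conv2d(3, 32, 3, 2, 1), nn.ReLU())
        self.stage1 = ResidualBlock(32, 64, stride=2)    # /4,  fine details
        self.stage2 = ResidualBlock(64, 128, stride=2)   # /8,  medium objects
        self.stage3 = ResidualBlock(128, 256, stride=2)  # /16, large objects
        # 1x1 convs project each level to a common channel width before fusion.
        self.lat1 = nn.Conv2d(64, 128, 1)
        self.lat2 = nn.Conv2d(128, 128, 1)
        self.lat3 = nn.Conv2d(256, 128, 1)
        # One lightweight head shared across scales: per anchor it predicts
        # 4 box offsets + 1 objectness score + class scores.
        self.head = nn.Conv2d(128, num_anchors * (5 + num_classes), 3, padding=1)

    def forward(self, x):
        c1 = self.stage1(self.stem(x))
        c2 = self.stage2(c1)
        c3 = self.stage3(c2)
        # Top-down fusion: upsample coarse features and add them to finer ones,
        # combining semantics from deep layers with detail from shallow layers.
        p3 = self.lat3(c3)
        p2 = self.lat2(c2) + F.interpolate(p3, size=c2.shape[-2:], mode="nearest")
        p1 = self.lat1(c1) + F.interpolate(p2, size=c1.shape[-2:], mode="nearest")
        # One prediction map per granularity: small, medium, and large objects.
        return [self.head(p) for p in (p1, p2, p3)]


if __name__ == "__main__":
    model = ODARDetector(num_classes=20)
    preds = model(torch.randn(1, 3, 320, 320))
    print([p.shape for p in preds])  # three maps at strides 4, 8, and 16
```

Keeping a single shallow head shared across all fused levels is one way such a framework can stay within the memory and compute budget of a device like the Jetson Nano while still handling objects of different sizes; the actual ODAR design details are described in the paper itself.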