{"title":"High-accuracy Low-latency Non-Maximum Suppression Processor for Traffic Object Detection","authors":"Chenbo Yuan, Peng Xu, Gang Chen","doi":"10.1587/elex.20.20230445","DOIUrl":null,"url":null,"abstract":"As autonomous driving technology advances, the requirements for object detection are becoming increasingly high. Non-maximum suppression (NMS) algorithm, as a key component in traffic object detection algorithms, is an independent post-processing process in the object detection framework. Due to the complexity of real-world road scenarios and high density of detected entities in urban traffic, the number of candidate bounding boxes generated by the neural network is large. Hence, low-precision processors may generate a significant number of redundant target bounding boxes. The excessive output of redundant target bounding boxes not only imposes a workload on subsequent processing but also has the potential to result in non-optimal decision-making. We propose a high-performance NMS processor that can quickly process a large number of candidate boxes without performing sorting of their scores. Also, it has low precision loss computing units and high parallel computing arrays. Combined with algorithm design, it effectively reduces the computational complexity and reduces the inference time of the end-to-end task of the NMS algorithm. Thus, our NMS processor’s speed is comparable to SOTA architecture, and the average accuracy loss is only 0.4% .","PeriodicalId":50387,"journal":{"name":"Ieice Electronics Express","volume":"27 1","pages":"0"},"PeriodicalIF":0.8000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ieice Electronics Express","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1587/elex.20.20230445","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
As autonomous driving technology advances, the requirements for object detection are becoming increasingly high. Non-maximum suppression (NMS) algorithm, as a key component in traffic object detection algorithms, is an independent post-processing process in the object detection framework. Due to the complexity of real-world road scenarios and high density of detected entities in urban traffic, the number of candidate bounding boxes generated by the neural network is large. Hence, low-precision processors may generate a significant number of redundant target bounding boxes. The excessive output of redundant target bounding boxes not only imposes a workload on subsequent processing but also has the potential to result in non-optimal decision-making. We propose a high-performance NMS processor that can quickly process a large number of candidate boxes without performing sorting of their scores. Also, it has low precision loss computing units and high parallel computing arrays. Combined with algorithm design, it effectively reduces the computational complexity and reduces the inference time of the end-to-end task of the NMS algorithm. Thus, our NMS processor’s speed is comparable to SOTA architecture, and the average accuracy loss is only 0.4% .
期刊介绍:
An aim of ELEX is rapid publication of original, peer-reviewed short papers that treat the field of modern electronics and electrical engineering. The boundaries of acceptable fields are not strictly delimited and they are flexibly varied to reflect trends of the fields. The scope of ELEX has mainly been focused on device and circuit technologies. Current appropriate topics include:
- Integrated optoelectronics (lasers and optoelectronic devices, silicon photonics, planar lightwave circuits, polymer optical circuits, etc.)
- Optical hardware (fiber optics, microwave photonics, optical interconnects, photonic signal processing, photonic integration and modules, optical sensing, etc.)
- Electromagnetic theory
- Microwave and millimeter-wave devices, circuits, and modules
- THz devices, circuits and modules
- Electron devices, circuits and modules (silicon, compound semiconductor, organic and novel materials)
- Integrated circuits (memory, logic, analog, RF, sensor)
- Power devices and circuits
- Micro- or nano-electromechanical systems
- Circuits and modules for storage
- Superconducting electronics
- Energy harvesting devices, circuits and modules
- Circuits and modules for electronic displays
- Circuits and modules for electronic instrumentation
- Devices, circuits and modules for IoT and biomedical applications