A Fusion of Graph- and Grid-Based Hybrid Model of Object Detection and Semantic Segmentation for 4-D Millimeter-Wave Radar

IF 4.3 2区综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Sensors Journal Pub Date : 2024-11-05 DOI:10.1109/JSEN.2024.3479214

Hongyan Wang;Zifeng Huang;Jiakang Ma;Huimei Feng

{"title":"A Fusion of Graph- and Grid-Based Hybrid Model of Object Detection and Semantic Segmentation for 4-D Millimeter-Wave Radar","authors":"Hongyan Wang;Zifeng Huang;Jiakang Ma;Huimei Feng","doi":"10.1109/JSEN.2024.3479214","DOIUrl":null,"url":null,"abstract":"The next-generation 4-D millimeter-wave radar can provide rich information and dense point cloud and perceive the environment under all-weather and all-operating conditions, making it very suitable for autonomous driving systems. However, the existing object detection and semantic segmentation models based on 4-D millimeter-wave radar are usually directly transplanted from light laser detection and ranging (LiDAR), which cannot effectively adapt to the millimeter-wave radar point cloud features, resulting in poor detection performance. Concerning this, a hybrid model for 4-D millimeter-wave radar object detection and semantic segmentation is developed here via fusing graph and grid. A graph neural network (GNN) module named radar adaptive multichannel GNN (RAMGNN) is first designed, which leverages topology graphs and feature maps to propagate and update point cloud features. The node embeddings outputted from RAMGNN can be directly used for semantic segmentation and serve as a point cloud feature encoder for subsequent object detection. In what follows, the point cloud is projected onto a 2-D bird’s-eye view (BEV) grid, and its multiscale features can be extracted exploiting a backbone network with channel attention mechanism. Finally, multiscale features are fused to achieve effective object detection and semantic segmentation. Experimental results conducted on the publicly available dataset view-of-delft (VoD) demonstrate that the proposed model outperforms state-of-the-art algorithms in terms of both object detection performance and semantic segmentation quality.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"24 24","pages":"42268-42280"},"PeriodicalIF":4.3000,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10745207/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

The next-generation 4-D millimeter-wave radar can provide rich information and dense point cloud and perceive the environment under all-weather and all-operating conditions, making it very suitable for autonomous driving systems. However, the existing object detection and semantic segmentation models based on 4-D millimeter-wave radar are usually directly transplanted from light laser detection and ranging (LiDAR), which cannot effectively adapt to the millimeter-wave radar point cloud features, resulting in poor detection performance. Concerning this, a hybrid model for 4-D millimeter-wave radar object detection and semantic segmentation is developed here via fusing graph and grid. A graph neural network (GNN) module named radar adaptive multichannel GNN (RAMGNN) is first designed, which leverages topology graphs and feature maps to propagate and update point cloud features. The node embeddings outputted from RAMGNN can be directly used for semantic segmentation and serve as a point cloud feature encoder for subsequent object detection. In what follows, the point cloud is projected onto a 2-D bird’s-eye view (BEV) grid, and its multiscale features can be extracted exploiting a backbone network with channel attention mechanism. Finally, multiscale features are fused to achieve effective object detection and semantic segmentation. Experimental results conducted on the publicly available dataset view-of-delft (VoD) demonstrate that the proposed model outperforms state-of-the-art algorithms in terms of both object detection performance and semantic segmentation quality.

查看原文本刊更多论文

基于图形和网格的 4-D 毫米波雷达物体检测与语义分割混合模型的融合

下一代四维毫米波雷达可提供丰富的信息和密集的点云，可在全天候、全工况条件下感知环境，非常适合自动驾驶系统。然而，现有的基于四维毫米波雷达的物体检测和语义分割模型通常直接移植自光激光测距（LiDAR），无法有效适应毫米波雷达点云特征，导致检测性能不佳。为此，本文通过融合图和网格，开发了一种用于四维毫米波雷达目标检测和语义分割的混合模型。首先设计了一个名为雷达自适应多通道 GNN（RAMGNN）的图神经网络（GNN）模块，该模块利用拓扑图和特征图来传播和更新点云特征。RAMGNN 输出的节点嵌入可直接用于语义分割，并作为点云特征编码器用于后续的物体检测。在下文中，点云被投影到二维鸟瞰图（BEV）网格上，其多尺度特征可利用具有通道关注机制的骨干网络进行提取。最后，通过融合多尺度特征，实现有效的物体检测和语义分割。在公开数据集 view-of-delft (VoD) 上进行的实验结果表明，所提出的模型在物体检测性能和语义分割质量方面都优于最先进的算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Sensors Journal 工程技术-工程：电子与电气

CiteScore

7.70

自引率

14.00%

发文量

2058

审稿时长

5.2 months

期刊介绍： The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice