Yunting Yang, Jun Liu, Hongsi Liu, Guangfeng Jiang
{"title":"Radar M3-Net: Multi-scale, multi-layer, multi-frame network with a large receptive field for 3D object detection","authors":"Yunting Yang, Jun Liu, Hongsi Liu, Guangfeng Jiang","doi":"10.1016/j.eswa.2025.127515","DOIUrl":null,"url":null,"abstract":"<div><div>4D millimeter-wave radar has demonstrated significant potential for 3D object detection in autonomous driving due to its cost-effectiveness and robustness. However, the inherent sparsity of radar data poses a significant challenge to achieving accurate 3D object detection, as it limits the amount of meaningful information available for feature learning. The key to addressing the performance degradation caused by sparsity lies in expanding the receptive field and enriching feature representations. To this end, we propose a multi-scale, multi-layer, multi-frame network with a large receptive field, named Radar <span><math><msup><mrow><mtext>M</mtext></mrow><mrow><mn>3</mn></mrow></msup></math></span>-Net. First, we design a multi-scale voxel feature encoding (MSVFE) module and a multi-layer attention (MLA) module, both of which significantly expand the receptive field and enrich features, effectively addressing the issue of sparsity. Then, a multi-frame fusion module is developed to further enhance features by utilizing the accumulation of temporal information. Simultaneously, we design a novel sparse dual-head within the sparse framework to address the decline in detection accuracy for large object categories caused by radar sparsity. Extensive experiments on the View-of-Delft and TJ4DRadSet datasets have confirmed the advancement and effectiveness of our network. Specifically, our method achieves state-of-the-art mean average precision (mAP) performance on both datasets, even outperforming some multimodal approaches in certain metrics.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"286 ","pages":"Article 127515"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425011376","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
4D millimeter-wave radar has demonstrated significant potential for 3D object detection in autonomous driving due to its cost-effectiveness and robustness. However, the inherent sparsity of radar data poses a significant challenge to achieving accurate 3D object detection, as it limits the amount of meaningful information available for feature learning. The key to addressing the performance degradation caused by sparsity lies in expanding the receptive field and enriching feature representations. To this end, we propose a multi-scale, multi-layer, multi-frame network with a large receptive field, named Radar -Net. First, we design a multi-scale voxel feature encoding (MSVFE) module and a multi-layer attention (MLA) module, both of which significantly expand the receptive field and enrich features, effectively addressing the issue of sparsity. Then, a multi-frame fusion module is developed to further enhance features by utilizing the accumulation of temporal information. Simultaneously, we design a novel sparse dual-head within the sparse framework to address the decline in detection accuracy for large object categories caused by radar sparsity. Extensive experiments on the View-of-Delft and TJ4DRadSet datasets have confirmed the advancement and effectiveness of our network. Specifically, our method achieves state-of-the-art mean average precision (mAP) performance on both datasets, even outperforming some multimodal approaches in certain metrics.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.