Feng Wang , Meixia Dong , Jiajun Zou , Shitong Ye , Zhiping Wan , Shaojiang Liu
{"title":"基于语义优先级调度的自适应多模态融合协同三维检测与通信方法","authors":"Feng Wang , Meixia Dong , Jiajun Zou , Shitong Ye , Zhiping Wan , Shaojiang Liu","doi":"10.1016/j.aej.2025.03.113","DOIUrl":null,"url":null,"abstract":"<div><div>In response to the urgent demand for high-precision 3D object detection and efficient communication in intelligent transportation systems and autonomous driving, this paper proposes an Adaptive Multimodal Fusion and Semantic Priority Scheduling approach for collaborative 3D detection and communication (AMFS-C3D). During the fusion of multimodal sensor data (from cameras, LiDAR, and radar), a graph neural network is introduced to structurally encode the integrated 3D features, thoroughly capturing spatial interactions among targets; a variational autoencoder then compresses these high-dimensional graph embeddings, retaining essential semantic information even under bandwidth constraints and significant channel noise. To further reduce latency and increase communication efficiency, the system employs an adaptive priority scheduling mechanism that dynamically allocates bandwidth based on real-time network load and target importance, ensuring the timely transmission of critical objects. Experimental results show that AMFS-C3D significantly outperforms comparative methods in key metrics such as mean Average Precision (mAP), recall, semantic fidelity, and transmission latency. Under the same mAP threshold, AMFS-C3D reduces bandwidth requirements by over 10 % on average, and under high signal-to-noise ratio conditions, it improves detection accuracy and recall by 3–5 percentage points. Moreover, AMFS-C3D demonstrates superior applicability and robustness across diverse network loads and channel conditions.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"124 ","pages":"Pages 362-375"},"PeriodicalIF":6.8000,"publicationDate":"2025-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive multimodal fusion with semantic priority scheduling for cooperative 3D detection and communication methods\",\"authors\":\"Feng Wang , Meixia Dong , Jiajun Zou , Shitong Ye , Zhiping Wan , Shaojiang Liu\",\"doi\":\"10.1016/j.aej.2025.03.113\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In response to the urgent demand for high-precision 3D object detection and efficient communication in intelligent transportation systems and autonomous driving, this paper proposes an Adaptive Multimodal Fusion and Semantic Priority Scheduling approach for collaborative 3D detection and communication (AMFS-C3D). During the fusion of multimodal sensor data (from cameras, LiDAR, and radar), a graph neural network is introduced to structurally encode the integrated 3D features, thoroughly capturing spatial interactions among targets; a variational autoencoder then compresses these high-dimensional graph embeddings, retaining essential semantic information even under bandwidth constraints and significant channel noise. To further reduce latency and increase communication efficiency, the system employs an adaptive priority scheduling mechanism that dynamically allocates bandwidth based on real-time network load and target importance, ensuring the timely transmission of critical objects. Experimental results show that AMFS-C3D significantly outperforms comparative methods in key metrics such as mean Average Precision (mAP), recall, semantic fidelity, and transmission latency. Under the same mAP threshold, AMFS-C3D reduces bandwidth requirements by over 10 % on average, and under high signal-to-noise ratio conditions, it improves detection accuracy and recall by 3–5 percentage points. Moreover, AMFS-C3D demonstrates superior applicability and robustness across diverse network loads and channel conditions.</div></div>\",\"PeriodicalId\":7484,\"journal\":{\"name\":\"alexandria engineering journal\",\"volume\":\"124 \",\"pages\":\"Pages 362-375\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-04-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"alexandria engineering journal\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1110016825004193\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"alexandria engineering journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110016825004193","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
Adaptive multimodal fusion with semantic priority scheduling for cooperative 3D detection and communication methods
In response to the urgent demand for high-precision 3D object detection and efficient communication in intelligent transportation systems and autonomous driving, this paper proposes an Adaptive Multimodal Fusion and Semantic Priority Scheduling approach for collaborative 3D detection and communication (AMFS-C3D). During the fusion of multimodal sensor data (from cameras, LiDAR, and radar), a graph neural network is introduced to structurally encode the integrated 3D features, thoroughly capturing spatial interactions among targets; a variational autoencoder then compresses these high-dimensional graph embeddings, retaining essential semantic information even under bandwidth constraints and significant channel noise. To further reduce latency and increase communication efficiency, the system employs an adaptive priority scheduling mechanism that dynamically allocates bandwidth based on real-time network load and target importance, ensuring the timely transmission of critical objects. Experimental results show that AMFS-C3D significantly outperforms comparative methods in key metrics such as mean Average Precision (mAP), recall, semantic fidelity, and transmission latency. Under the same mAP threshold, AMFS-C3D reduces bandwidth requirements by over 10 % on average, and under high signal-to-noise ratio conditions, it improves detection accuracy and recall by 3–5 percentage points. Moreover, AMFS-C3D demonstrates superior applicability and robustness across diverse network loads and channel conditions.
期刊介绍:
Alexandria Engineering Journal is an international journal devoted to publishing high quality papers in the field of engineering and applied science. Alexandria Engineering Journal is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). The papers published in Alexandria Engineering Journal are grouped into five sections, according to the following classification:
• Mechanical, Production, Marine and Textile Engineering
• Electrical Engineering, Computer Science and Nuclear Engineering
• Civil and Architecture Engineering
• Chemical Engineering and Applied Sciences
• Environmental Engineering