Jirui Li , Cong Zheng , GuoZhi Li , Haihua Zhu , Zigang Chen , Jie Yuan
{"title":"M2-MADRL:一种基于MADRL的边缘自组网多模路由优化策略","authors":"Jirui Li , Cong Zheng , GuoZhi Li , Haihua Zhu , Zigang Chen , Jie Yuan","doi":"10.1016/j.aej.2025.08.021","DOIUrl":null,"url":null,"abstract":"<div><div>This paper addresses the challenges of dynamic routing in Edge Ad Hoc Networks (EANETs), such as high mobility, complex topologies, and diverse service requirements, by proposing M2-MADRL, a Multi-Mode routing framework based on Multi-Agent Deep Reinforcement Learning. The study focuses on three key aspects: first, designing a multi-mode routing architecture that enables nodes to dynamically switch between routing protocols (e.g., AODV, DSDV) and adaptive parameter adjustments, adapting to varying network scales and mobility levels. Second, integrating an enhanced MADRL model based on the Centralized Training with Decentralized Execution (CTDE) framework, which uses local value networks for individual node optimization and a global value network to improve overall network performance, resolving conflicts between centralized control and EANETs’ distributed nature. Third, introducing a double-layer Semantic Self-Attention Mechanism (SSAM) to help agents accurately interpret heterogeneous network states and context-aware actions, accelerating convergence and enhancing decision-making precision. Experiments show M2-MADRL outperforms MADDPG, DE-MADDPG, and RTHop in Packet Delivery Rate (PDR), Effective Throughput (ET), and Average Transmission Delay (ATD), demonstrating superior adaptability in EANETs. For example, in 150-node networks, it averagely demonstrates 11.9% higher PDR, 23.0% higher ET, and 44.6% lower ATD than the classic MADDPG method.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"129 ","pages":"Pages 1165-1184"},"PeriodicalIF":6.8000,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"M2-MADRL: A multi-mode routing optimization strategy based on MADRL for Edge Ad Hoc Networks\",\"authors\":\"Jirui Li , Cong Zheng , GuoZhi Li , Haihua Zhu , Zigang Chen , Jie Yuan\",\"doi\":\"10.1016/j.aej.2025.08.021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper addresses the challenges of dynamic routing in Edge Ad Hoc Networks (EANETs), such as high mobility, complex topologies, and diverse service requirements, by proposing M2-MADRL, a Multi-Mode routing framework based on Multi-Agent Deep Reinforcement Learning. The study focuses on three key aspects: first, designing a multi-mode routing architecture that enables nodes to dynamically switch between routing protocols (e.g., AODV, DSDV) and adaptive parameter adjustments, adapting to varying network scales and mobility levels. Second, integrating an enhanced MADRL model based on the Centralized Training with Decentralized Execution (CTDE) framework, which uses local value networks for individual node optimization and a global value network to improve overall network performance, resolving conflicts between centralized control and EANETs’ distributed nature. Third, introducing a double-layer Semantic Self-Attention Mechanism (SSAM) to help agents accurately interpret heterogeneous network states and context-aware actions, accelerating convergence and enhancing decision-making precision. Experiments show M2-MADRL outperforms MADDPG, DE-MADDPG, and RTHop in Packet Delivery Rate (PDR), Effective Throughput (ET), and Average Transmission Delay (ATD), demonstrating superior adaptability in EANETs. For example, in 150-node networks, it averagely demonstrates 11.9% higher PDR, 23.0% higher ET, and 44.6% lower ATD than the classic MADDPG method.</div></div>\",\"PeriodicalId\":7484,\"journal\":{\"name\":\"alexandria engineering journal\",\"volume\":\"129 \",\"pages\":\"Pages 1165-1184\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-08-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"alexandria engineering journal\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1110016825009172\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"alexandria engineering journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110016825009172","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
M2-MADRL: A multi-mode routing optimization strategy based on MADRL for Edge Ad Hoc Networks
This paper addresses the challenges of dynamic routing in Edge Ad Hoc Networks (EANETs), such as high mobility, complex topologies, and diverse service requirements, by proposing M2-MADRL, a Multi-Mode routing framework based on Multi-Agent Deep Reinforcement Learning. The study focuses on three key aspects: first, designing a multi-mode routing architecture that enables nodes to dynamically switch between routing protocols (e.g., AODV, DSDV) and adaptive parameter adjustments, adapting to varying network scales and mobility levels. Second, integrating an enhanced MADRL model based on the Centralized Training with Decentralized Execution (CTDE) framework, which uses local value networks for individual node optimization and a global value network to improve overall network performance, resolving conflicts between centralized control and EANETs’ distributed nature. Third, introducing a double-layer Semantic Self-Attention Mechanism (SSAM) to help agents accurately interpret heterogeneous network states and context-aware actions, accelerating convergence and enhancing decision-making precision. Experiments show M2-MADRL outperforms MADDPG, DE-MADDPG, and RTHop in Packet Delivery Rate (PDR), Effective Throughput (ET), and Average Transmission Delay (ATD), demonstrating superior adaptability in EANETs. For example, in 150-node networks, it averagely demonstrates 11.9% higher PDR, 23.0% higher ET, and 44.6% lower ATD than the classic MADDPG method.
期刊介绍:
Alexandria Engineering Journal is an international journal devoted to publishing high quality papers in the field of engineering and applied science. Alexandria Engineering Journal is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). The papers published in Alexandria Engineering Journal are grouped into five sections, according to the following classification:
• Mechanical, Production, Marine and Textile Engineering
• Electrical Engineering, Computer Science and Nuclear Engineering
• Civil and Architecture Engineering
• Chemical Engineering and Applied Sciences
• Environmental Engineering