Ankur Nahar , Debasis Das , Ramnarayan Yadav , Khujamatov Halimjon , Ernazar Reypnazarov
{"title":"Meta-enhanced hierarchical multi-agent reinforcement learning for dynamic spectrum management and trust-based routing in cognitive vehicular networks","authors":"Ankur Nahar , Debasis Das , Ramnarayan Yadav , Khujamatov Halimjon , Ernazar Reypnazarov","doi":"10.1016/j.adhoc.2025.103874","DOIUrl":null,"url":null,"abstract":"<div><div>This research introduces the Meta-Enhanced Recurrent Multi-Agent Reinforcement Learning (M-RMARL) framework, designed to tackle the challenges of reliable routing and dynamic spectrum management in Cognitive Vehicular Ad Hoc Networks (CR-VANETs). The framework is built on Meta-Agnostic Meta-Learning (MAML), utilizing Meta-Learned Deep Recurrent Q-Networks (DRQNs) to significantly reduce training time, enabling vehicles to quickly identify optimal routes and enhance spectrum sensing with minimal adjustments. M-RMARL also features a dynamic spectrum management system that employs Long Short-Term Memory (LSTM)-based meta-predictive models to forecast future spectrum availability and network conditions. These predictions allow DRQNs to make proactive, intelligent decisions, improving spectrum efficiency. To ensure secure communication, the framework incorporates a Trust-Based Meta-Coordination mechanism, which dynamically evaluates agent trustworthiness and integrates these assessments into the decision-making process. Additionally, the framework leverages a Hierarchical Meta-Agent Coordination architecture, where Roadside Units (RSUs) manage global coordination and meta-learning updates, while vehicle agents implement the derived policies. This structure enhances scalability and resource management, making M-RMARL particularly effective in complex decision-making environments. Extensive simulations demonstrate the framework’s effectiveness, showing improvements of 18% in spectrum utilization, 25% in training convergence, 20% in spectrum prediction accuracy, 30% in training efficiency, and 17% in trust evaluation reliability.</div></div>","PeriodicalId":55555,"journal":{"name":"Ad Hoc Networks","volume":"175 ","pages":"Article 103874"},"PeriodicalIF":4.4000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ad Hoc Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1570870525001222","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This research introduces the Meta-Enhanced Recurrent Multi-Agent Reinforcement Learning (M-RMARL) framework, designed to tackle the challenges of reliable routing and dynamic spectrum management in Cognitive Vehicular Ad Hoc Networks (CR-VANETs). The framework is built on Meta-Agnostic Meta-Learning (MAML), utilizing Meta-Learned Deep Recurrent Q-Networks (DRQNs) to significantly reduce training time, enabling vehicles to quickly identify optimal routes and enhance spectrum sensing with minimal adjustments. M-RMARL also features a dynamic spectrum management system that employs Long Short-Term Memory (LSTM)-based meta-predictive models to forecast future spectrum availability and network conditions. These predictions allow DRQNs to make proactive, intelligent decisions, improving spectrum efficiency. To ensure secure communication, the framework incorporates a Trust-Based Meta-Coordination mechanism, which dynamically evaluates agent trustworthiness and integrates these assessments into the decision-making process. Additionally, the framework leverages a Hierarchical Meta-Agent Coordination architecture, where Roadside Units (RSUs) manage global coordination and meta-learning updates, while vehicle agents implement the derived policies. This structure enhances scalability and resource management, making M-RMARL particularly effective in complex decision-making environments. Extensive simulations demonstrate the framework’s effectiveness, showing improvements of 18% in spectrum utilization, 25% in training convergence, 20% in spectrum prediction accuracy, 30% in training efficiency, and 17% in trust evaluation reliability.
期刊介绍:
The Ad Hoc Networks is an international and archival journal providing a publication vehicle for complete coverage of all topics of interest to those involved in ad hoc and sensor networking areas. The Ad Hoc Networks considers original, high quality and unpublished contributions addressing all aspects of ad hoc and sensor networks. Specific areas of interest include, but are not limited to:
Mobile and Wireless Ad Hoc Networks
Sensor Networks
Wireless Local and Personal Area Networks
Home Networks
Ad Hoc Networks of Autonomous Intelligent Systems
Novel Architectures for Ad Hoc and Sensor Networks
Self-organizing Network Architectures and Protocols
Transport Layer Protocols
Routing protocols (unicast, multicast, geocast, etc.)
Media Access Control Techniques
Error Control Schemes
Power-Aware, Low-Power and Energy-Efficient Designs
Synchronization and Scheduling Issues
Mobility Management
Mobility-Tolerant Communication Protocols
Location Tracking and Location-based Services
Resource and Information Management
Security and Fault-Tolerance Issues
Hardware and Software Platforms, Systems, and Testbeds
Experimental and Prototype Results
Quality-of-Service Issues
Cross-Layer Interactions
Scalability Issues
Performance Analysis and Simulation of Protocols.