{"title":"智能物流中卡车-无人机路线的多智能体强化学习:综述","authors":"Ali Arishi , Paras Ahuja","doi":"10.1016/j.compeleceng.2025.110529","DOIUrl":null,"url":null,"abstract":"<div><div>The surge in e-commerce and demand for same-day delivery have intensified the need for efficient last-mile logistics solutions. Traditional vehicle routing problems fall short in addressing real-world constraints such as traffic congestion, energy limitations, and dynamic delivery demands. In response, Multi-Agent Reinforcement Learning (MARL) has emerged as a transformative approach for optimizing truck–drone collaboration, enabling decentralized decision-making, real-time re-routing, and intelligent resource allocation.</div><div>This paper presents a comprehensive review of MARL-based truck–drone logistics, categorizing recent advancements in value-based, policy-based, and hybrid learning approaches, including Deep Q-learning, proximal policy optimization, and multi-agent deep deterministic policy gradient, and meta-reinforcement learning. It proposes a structured taxonomy based on learning strategies, optimization objectives, and environmental constraints. Key performance drivers such as delivery efficiency, energy-aware scheduling, and scalability are examined alongside real-world limitations, including traffic congestion, regulatory compliance, infrastructure dependence, and weather variability.</div><div>The paper also identifies critical challenges – computational complexity, communication overhead, and adversarial robustness and outlines future research directions, including hybrid learning architectures, blockchain-secured coordination, edge intelligence, and sustainability – driven MARL optimization. By synthesizing state-of-the-art research and identifying actionable pathways, this review provides foundational insights for advancing intelligent, adaptive, and eco-efficient truck–drone delivery systems.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"127 ","pages":"Article 110529"},"PeriodicalIF":4.9000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Agent Reinforcement Learning for truck–drone routing in smart logistics: A comprehensive review\",\"authors\":\"Ali Arishi , Paras Ahuja\",\"doi\":\"10.1016/j.compeleceng.2025.110529\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The surge in e-commerce and demand for same-day delivery have intensified the need for efficient last-mile logistics solutions. Traditional vehicle routing problems fall short in addressing real-world constraints such as traffic congestion, energy limitations, and dynamic delivery demands. In response, Multi-Agent Reinforcement Learning (MARL) has emerged as a transformative approach for optimizing truck–drone collaboration, enabling decentralized decision-making, real-time re-routing, and intelligent resource allocation.</div><div>This paper presents a comprehensive review of MARL-based truck–drone logistics, categorizing recent advancements in value-based, policy-based, and hybrid learning approaches, including Deep Q-learning, proximal policy optimization, and multi-agent deep deterministic policy gradient, and meta-reinforcement learning. It proposes a structured taxonomy based on learning strategies, optimization objectives, and environmental constraints. Key performance drivers such as delivery efficiency, energy-aware scheduling, and scalability are examined alongside real-world limitations, including traffic congestion, regulatory compliance, infrastructure dependence, and weather variability.</div><div>The paper also identifies critical challenges – computational complexity, communication overhead, and adversarial robustness and outlines future research directions, including hybrid learning architectures, blockchain-secured coordination, edge intelligence, and sustainability – driven MARL optimization. By synthesizing state-of-the-art research and identifying actionable pathways, this review provides foundational insights for advancing intelligent, adaptive, and eco-efficient truck–drone delivery systems.</div></div>\",\"PeriodicalId\":50630,\"journal\":{\"name\":\"Computers & Electrical Engineering\",\"volume\":\"127 \",\"pages\":\"Article 110529\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Electrical Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0045790625004720\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625004720","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
Multi-Agent Reinforcement Learning for truck–drone routing in smart logistics: A comprehensive review
The surge in e-commerce and demand for same-day delivery have intensified the need for efficient last-mile logistics solutions. Traditional vehicle routing problems fall short in addressing real-world constraints such as traffic congestion, energy limitations, and dynamic delivery demands. In response, Multi-Agent Reinforcement Learning (MARL) has emerged as a transformative approach for optimizing truck–drone collaboration, enabling decentralized decision-making, real-time re-routing, and intelligent resource allocation.
This paper presents a comprehensive review of MARL-based truck–drone logistics, categorizing recent advancements in value-based, policy-based, and hybrid learning approaches, including Deep Q-learning, proximal policy optimization, and multi-agent deep deterministic policy gradient, and meta-reinforcement learning. It proposes a structured taxonomy based on learning strategies, optimization objectives, and environmental constraints. Key performance drivers such as delivery efficiency, energy-aware scheduling, and scalability are examined alongside real-world limitations, including traffic congestion, regulatory compliance, infrastructure dependence, and weather variability.
The paper also identifies critical challenges – computational complexity, communication overhead, and adversarial robustness and outlines future research directions, including hybrid learning architectures, blockchain-secured coordination, edge intelligence, and sustainability – driven MARL optimization. By synthesizing state-of-the-art research and identifying actionable pathways, this review provides foundational insights for advancing intelligent, adaptive, and eco-efficient truck–drone delivery systems.
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.