智能物流中卡车-无人机路线的多智能体强化学习：综述

IF 4.9 3区计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Computers & Electrical Engineering Pub Date : 2025-07-02 DOI:10.1016/j.compeleceng.2025.110529

Ali Arishi , Paras Ahuja

{"title":"智能物流中卡车-无人机路线的多智能体强化学习：综述","authors":"Ali Arishi , Paras Ahuja","doi":"10.1016/j.compeleceng.2025.110529","DOIUrl":null,"url":null,"abstract":"<div><div>The surge in e-commerce and demand for same-day delivery have intensified the need for efficient last-mile logistics solutions. Traditional vehicle routing problems fall short in addressing real-world constraints such as traffic congestion, energy limitations, and dynamic delivery demands. In response, Multi-Agent Reinforcement Learning (MARL) has emerged as a transformative approach for optimizing truck–drone collaboration, enabling decentralized decision-making, real-time re-routing, and intelligent resource allocation.</div><div>This paper presents a comprehensive review of MARL-based truck–drone logistics, categorizing recent advancements in value-based, policy-based, and hybrid learning approaches, including Deep Q-learning, proximal policy optimization, and multi-agent deep deterministic policy gradient, and meta-reinforcement learning. It proposes a structured taxonomy based on learning strategies, optimization objectives, and environmental constraints. Key performance drivers such as delivery efficiency, energy-aware scheduling, and scalability are examined alongside real-world limitations, including traffic congestion, regulatory compliance, infrastructure dependence, and weather variability.</div><div>The paper also identifies critical challenges – computational complexity, communication overhead, and adversarial robustness and outlines future research directions, including hybrid learning architectures, blockchain-secured coordination, edge intelligence, and sustainability – driven MARL optimization. By synthesizing state-of-the-art research and identifying actionable pathways, this review provides foundational insights for advancing intelligent, adaptive, and eco-efficient truck–drone delivery systems.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"127 ","pages":"Article 110529"},"PeriodicalIF":4.9000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Agent Reinforcement Learning for truck–drone routing in smart logistics: A comprehensive review\",\"authors\":\"Ali Arishi , Paras Ahuja\",\"doi\":\"10.1016/j.compeleceng.2025.110529\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The surge in e-commerce and demand for same-day delivery have intensified the need for efficient last-mile logistics solutions. Traditional vehicle routing problems fall short in addressing real-world constraints such as traffic congestion, energy limitations, and dynamic delivery demands. In response, Multi-Agent Reinforcement Learning (MARL) has emerged as a transformative approach for optimizing truck–drone collaboration, enabling decentralized decision-making, real-time re-routing, and intelligent resource allocation.</div><div>This paper presents a comprehensive review of MARL-based truck–drone logistics, categorizing recent advancements in value-based, policy-based, and hybrid learning approaches, including Deep Q-learning, proximal policy optimization, and multi-agent deep deterministic policy gradient, and meta-reinforcement learning. It proposes a structured taxonomy based on learning strategies, optimization objectives, and environmental constraints. Key performance drivers such as delivery efficiency, energy-aware scheduling, and scalability are examined alongside real-world limitations, including traffic congestion, regulatory compliance, infrastructure dependence, and weather variability.</div><div>The paper also identifies critical challenges – computational complexity, communication overhead, and adversarial robustness and outlines future research directions, including hybrid learning architectures, blockchain-secured coordination, edge intelligence, and sustainability – driven MARL optimization. By synthesizing state-of-the-art research and identifying actionable pathways, this review provides foundational insights for advancing intelligent, adaptive, and eco-efficient truck–drone delivery systems.</div></div>\",\"PeriodicalId\":50630,\"journal\":{\"name\":\"Computers & Electrical Engineering\",\"volume\":\"127 \",\"pages\":\"Article 110529\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Electrical Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0045790625004720\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625004720","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

摘要

电子商务的激增和对当日送达的需求加剧了对高效的最后一英里物流解决方案的需求。传统的车辆路径问题在解决交通拥堵、能源限制和动态交付需求等现实约束方面存在不足。作为回应，多智能体强化学习（MARL）已经成为优化卡车-无人机协作、实现分散决策、实时重新路由和智能资源分配的变革性方法。本文全面回顾了基于marl的卡车-无人机物流，对基于价值、基于策略和混合学习方法的最新进展进行了分类，包括深度q学习、近端策略优化、多智能体深度确定性策略梯度和元强化学习。它提出了一种基于学习策略、优化目标和环境约束的结构化分类法。关键性能驱动因素，如交付效率、能源感知调度和可扩展性，以及现实世界的限制，包括交通拥堵、法规遵从性、基础设施依赖和天气可变性。本文还确定了关键挑战——计算复杂性、通信开销和对抗鲁棒性，并概述了未来的研究方向，包括混合学习架构、区块链安全协调、边缘智能和可持续性驱动的MARL优化。通过综合最新的研究和确定可行的途径，本综述为推进智能、自适应和生态高效的卡车-无人机运输系统提供了基础见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multi-Agent Reinforcement Learning for truck–drone routing in smart logistics: A comprehensive review

The surge in e-commerce and demand for same-day delivery have intensified the need for efficient last-mile logistics solutions. Traditional vehicle routing problems fall short in addressing real-world constraints such as traffic congestion, energy limitations, and dynamic delivery demands. In response, Multi-Agent Reinforcement Learning (MARL) has emerged as a transformative approach for optimizing truck–drone collaboration, enabling decentralized decision-making, real-time re-routing, and intelligent resource allocation.

This paper presents a comprehensive review of MARL-based truck–drone logistics, categorizing recent advancements in value-based, policy-based, and hybrid learning approaches, including Deep Q-learning, proximal policy optimization, and multi-agent deep deterministic policy gradient, and meta-reinforcement learning. It proposes a structured taxonomy based on learning strategies, optimization objectives, and environmental constraints. Key performance drivers such as delivery efficiency, energy-aware scheduling, and scalability are examined alongside real-world limitations, including traffic congestion, regulatory compliance, infrastructure dependence, and weather variability.

The paper also identifies critical challenges – computational complexity, communication overhead, and adversarial robustness and outlines future research directions, including hybrid learning architectures, blockchain-secured coordination, edge intelligence, and sustainability – driven MARL optimization. By synthesizing state-of-the-art research and identifying actionable pathways, this review provides foundational insights for advancing intelligent, adaptive, and eco-efficient truck–drone delivery systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computers & Electrical Engineering 工程技术-工程：电子与电气

CiteScore

9.20

自引率

7.00%

发文量

661

审稿时长

47 days

期刊介绍： The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency. Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.