Xin Gao, Xueyuan Li, Hao Liu, Ao Li, Zhaoyang Ma, Zirui Li
{"title":"基于嵌套图强化学习的生态排队决策策略","authors":"Xin Gao, Xueyuan Li, Hao Liu, Ao Li, Zhaoyang Ma, Zirui Li","doi":"arxiv-2408.07578","DOIUrl":null,"url":null,"abstract":"Platooning technology is renowned for its precise vehicle control, traffic\nflow optimization, and energy efficiency enhancement. However, in large-scale\nmixed platoons, vehicle heterogeneity and unpredictable traffic conditions lead\nto virtual bottlenecks. These bottlenecks result in reduced traffic throughput\nand increased energy consumption within the platoon. To address these\nchallenges, we introduce a decision-making strategy based on nested graph\nreinforcement learning. This strategy improves collaborative decision-making,\nensuring energy efficiency and alleviating congestion. We propose a theory of\nnested traffic graph representation that maps dynamic interactions between\nvehicles and platoons in non-Euclidean spaces. By incorporating spatio-temporal\nweighted graph into a multi-head attention mechanism, we further enhance the\nmodel's capacity to process both local and global data. Additionally, we have\ndeveloped a nested graph reinforcement learning framework to enhance the\nself-iterative learning capabilities of platooning. Using the I-24 dataset, we\ndesigned and conducted comparative algorithm experiments, generalizability\ntesting, and permeability ablation experiments, thereby validating the proposed\nstrategy's effectiveness. Compared to the baseline, our strategy increases\nthroughput by 10% and decreases energy use by 9%. Specifically, increasing the\npenetration rate of CAVs significantly enhances traffic throughput, though it\nalso increases energy consumption.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"70 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Nested Graph Reinforcement Learning-based Decision-making Strategy for Eco-platooning\",\"authors\":\"Xin Gao, Xueyuan Li, Hao Liu, Ao Li, Zhaoyang Ma, Zirui Li\",\"doi\":\"arxiv-2408.07578\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Platooning technology is renowned for its precise vehicle control, traffic\\nflow optimization, and energy efficiency enhancement. However, in large-scale\\nmixed platoons, vehicle heterogeneity and unpredictable traffic conditions lead\\nto virtual bottlenecks. These bottlenecks result in reduced traffic throughput\\nand increased energy consumption within the platoon. To address these\\nchallenges, we introduce a decision-making strategy based on nested graph\\nreinforcement learning. This strategy improves collaborative decision-making,\\nensuring energy efficiency and alleviating congestion. We propose a theory of\\nnested traffic graph representation that maps dynamic interactions between\\nvehicles and platoons in non-Euclidean spaces. By incorporating spatio-temporal\\nweighted graph into a multi-head attention mechanism, we further enhance the\\nmodel's capacity to process both local and global data. Additionally, we have\\ndeveloped a nested graph reinforcement learning framework to enhance the\\nself-iterative learning capabilities of platooning. Using the I-24 dataset, we\\ndesigned and conducted comparative algorithm experiments, generalizability\\ntesting, and permeability ablation experiments, thereby validating the proposed\\nstrategy's effectiveness. Compared to the baseline, our strategy increases\\nthroughput by 10% and decreases energy use by 9%. Specifically, increasing the\\npenetration rate of CAVs significantly enhances traffic throughput, though it\\nalso increases energy consumption.\",\"PeriodicalId\":501315,\"journal\":{\"name\":\"arXiv - CS - Multiagent Systems\",\"volume\":\"70 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Multiagent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.07578\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Multiagent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.07578","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Nested Graph Reinforcement Learning-based Decision-making Strategy for Eco-platooning
Platooning technology is renowned for its precise vehicle control, traffic
flow optimization, and energy efficiency enhancement. However, in large-scale
mixed platoons, vehicle heterogeneity and unpredictable traffic conditions lead
to virtual bottlenecks. These bottlenecks result in reduced traffic throughput
and increased energy consumption within the platoon. To address these
challenges, we introduce a decision-making strategy based on nested graph
reinforcement learning. This strategy improves collaborative decision-making,
ensuring energy efficiency and alleviating congestion. We propose a theory of
nested traffic graph representation that maps dynamic interactions between
vehicles and platoons in non-Euclidean spaces. By incorporating spatio-temporal
weighted graph into a multi-head attention mechanism, we further enhance the
model's capacity to process both local and global data. Additionally, we have
developed a nested graph reinforcement learning framework to enhance the
self-iterative learning capabilities of platooning. Using the I-24 dataset, we
designed and conducted comparative algorithm experiments, generalizability
testing, and permeability ablation experiments, thereby validating the proposed
strategy's effectiveness. Compared to the baseline, our strategy increases
throughput by 10% and decreases energy use by 9%. Specifically, increasing the
penetration rate of CAVs significantly enhances traffic throughput, though it
also increases energy consumption.