Deep Reinforcement Learning for Optimal Resource Allocation in Blockchain-based IoV Secure Systems

Hongzhi Xiao, Chen Qiu, Qinglin Yang, Huakun Huang, Junbo Wang, Chunhua Su
{"title":"Deep Reinforcement Learning for Optimal Resource Allocation in Blockchain-based IoV Secure Systems","authors":"Hongzhi Xiao, Chen Qiu, Qinglin Yang, Huakun Huang, Junbo Wang, Chunhua Su","doi":"10.1109/MSN50589.2020.00036","DOIUrl":null,"url":null,"abstract":"Driven by the advanced technologies of vehicular communications and networking, the Internet of Vehicles (IoV) has become an emerging paradigm in smart world. However, privacy and security are still quite critical issues for the current IoV system because of various sensitive information and the centralized interaction architecture. To address these challenges, a decentralized architecture is proposed to develop a blockchain-supported IoV (BS-IoV) system. In the BS-IoV system, the Roadside Units (RSUs) are redesigned for Mobile Edge Computing (MEC). Except for information collection and communication, the RSUs also need to audit the data uploaded by vehicles, packing data as block transactions to guarantee high-quality data sharing. However, since block generating is critical resource-consuming, the distributed database will cost high computing power. Additionally, due to the dynamical variation environment of traffic system, the computing resource is quite difficult to be allocated. In this paper, to solve the above problems, we propose a Deep Reinforcement Learning (DRL) based algorithm for resource optimization in the BS-IoV system. Specifically, to maximize the satisfaction of the system and users, we formulate a resource optimization problem and exploit the DRL-based algorithm to determine the allocation scheme. The evaluation of the proposed learning scheme is performed in the SUMO with Flow, which is a professional simulation tool for traffic simulation with reinforcement learning functions interfaces. Evaluation results have demonstrated good effectiveness of the proposed scheme.","PeriodicalId":447605,"journal":{"name":"2020 16th International Conference on Mobility, Sensing and Networking (MSN)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 16th International Conference on Mobility, Sensing and Networking (MSN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MSN50589.2020.00036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Driven by the advanced technologies of vehicular communications and networking, the Internet of Vehicles (IoV) has become an emerging paradigm in smart world. However, privacy and security are still quite critical issues for the current IoV system because of various sensitive information and the centralized interaction architecture. To address these challenges, a decentralized architecture is proposed to develop a blockchain-supported IoV (BS-IoV) system. In the BS-IoV system, the Roadside Units (RSUs) are redesigned for Mobile Edge Computing (MEC). Except for information collection and communication, the RSUs also need to audit the data uploaded by vehicles, packing data as block transactions to guarantee high-quality data sharing. However, since block generating is critical resource-consuming, the distributed database will cost high computing power. Additionally, due to the dynamical variation environment of traffic system, the computing resource is quite difficult to be allocated. In this paper, to solve the above problems, we propose a Deep Reinforcement Learning (DRL) based algorithm for resource optimization in the BS-IoV system. Specifically, to maximize the satisfaction of the system and users, we formulate a resource optimization problem and exploit the DRL-based algorithm to determine the allocation scheme. The evaluation of the proposed learning scheme is performed in the SUMO with Flow, which is a professional simulation tool for traffic simulation with reinforcement learning functions interfaces. Evaluation results have demonstrated good effectiveness of the proposed scheme.
基于区块链的车联网安全系统资源优化配置的深度强化学习
在先进的车辆通信和网络技术的推动下,车联网(IoV)已成为智能世界的新兴范式。然而,由于当前车联网系统中存在各种敏感信息和集中式交互架构,隐私和安全仍然是非常关键的问题。为了应对这些挑战,提出了一种分散的架构来开发区块链支持的车联网(BS-IoV)系统。在BS-IoV系统中,路边单元(rsu)针对移动边缘计算(MEC)进行了重新设计。除了信息采集和通信外,rsu还需要对车辆上传的数据进行审计,将数据打包为块交易,以保证高质量的数据共享。然而,由于区块生成是一个重要的资源消耗,分布式数据库将消耗大量的计算能力。此外,由于交通系统的动态变化环境,计算资源的分配相当困难。为了解决上述问题,本文提出了一种基于深度强化学习(Deep Reinforcement Learning, DRL)的车联网系统资源优化算法。具体来说,为了使系统和用户的满意度最大化,我们提出了一个资源优化问题,并利用基于drl的算法来确定分配方案。在SUMO with Flow中对所提出的学习方案进行了评估,SUMO with Flow是具有强化学习功能接口的交通仿真专业仿真工具。评价结果表明该方案具有良好的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信