Deep Reinforcement Learning for Optimal Resource Allocation in Blockchain-based IoV Secure Systems

2020 16th International Conference on Mobility, Sensing and Networking (MSN) Pub Date : 2020-12-01 DOI:10.1109/MSN50589.2020.00036

Hongzhi Xiao, Chen Qiu, Qinglin Yang, Huakun Huang, Junbo Wang, Chunhua Su

{"title":"Deep Reinforcement Learning for Optimal Resource Allocation in Blockchain-based IoV Secure Systems","authors":"Hongzhi Xiao, Chen Qiu, Qinglin Yang, Huakun Huang, Junbo Wang, Chunhua Su","doi":"10.1109/MSN50589.2020.00036","DOIUrl":null,"url":null,"abstract":"Driven by the advanced technologies of vehicular communications and networking, the Internet of Vehicles (IoV) has become an emerging paradigm in smart world. However, privacy and security are still quite critical issues for the current IoV system because of various sensitive information and the centralized interaction architecture. To address these challenges, a decentralized architecture is proposed to develop a blockchain-supported IoV (BS-IoV) system. In the BS-IoV system, the Roadside Units (RSUs) are redesigned for Mobile Edge Computing (MEC). Except for information collection and communication, the RSUs also need to audit the data uploaded by vehicles, packing data as block transactions to guarantee high-quality data sharing. However, since block generating is critical resource-consuming, the distributed database will cost high computing power. Additionally, due to the dynamical variation environment of traffic system, the computing resource is quite difficult to be allocated. In this paper, to solve the above problems, we propose a Deep Reinforcement Learning (DRL) based algorithm for resource optimization in the BS-IoV system. Specifically, to maximize the satisfaction of the system and users, we formulate a resource optimization problem and exploit the DRL-based algorithm to determine the allocation scheme. The evaluation of the proposed learning scheme is performed in the SUMO with Flow, which is a professional simulation tool for traffic simulation with reinforcement learning functions interfaces. Evaluation results have demonstrated good effectiveness of the proposed scheme.","PeriodicalId":447605,"journal":{"name":"2020 16th International Conference on Mobility, Sensing and Networking (MSN)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 16th International Conference on Mobility, Sensing and Networking (MSN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MSN50589.2020.00036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Driven by the advanced technologies of vehicular communications and networking, the Internet of Vehicles (IoV) has become an emerging paradigm in smart world. However, privacy and security are still quite critical issues for the current IoV system because of various sensitive information and the centralized interaction architecture. To address these challenges, a decentralized architecture is proposed to develop a blockchain-supported IoV (BS-IoV) system. In the BS-IoV system, the Roadside Units (RSUs) are redesigned for Mobile Edge Computing (MEC). Except for information collection and communication, the RSUs also need to audit the data uploaded by vehicles, packing data as block transactions to guarantee high-quality data sharing. However, since block generating is critical resource-consuming, the distributed database will cost high computing power. Additionally, due to the dynamical variation environment of traffic system, the computing resource is quite difficult to be allocated. In this paper, to solve the above problems, we propose a Deep Reinforcement Learning (DRL) based algorithm for resource optimization in the BS-IoV system. Specifically, to maximize the satisfaction of the system and users, we formulate a resource optimization problem and exploit the DRL-based algorithm to determine the allocation scheme. The evaluation of the proposed learning scheme is performed in the SUMO with Flow, which is a professional simulation tool for traffic simulation with reinforcement learning functions interfaces. Evaluation results have demonstrated good effectiveness of the proposed scheme.

查看原文本刊更多论文

基于区块链的车联网安全系统资源优化配置的深度强化学习

在先进的车辆通信和网络技术的推动下，车联网(IoV)已成为智能世界的新兴范式。然而，由于当前车联网系统中存在各种敏感信息和集中式交互架构，隐私和安全仍然是非常关键的问题。为了应对这些挑战，提出了一种分散的架构来开发区块链支持的车联网(BS-IoV)系统。在BS-IoV系统中，路边单元(rsu)针对移动边缘计算(MEC)进行了重新设计。除了信息采集和通信外，rsu还需要对车辆上传的数据进行审计，将数据打包为块交易，以保证高质量的数据共享。然而，由于区块生成是一个重要的资源消耗，分布式数据库将消耗大量的计算能力。此外，由于交通系统的动态变化环境，计算资源的分配相当困难。为了解决上述问题，本文提出了一种基于深度强化学习(Deep Reinforcement Learning, DRL)的车联网系统资源优化算法。具体来说，为了使系统和用户的满意度最大化，我们提出了一个资源优化问题，并利用基于drl的算法来确定分配方案。在SUMO with Flow中对所提出的学习方案进行了评估，SUMO with Flow是具有强化学习功能接口的交通仿真专业仿真工具。评价结果表明该方案具有良好的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 16th International Conference on Mobility, Sensing and Networking (MSN)

自引率

0.00%

发文量