Achieving Scalable Model-Free Demand Response in Charging an Electric Vehicle Fleet with Reinforcement Learning

Proceedings of the Ninth International Conference on Future Energy Systems Pub Date : 2018-06-12 DOI:10.1145/3208903.3212042

Nasrin Sadeghianpourhamami, Johannes Deleu, Chris Develder

引用次数: 11

Abstract

To achieve coordinated electric vehicle (EV) charging with demand response (DR), a model-free approach using reinforcement learning (RL) is an attractive proposition. Using RL, the DR algorithm is defined as a Markov decision process (MDP). Initial work in this area comprises algorithms to control just one EV at a time, because of scalability challenges when taking coupling between EVs into account. In this paper, we propose a novel MDP definition for charging an EV fleet. More specifically, we propose (1) a relatively compact aggregate state and action space representation, and (2) a batch RL algorithm (i.e., an instance of fitted Q-iteration, FQI) to learn the optimal EV charging policy.

查看原文本刊更多论文

利用强化学习实现电动汽车充电中可扩展的无模型需求响应

为了实现具有需求响应(DR)的协调电动汽车(EV)充电，使用强化学习(RL)的无模型方法是一个有吸引力的提议。利用RL, DR算法被定义为马尔可夫决策过程(MDP)。该领域的初始工作包括一次只控制一辆电动汽车的算法，因为考虑到电动汽车之间的耦合时存在可扩展性挑战。在本文中，我们提出了一种新的针对电动汽车车队充电的MDP定义。更具体地说，我们提出了(1)一个相对紧凑的聚合状态和动作空间表示，(2)一个批处理RL算法(即拟合q -迭代的一个实例，FQI)来学习最优的电动汽车充电策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Ninth International Conference on Future Energy Systems

自引率

0.00%

发文量