A comparison of reinforcement learning based approaches to appliance scheduling

Namit Chauhan, Neha Choudhary, K. George
{"title":"A comparison of reinforcement learning based approaches to appliance scheduling","authors":"Namit Chauhan, Neha Choudhary, K. George","doi":"10.1109/IC3I.2016.7917970","DOIUrl":null,"url":null,"abstract":"Reinforcement learning is often proposed as a technique for intelligent control in a smart home setup with dynamic real-time energy pricing and advanced sub-metering infrastructure. In this paper, we introduce a variation of State Action Reward State Action (SARSA) as an optimization algorithm for appliance scheduling in smart homes with multiple appliances and compare it with the popular reinforcement learning method Q-learning. A simple, intuitive and unique treelike Markov decision process (MDP) structure of appliances is proposed which takes into account the states, such as on/off/runtime status, of all schedulable appliances but does not require the knowledge of the state to state transition probabilities.","PeriodicalId":305971,"journal":{"name":"2016 2nd International Conference on Contemporary Computing and Informatics (IC3I)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 2nd International Conference on Contemporary Computing and Informatics (IC3I)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3I.2016.7917970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

Abstract

Reinforcement learning is often proposed as a technique for intelligent control in a smart home setup with dynamic real-time energy pricing and advanced sub-metering infrastructure. In this paper, we introduce a variation of State Action Reward State Action (SARSA) as an optimization algorithm for appliance scheduling in smart homes with multiple appliances and compare it with the popular reinforcement learning method Q-learning. A simple, intuitive, and unique tree-like Markov decision process (MDP) structure of appliances is proposed, which takes into account the states (such as on/off/runtime status) of all schedulable appliances but does not require knowledge of the state-to-state transition probabilities.
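The abstract contrasts two tabular reinforcement-learning methods, SARSA (on-policy) and Q-learning (off-policy). The sketch below shows only the standard update rules to make that contrast concrete; the state encoding, the per-appliance action set (`keep_off`/`turn_on`), the reward signal, and the hyperparameters are illustrative assumptions and are not taken from the paper's tree-like appliance MDP.

```python
import random
from collections import defaultdict

# Minimal, generic sketch of tabular SARSA vs. Q-learning updates.
# All names below (ACTIONS, reward meaning, hyperparameters) are assumptions
# for illustration, not the paper's actual appliance-scheduling setup.

ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1
ACTIONS = ["keep_off", "turn_on"]   # hypothetical per-appliance actions

Q = defaultdict(float)              # Q[(state, action)] -> estimated value


def epsilon_greedy(state):
    """Choose an action epsilon-greedily with respect to the current Q-table."""
    if random.random() < EPSILON:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q[(state, a)])


def sarsa_update(s, a, r, s_next, a_next):
    """On-policy update: bootstraps on the action actually taken in s_next."""
    target = r + GAMMA * Q[(s_next, a_next)]
    Q[(s, a)] += ALPHA * (target - Q[(s, a)])


def q_learning_update(s, a, r, s_next):
    """Off-policy update: bootstraps on the greedy action in s_next."""
    target = r + GAMMA * max(Q[(s_next, b)] for b in ACTIONS)
    Q[(s, a)] += ALPHA * (target - Q[(s, a)])
```

The only difference between the two updates is the bootstrap target: SARSA uses the value of the action the behavior policy actually selects next, while Q-learning uses the maximum over next-state actions, which is what makes it off-policy.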