{"title":"基于深度强化学习的建筑物自适应异常检测","authors":"Tong Wu, Jorge Ortiz","doi":"10.1145/3360322.3361011","DOIUrl":null,"url":null,"abstract":"In this paper, we present early results on the use of deep reinforcement learning (DRL) for maximizing anomaly detection performance in buildings. We conjecture that DRL can improve performance by exploring the entire parameter space for all sensors, individually. Many anomaly detection algorithms are designed to use a single parameter for ease of use, however there are usually many parameter values that are pre-set, a priori. We hypothesize that a single threshold cannot work well for all sensors and propose the use of DRL to explore the entire parameter space. We use a deterministic policy gradient algorithm - Deep Deterministic Policy Gradient (DDPG)[4] - and use a building-specific anomaly detection algorithm, Strip, Bind, and Search (SBS) [2]. We find that while the maximum performance achieved by both approaches is similar, the DRL-based approach is significantly less biased, more consistent - up to 3x smaller standard deviation across individual sensor scores.","PeriodicalId":128826,"journal":{"name":"Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Towards Adaptive Anomaly Detection in Buildings with Deep Reinforcement Learning\",\"authors\":\"Tong Wu, Jorge Ortiz\",\"doi\":\"10.1145/3360322.3361011\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present early results on the use of deep reinforcement learning (DRL) for maximizing anomaly detection performance in buildings. We conjecture that DRL can improve performance by exploring the entire parameter space for all sensors, individually. Many anomaly detection algorithms are designed to use a single parameter for ease of use, however there are usually many parameter values that are pre-set, a priori. We hypothesize that a single threshold cannot work well for all sensors and propose the use of DRL to explore the entire parameter space. We use a deterministic policy gradient algorithm - Deep Deterministic Policy Gradient (DDPG)[4] - and use a building-specific anomaly detection algorithm, Strip, Bind, and Search (SBS) [2]. We find that while the maximum performance achieved by both approaches is similar, the DRL-based approach is significantly less biased, more consistent - up to 3x smaller standard deviation across individual sensor scores.\",\"PeriodicalId\":128826,\"journal\":{\"name\":\"Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3360322.3361011\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3360322.3361011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Towards Adaptive Anomaly Detection in Buildings with Deep Reinforcement Learning
In this paper, we present early results on the use of deep reinforcement learning (DRL) for maximizing anomaly detection performance in buildings. We conjecture that DRL can improve performance by exploring the entire parameter space for all sensors, individually. Many anomaly detection algorithms are designed to use a single parameter for ease of use, however there are usually many parameter values that are pre-set, a priori. We hypothesize that a single threshold cannot work well for all sensors and propose the use of DRL to explore the entire parameter space. We use a deterministic policy gradient algorithm - Deep Deterministic Policy Gradient (DDPG)[4] - and use a building-specific anomaly detection algorithm, Strip, Bind, and Search (SBS) [2]. We find that while the maximum performance achieved by both approaches is similar, the DRL-based approach is significantly less biased, more consistent - up to 3x smaller standard deviation across individual sensor scores.