Non-Technical Losses Detection in Electric Distribution Systems Using BERT and GAN

Jia-He Lim, Yu-Wen Chen, C. Chu
{"title":"Non-Technical Losses Detection in Electric Distribution Systems Using BERT and GAN","authors":"Jia-He Lim, Yu-Wen Chen, C. Chu","doi":"10.1109/IAS54023.2022.9939802","DOIUrl":null,"url":null,"abstract":"Non-technical losses have caused lots of revenue loss in many electric utility companies around the world. In current practices, manual analysis on collected power consumption data first. Then, on-site inspections are conducted. With recent advances of machine learning techniques, several works have been developed to solve this task in a more effective manner. However, most existing machine learning approaches still require the feature extraction step. Moreover, most current studies overlook the imbalanced dataset from consumer's power meters. This paper proposes a new approach to deal with these problems by integrating two deep machine learning techniques. First, we use Bidirectional Encoder Representations from Transformers (BERT) to remove the feature extraction step. Meanwhile, the generative adversarial network (GAN) is considered to generate fake data to increase the number of the minority class in the imbalanced dataset. The effectiveness of the proposed method has been evaluated on various metrics. Experimental results demonstrated that the proposed method can indeed improve the recall and F1-score significantly.","PeriodicalId":193587,"journal":{"name":"2022 IEEE Industry Applications Society Annual Meeting (IAS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Industry Applications Society Annual Meeting (IAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAS54023.2022.9939802","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Non-technical losses have caused lots of revenue loss in many electric utility companies around the world. In current practices, manual analysis on collected power consumption data first. Then, on-site inspections are conducted. With recent advances of machine learning techniques, several works have been developed to solve this task in a more effective manner. However, most existing machine learning approaches still require the feature extraction step. Moreover, most current studies overlook the imbalanced dataset from consumer's power meters. This paper proposes a new approach to deal with these problems by integrating two deep machine learning techniques. First, we use Bidirectional Encoder Representations from Transformers (BERT) to remove the feature extraction step. Meanwhile, the generative adversarial network (GAN) is considered to generate fake data to increase the number of the minority class in the imbalanced dataset. The effectiveness of the proposed method has been evaluated on various metrics. Experimental results demonstrated that the proposed method can indeed improve the recall and F1-score significantly.
基于BERT和GAN的配电系统非技术损耗检测
非技术损失给世界上许多电力公司造成了巨大的收入损失。在目前的实践中,首先对采集到的功耗数据进行人工分析。然后进行现场检查。随着最近机器学习技术的进步,已经开发了一些工作来以更有效的方式解决这个任务。然而,大多数现有的机器学习方法仍然需要特征提取步骤。此外,目前的大多数研究都忽略了来自消费者电表的不平衡数据。本文提出了一种结合两种深度机器学习技术来处理这些问题的新方法。首先,我们使用来自变压器的双向编码器表示(BERT)来去除特征提取步骤。同时,考虑生成对抗网络(GAN)生成假数据,以增加不平衡数据集中少数类的数量。所提出的方法的有效性已经在各种指标上进行了评估。实验结果表明,该方法确实能显著提高查全率和f1分数。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信