{"title":"基于BERT和GAN的配电系统非技术损耗检测","authors":"Jia-He Lim, Yu-Wen Chen, C. Chu","doi":"10.1109/IAS54023.2022.9939802","DOIUrl":null,"url":null,"abstract":"Non-technical losses have caused lots of revenue loss in many electric utility companies around the world. In current practices, manual analysis on collected power consumption data first. Then, on-site inspections are conducted. With recent advances of machine learning techniques, several works have been developed to solve this task in a more effective manner. However, most existing machine learning approaches still require the feature extraction step. Moreover, most current studies overlook the imbalanced dataset from consumer's power meters. This paper proposes a new approach to deal with these problems by integrating two deep machine learning techniques. First, we use Bidirectional Encoder Representations from Transformers (BERT) to remove the feature extraction step. Meanwhile, the generative adversarial network (GAN) is considered to generate fake data to increase the number of the minority class in the imbalanced dataset. The effectiveness of the proposed method has been evaluated on various metrics. Experimental results demonstrated that the proposed method can indeed improve the recall and F1-score significantly.","PeriodicalId":193587,"journal":{"name":"2022 IEEE Industry Applications Society Annual Meeting (IAS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Non-Technical Losses Detection in Electric Distribution Systems Using BERT and GAN\",\"authors\":\"Jia-He Lim, Yu-Wen Chen, C. Chu\",\"doi\":\"10.1109/IAS54023.2022.9939802\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Non-technical losses have caused lots of revenue loss in many electric utility companies around the world. In current practices, manual analysis on collected power consumption data first. Then, on-site inspections are conducted. With recent advances of machine learning techniques, several works have been developed to solve this task in a more effective manner. However, most existing machine learning approaches still require the feature extraction step. Moreover, most current studies overlook the imbalanced dataset from consumer's power meters. This paper proposes a new approach to deal with these problems by integrating two deep machine learning techniques. First, we use Bidirectional Encoder Representations from Transformers (BERT) to remove the feature extraction step. Meanwhile, the generative adversarial network (GAN) is considered to generate fake data to increase the number of the minority class in the imbalanced dataset. The effectiveness of the proposed method has been evaluated on various metrics. Experimental results demonstrated that the proposed method can indeed improve the recall and F1-score significantly.\",\"PeriodicalId\":193587,\"journal\":{\"name\":\"2022 IEEE Industry Applications Society Annual Meeting (IAS)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE Industry Applications Society Annual Meeting (IAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IAS54023.2022.9939802\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Industry Applications Society Annual Meeting (IAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAS54023.2022.9939802","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Non-Technical Losses Detection in Electric Distribution Systems Using BERT and GAN
Non-technical losses have caused lots of revenue loss in many electric utility companies around the world. In current practices, manual analysis on collected power consumption data first. Then, on-site inspections are conducted. With recent advances of machine learning techniques, several works have been developed to solve this task in a more effective manner. However, most existing machine learning approaches still require the feature extraction step. Moreover, most current studies overlook the imbalanced dataset from consumer's power meters. This paper proposes a new approach to deal with these problems by integrating two deep machine learning techniques. First, we use Bidirectional Encoder Representations from Transformers (BERT) to remove the feature extraction step. Meanwhile, the generative adversarial network (GAN) is considered to generate fake data to increase the number of the minority class in the imbalanced dataset. The effectiveness of the proposed method has been evaluated on various metrics. Experimental results demonstrated that the proposed method can indeed improve the recall and F1-score significantly.