Feature Extraction of Risk Group and Electricity Theft by using Electrical Profiles and Physical Data for Classification in the Power Utilities

Supakan Janthong, Rakkrit Duangsoithong, K. Chalermyanont
{"title":"Feature Extraction of Risk Group and Electricity Theft by using Electrical Profiles and Physical Data for Classification in the Power Utilities","authors":"Supakan Janthong, Rakkrit Duangsoithong, K. Chalermyanont","doi":"10.37936/ecti-cit.2024181.252738","DOIUrl":null,"url":null,"abstract":"Non-technical loss (NTL) is one of the problems that has been a major issue in lost revenue for many years. Electricity distributors have attempted to reduce NTL by detecting electricity theft using various methods. Some events are difficult to detect that conventional meters inspection is inadequate. Moreover, many anomaly patterns found are very complex, confusing in identifying or distinguishing what types of electricity customers are at abnormal risk or energy theft that affects NTL. This paper proposes five key feature extraction methods and six classifying electricity customers using supervised learning. The main problem was studied and collected information, including kilowatt meters, electronic meters, TOU meters, and AMR meters, which cover four customer types that were recorded in the Provincial Electricity Authority (PEA) of Thailand. An electrical profile to be extracted for in-depth analysis of the behavior of each type of electricity customer, combined with the information of physical data to help enhance and increase efficiency. All features examined the relationships in each feature using Pearson correlation and handled unbalanced data using random oversampling (ROS). Then, the extracted data has been trained, validated, and tested to classify three classes: normal, risk, and theft, where we evaluate the results with performance metrics. The results show that random forest (RF) outperforms the rest of the classifiers by achieving a precision-recall area under the curve of 90% and a receiver operating characteristic curve of 78%. Significantly, the results were compared to previous studies and benchmark datasets, which revealed that the proposed method gave better results than other techniques.","PeriodicalId":507234,"journal":{"name":"ECTI Transactions on Computer and Information Technology (ECTI-CIT)","volume":"32 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ECTI Transactions on Computer and Information Technology (ECTI-CIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37936/ecti-cit.2024181.252738","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Non-technical loss (NTL) is one of the problems that has been a major issue in lost revenue for many years. Electricity distributors have attempted to reduce NTL by detecting electricity theft using various methods. Some events are difficult to detect that conventional meters inspection is inadequate. Moreover, many anomaly patterns found are very complex, confusing in identifying or distinguishing what types of electricity customers are at abnormal risk or energy theft that affects NTL. This paper proposes five key feature extraction methods and six classifying electricity customers using supervised learning. The main problem was studied and collected information, including kilowatt meters, electronic meters, TOU meters, and AMR meters, which cover four customer types that were recorded in the Provincial Electricity Authority (PEA) of Thailand. An electrical profile to be extracted for in-depth analysis of the behavior of each type of electricity customer, combined with the information of physical data to help enhance and increase efficiency. All features examined the relationships in each feature using Pearson correlation and handled unbalanced data using random oversampling (ROS). Then, the extracted data has been trained, validated, and tested to classify three classes: normal, risk, and theft, where we evaluate the results with performance metrics. The results show that random forest (RF) outperforms the rest of the classifiers by achieving a precision-recall area under the curve of 90% and a receiver operating characteristic curve of 78%. Significantly, the results were compared to previous studies and benchmark datasets, which revealed that the proposed method gave better results than other techniques.
利用电力档案和物理数据对电力公司中的风险群体和窃电行为进行特征提取分类
非技术性损失(NTL)是多年来收入损失的主要问题之一。配电公司试图通过各种方法检测窃电行为来减少 NTL。有些事件很难被发现,传统的电表检查是不够的。此外,发现的许多异常模式非常复杂,在识别或区分哪些类型的电力客户存在异常风险或影响 NTL 的窃电行为时容易混淆。本文提出了五种关键特征提取方法和六种利用监督学习对电力客户进行分类的方法。主要问题是研究和收集信息,包括千瓦表、电子表、TOU 表和 AMR 表,涵盖泰国省电力局(PEA)记录的四种客户类型。将提取的电力概况与物理数据信息相结合,对各类电力客户的行为进行深入分析,以帮助增强和提高效率。所有特征均使用皮尔逊相关性检验每个特征中的关系,并使用随机超采样(ROS)处理不平衡数据。然后,对提取的数据进行训练、验证和测试,将其分为三个等级:正常、风险和盗窃,并用性能指标对结果进行评估。结果表明,随机森林(RF)的精度-召回曲线下面积达到 90%,接收者操作特征曲线达到 78%,优于其他分类器。值得注意的是,将结果与以前的研究和基准数据集进行比较后发现,所提出的方法比其他技术取得了更好的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信