Robust resampling and stacked learning models for electricity theft detection in smart grid

IF 4.7, Zone 3 (Engineering Technology), JCR Q2 (ENERGY & FUELS)
Ashraf Ullah , Inam Ullah Khan , Muhammad Zeeshan Younas , Maqbool Ahmad , Natalia Kryvinska
Energy Reports, Volume 13, Pages 770-779. Published 2024-12-26.
DOI: 10.1016/j.egyr.2024.12.041
https://www.sciencedirect.com/science/article/pii/S235248472400859X
Citation count: 0

Abstract

Electricity theft (ET) is a critical contributor to non-technical losses (NTLs), which significantly threaten the efficiency and reliability of power grids and lead to increased power wastage and financial losses. Despite the development of various artificial intelligence (AI)-based machine learning (ML) and deep learning (DL) approaches for electricity theft detection (ETD), existing methods often exhibit limitations in memorization and generalization, particularly when applied to large-scale electricity consumption datasets characterized by high variance, missing values, and complex nonlinear relationships. These challenges can produce models that suffer from high variance and bias, reducing their effectiveness in accurately predicting electricity theft cases. To address these limitations, we propose a three-layer framework that employs a stacking ensemble model to combine the benefits of both ML and DL algorithms. In the first layer, data preprocessing, missing data is imputed through interpolation and the data is normalized through min–max scaling. To address the severe class imbalance prevalent in most real-world datasets, we combine the borderline synthetic minority oversampling technique (Borderline-SMOTE) with near-miss undersampling. In the final layer of the proposed ETD framework, we employ four ML base classifiers and five candidate meta-classifiers. The outputs of the base classifiers are aggregated and passed to a meta-classifier, for which we evaluate four recurrent neural networks (RNNs) and a convolutional neural network (CNN). The RNN variants are long short-term memory (LSTM), gated recurrent unit (GRU), bidirectional LSTM (Bi-LSTM), and bidirectional GRU (Bi-GRU). Experimental results show that the proposed Bi-GRU meta-classifier achieves higher detection accuracy than the other meta-classifiers and other state-of-the-art models used for ETD.
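
The preprocessing layer described in the abstract (interpolation of missing readings followed by min–max scaling) can be illustrated with a minimal sketch. The abstract does not say which interpolation scheme is used, so linear interpolation between the nearest valid neighbours is an assumption here, as is copying the nearest valid reading into leading or trailing gaps. The function names and the sample readings are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Optional, Sequence


def _is_missing(v: Optional[float]) -> bool:
    """Treat None and NaN as missing readings."""
    return v is None or (isinstance(v, float) and math.isnan(v))


def interpolate_missing(series: Sequence[Optional[float]]) -> List[float]:
    """Fill gaps linearly between the nearest valid neighbours.

    Assumption: leading/trailing gaps copy the nearest valid reading.
    """
    known = [i for i, v in enumerate(series) if not _is_missing(v)]
    if not known:
        raise ValueError("series has no valid readings")
    filled: List[float] = []
    for i, v in enumerate(series):
        if not _is_missing(v):
            filled.append(float(v))
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            filled.append(float(series[right]))
        elif right is None:
            filled.append(float(series[left]))
        else:
            # Weight by the position of the gap between its two neighbours.
            w = (i - left) / (right - left)
            lo, hi = float(series[left]), float(series[right])
            filled.append(lo + w * (hi - lo))
    return filled


def min_max_scale(series: Sequence[float]) -> List[float]:
    """Rescale values to [0, 1]; a constant series maps to all zeros."""
    lo, hi = min(series), max(series)
    if hi == lo:
        return [0.0 for _ in series]
    return [(v - lo) / (hi - lo) for v in series]


# Hypothetical daily consumption readings for one customer (kWh).
readings = [None, 2.0, None, 6.0, 4.0, float("nan")]
filled = interpolate_missing(readings)
scaled = min_max_scale(filled)
```

In this sketch each customer's series is scaled independently; whether the paper scales per customer or across the whole dataset is not stated in the abstract.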
Source journal: Energy Reports (Energy: General Energy)
CiteScore: 8.20
Self-citation rate: 13.50%
Articles published: 2608
Review time: 38 days
Journal description: Energy Reports is a new online multidisciplinary open access journal which focuses on publishing new research in the area of Energy with a rapid review and publication time. Energy Reports will be open to direct submissions and also to submissions from other Elsevier Energy journals, whose Editors have determined that Energy Reports would be a better fit.