Outliers Detection and Repairing Technique for Measurement Data in the Distribution System

Muhammad Ahmad Khan, M. Yousaf, M. F. Tahir, Abdullah Qadoos, Mazhar Ali, Ahmad Raza
{"title":"Outliers Detection and Repairing Technique for Measurement Data in the Distribution System","authors":"Muhammad Ahmad Khan, M. Yousaf, M. F. Tahir, Abdullah Qadoos, Mazhar Ali, Ahmad Raza","doi":"10.1109/ETECTE55893.2022.10007343","DOIUrl":null,"url":null,"abstract":"In any power network, data perform a very critical role in the operation, management, and regulations of power systems. However, most of the data contain anomalies, which may have an impact on the outcomes of data-driven applications. Therefore, to avoid problems during operations it is very important to detect these outliers and remove them from the data. This research investigates the anomalies cleaning approach for measuring data in the distribution system in order to enhance data quality. This approach includes a set of association rule (AR) that are built automatically using past measuring data. This study demonstrates a data-mining approach based on a mix of density-based spatial clustering of applications with noise (DBSCAN) clustering and auto-generated association rules using historical data. Following that, a novel cost function based on Mahalanobis distance is developed and used for data restoration; this function describes the similarity between different data points. Finally, simulation results show that the suggested model outperforms existing detection and repair strategies. The evaluation section of this research demonstrates that as the number of historical data increases, so does the resilience of the suggested method.","PeriodicalId":131572,"journal":{"name":"2022 International Conference on Emerging Trends in Electrical, Control, and Telecommunication Engineering (ETECTE)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Emerging Trends in Electrical, Control, and Telecommunication Engineering (ETECTE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ETECTE55893.2022.10007343","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In any power network, data perform a very critical role in the operation, management, and regulations of power systems. However, most of the data contain anomalies, which may have an impact on the outcomes of data-driven applications. Therefore, to avoid problems during operations it is very important to detect these outliers and remove them from the data. This research investigates the anomalies cleaning approach for measuring data in the distribution system in order to enhance data quality. This approach includes a set of association rule (AR) that are built automatically using past measuring data. This study demonstrates a data-mining approach based on a mix of density-based spatial clustering of applications with noise (DBSCAN) clustering and auto-generated association rules using historical data. Following that, a novel cost function based on Mahalanobis distance is developed and used for data restoration; this function describes the similarity between different data points. Finally, simulation results show that the suggested model outperforms existing detection and repair strategies. The evaluation section of this research demonstrates that as the number of historical data increases, so does the resilience of the suggested method.
配电系统测量数据异常点检测与修复技术
在任何电网中,数据在电力系统的运行、管理和监管中都起着至关重要的作用。然而,大多数数据包含异常,这可能会对数据驱动应用程序的结果产生影响。因此,为了避免在操作过程中出现问题,检测这些异常值并将其从数据中移除是非常重要的。为了提高配电系统测量数据的质量,本文研究了测量数据的异常清理方法。该方法包括一组关联规则(AR),这些规则是使用过去的测量数据自动构建的。本研究展示了一种数据挖掘方法,该方法基于基于密度的噪声应用空间聚类(DBSCAN)聚类和使用历史数据自动生成关联规则的混合。在此基础上,提出了一种基于马氏距离的成本函数,并将其用于数据恢复;这个函数描述了不同数据点之间的相似性。最后,仿真结果表明,该模型优于现有的检测和修复策略。本研究的评价部分表明,随着历史数据数量的增加,所建议方法的弹性也随之增加。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信