DIC2FBA: Distributed Incremental Clustering with Closeness Factor Based Algorithm for Analysis of Smart Meter Data

A. Chaudhari, Preeti Mulay, Ayushi Agarwal, Krithika Iyer, Saloni Sarbhai
{"title":"DIC2FBA: Distributed Incremental Clustering with Closeness Factor Based Algorithm for Analysis of Smart Meter Data","authors":"A. Chaudhari, Preeti Mulay, Ayushi Agarwal, Krithika Iyer, Saloni Sarbhai","doi":"10.12785/ijcds/160103","DOIUrl":null,"url":null,"abstract":": Due to increased civilization, smart cities, and the advent of technology, lots of buildings including commercials, residential, and other types are populating in numbers in the recent past. The electricity consumption is also a ff ecting due to increased occupancy in these buildings. The analysis of the electricity consumption patterns will be helpful for consumers and electricity generation units to know about consumption and future requirements of electricity. As per the literature, the Incremental clustering algorithm is the best choice to handle ever-increasing data. In this research work, in the first phase, the electricity consumption data was extracted from smart meter images, and then in the second phase, the data was taken from extracted .csv files merging data from various sources together. This research proposes Distributed Incremental Clustering with Closeness Factor Based Algorithm (DIC2FBA) to update load patterns without overall daily load curve clustering. The proposed DIC2FBA has used Amazon Web Service(AWS) and Microsoft Azure HDInsight service. The AWS EC2 instance, along with the AWS S3 bucket and HdInsight, operates by clustering data from numerous sites using an iterative and incremental approach. The DIC2FBA first extracts load patterns from new data and then intergrades the existing load patterns with the new ones. Further, we have compared the findings achieved using the DIC2FBA with IK means and NFICA based on time, features, silhouette score, and the Davis Bouldin index, which indicate that our method can provide an e ffi cient response for electricity consumption patterns analysis to end consumers via smart meters.","PeriodicalId":37180,"journal":{"name":"International Journal of Computing and Digital Systems","volume":"4 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computing and Digital Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12785/ijcds/160103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

: Due to increased civilization, smart cities, and the advent of technology, lots of buildings including commercials, residential, and other types are populating in numbers in the recent past. The electricity consumption is also a ff ecting due to increased occupancy in these buildings. The analysis of the electricity consumption patterns will be helpful for consumers and electricity generation units to know about consumption and future requirements of electricity. As per the literature, the Incremental clustering algorithm is the best choice to handle ever-increasing data. In this research work, in the first phase, the electricity consumption data was extracted from smart meter images, and then in the second phase, the data was taken from extracted .csv files merging data from various sources together. This research proposes Distributed Incremental Clustering with Closeness Factor Based Algorithm (DIC2FBA) to update load patterns without overall daily load curve clustering. The proposed DIC2FBA has used Amazon Web Service(AWS) and Microsoft Azure HDInsight service. The AWS EC2 instance, along with the AWS S3 bucket and HdInsight, operates by clustering data from numerous sites using an iterative and incremental approach. The DIC2FBA first extracts load patterns from new data and then intergrades the existing load patterns with the new ones. Further, we have compared the findings achieved using the DIC2FBA with IK means and NFICA based on time, features, silhouette score, and the Davis Bouldin index, which indicate that our method can provide an e ffi cient response for electricity consumption patterns analysis to end consumers via smart meters.
DIC2FBA:基于邻近因子的分布式增量聚类算法,用于分析智能电表数据
:由于文明程度的提高、智能城市的发展和技术的进步,近期大量商业、住宅和其他类型的建筑物不断涌现。由于这些建筑的入住率增加,用电量也在不断增加。对用电模式的分析将有助于消费者和发电单位了解用电量和未来的用电需求。根据文献,增量聚类算法是处理不断增加的数据的最佳选择。在这项研究工作中,第一阶段是从智能电表图像中提取用电数据,然后在第二阶段从提取的 .csv 文件中获取数据,将不同来源的数据合并在一起。本研究提出了基于邻近因子的分布式增量聚类算法(DIC2FBA),用于更新负荷模式,而无需对整体日负荷曲线进行聚类。拟议的 DIC2FBA 使用了亚马逊网络服务(AWS)和微软 Azure HDInsight 服务。AWS EC2实例以及AWS S3桶和HdInsight通过使用迭代和增量方法对来自众多站点的数据进行聚类来运行。DIC2FBA 首先从新数据中提取负载模式,然后将现有负载模式与新负载模式相互融合。此外,我们还将 DIC2FBA 与基于时间、特征、剪影得分和 Davis Bouldin 指数的 IK means 和 NFICA 进行了比较,结果表明,我们的方法可以为通过智能电表进行用电模式分析的终端用户提供有效的响应。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Computing and Digital Systems
International Journal of Computing and Digital Systems Business, Management and Accounting-Management of Technology and Innovation
CiteScore
1.70
自引率
0.00%
发文量
111
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信