Performance analysis of dimensionality reduction techniques for demand side management

Ahmed Aleshinloye, Abdul Bais, I. Al-Anbagi
{"title":"Performance analysis of dimensionality reduction techniques for demand side management","authors":"Ahmed Aleshinloye, Abdul Bais, I. Al-Anbagi","doi":"10.1109/EPEC.2017.8286232","DOIUrl":null,"url":null,"abstract":"The advancement of the electric grid has led to tremendous growth in data generated from the installed sensors. Smart meters measure the electric energy usage of a consumer, transmit the measured data to the utility and receive pricing information. This requires a two way communication between the utility and the end user. With the projected increase in the number of deployed smart meters, utilities would be facing challenges in handling huge quantities of data, referred to as Big Data. For the analysis of the large data to be tractable, we need to extract important lower dimensional features from raw measurements. In this paper we critically analyze dimensionality reduction of smart meter data for smart grid applications. We compare performance of two dimensionality reduction techniques, Random Projection and Principal Component Analysis, on projecting smart meters data onto a linear subspace of reduced dimensions. We compute the Euclidean distance between pair of data samples in the original and reduced dimensions and obtained the mean and standard deviation of the relative error. Additionally, we cluster the users using the original data and after applying dimensionality reduction. The sum of square error (SSE), distance between datapoints and the centroid in a given cluster, is used to compare the clustering performance of the two techniques.","PeriodicalId":141250,"journal":{"name":"2017 IEEE Electrical Power and Energy Conference (EPEC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Electrical Power and Energy Conference (EPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EPEC.2017.8286232","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

The advancement of the electric grid has led to tremendous growth in data generated from the installed sensors. Smart meters measure the electric energy usage of a consumer, transmit the measured data to the utility and receive pricing information. This requires a two way communication between the utility and the end user. With the projected increase in the number of deployed smart meters, utilities would be facing challenges in handling huge quantities of data, referred to as Big Data. For the analysis of the large data to be tractable, we need to extract important lower dimensional features from raw measurements. In this paper we critically analyze dimensionality reduction of smart meter data for smart grid applications. We compare performance of two dimensionality reduction techniques, Random Projection and Principal Component Analysis, on projecting smart meters data onto a linear subspace of reduced dimensions. We compute the Euclidean distance between pair of data samples in the original and reduced dimensions and obtained the mean and standard deviation of the relative error. Additionally, we cluster the users using the original data and after applying dimensionality reduction. The sum of square error (SSE), distance between datapoints and the centroid in a given cluster, is used to compare the clustering performance of the two techniques.
需求侧管理降维技术的性能分析
电网的进步导致安装的传感器产生的数据急剧增长。智能电表测量消费者的电能使用情况,将测量数据传输给公用事业公司并接收价格信息。这需要在实用程序和最终用户之间进行双向通信。随着智能电表部署数量的预计增加,公用事业将面临处理大量数据(即大数据)的挑战。为了使大数据分析易于处理,我们需要从原始测量中提取重要的低维特征。在本文中,我们批判性地分析了智能电网应用中智能电表数据的降维。我们比较了两种降维技术的性能,随机投影和主成分分析,将智能电表数据投影到降维的线性子空间上。计算原始维数和降维数对数据样本之间的欧氏距离,得到相对误差的均值和标准差。此外,我们使用原始数据和应用降维后对用户进行聚类。使用误差平方和(SSE),即给定聚类中数据点与质心之间的距离,来比较两种技术的聚类性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信