短期负荷预测的机器学习模型比较分析

G. Patel, V. Kale, sudarshan khond
{"title":"短期负荷预测的机器学习模型比较分析","authors":"G. Patel, V. Kale, sudarshan khond","doi":"10.1109/OTCON56053.2023.10114042","DOIUrl":null,"url":null,"abstract":"Load forecasting is challenging owing to the impact of unforeseeable and fuzzy variables such as temperature, humidity, wind speed, and other parameters on load prediction. In this work, algorithms such as Support Vector Machine, Decision Tree, Random Forest, and Linear Regression are used for load forecasting are analyzed for their efficacy. Using the case of each algorithm for a particular load forecasting problem is discussed and general parameter settings for an algorithm to obtain optimal load forecasting are reported. Models are implemented using Jupyter Notebook with Python 3 programming language. Data is used on the Panama Power system from March 2015 to June 2020. This data is further divided into three sizes as small, medium, and large datasets with increasing feature numbers. SVR comes out to be the best algorithm for small datasets. For medium dataset MLR, SVR with holiday feature and RF with holiday performs the best. For large dataset with all features RF perform the best. But the SVR and MLR performs better throughout for all features. Also with temperature feature with large dataset a sudden spike in MAPE from datetime increases by 160.4% for one city temperature, 67.27% for two city temperatures and 25.7% for three city temperatures but when holiday used the same model gives the best results among all model, which shows the importance of holiday feature for SVR in large dataset.","PeriodicalId":265966,"journal":{"name":"2022 OPJU International Technology Conference on Emerging Technologies for Sustainable Development (OTCON)","volume":"345 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative Analysis of Machine Learning Models for Short-Term Load Forecasting\",\"authors\":\"G. Patel, V. Kale, sudarshan khond\",\"doi\":\"10.1109/OTCON56053.2023.10114042\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Load forecasting is challenging owing to the impact of unforeseeable and fuzzy variables such as temperature, humidity, wind speed, and other parameters on load prediction. In this work, algorithms such as Support Vector Machine, Decision Tree, Random Forest, and Linear Regression are used for load forecasting are analyzed for their efficacy. Using the case of each algorithm for a particular load forecasting problem is discussed and general parameter settings for an algorithm to obtain optimal load forecasting are reported. Models are implemented using Jupyter Notebook with Python 3 programming language. Data is used on the Panama Power system from March 2015 to June 2020. This data is further divided into three sizes as small, medium, and large datasets with increasing feature numbers. SVR comes out to be the best algorithm for small datasets. For medium dataset MLR, SVR with holiday feature and RF with holiday performs the best. For large dataset with all features RF perform the best. But the SVR and MLR performs better throughout for all features. Also with temperature feature with large dataset a sudden spike in MAPE from datetime increases by 160.4% for one city temperature, 67.27% for two city temperatures and 25.7% for three city temperatures but when holiday used the same model gives the best results among all model, which shows the importance of holiday feature for SVR in large dataset.\",\"PeriodicalId\":265966,\"journal\":{\"name\":\"2022 OPJU International Technology Conference on Emerging Technologies for Sustainable Development (OTCON)\",\"volume\":\"345 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 OPJU International Technology Conference on Emerging Technologies for Sustainable Development (OTCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/OTCON56053.2023.10114042\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 OPJU International Technology Conference on Emerging Technologies for Sustainable Development (OTCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/OTCON56053.2023.10114042","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

由于温度、湿度、风速等不可预见和模糊变量对负荷预测的影响,负荷预测具有挑战性。在这项工作中,分析了支持向量机、决策树、随机森林和线性回归等算法用于负荷预测的有效性。针对每种算法的具体负荷预测问题进行了讨论,并给出了实现最优负荷预测算法的一般参数设置。模型使用Jupyter Notebook和Python 3编程语言实现。数据用于2015年3月至2020年6月的巴拿马电力系统。随着特征数量的增加,这些数据进一步分为小、中、大三种大小的数据集。对于小数据集,SVR是最好的算法。对于中等数据集的MLR,带假日特征的SVR和带假日特征的RF表现最好。对于具有所有特征的大型数据集,RF表现最好。但SVR和MLR在所有功能上都表现得更好。同样,在大数据集的温度特征下,一个城市温度的MAPE从日期时间的突然峰值增加了160.4%,两个城市温度增加了67.27%,三个城市温度增加了25.7%,但当使用相同的模型时,假期在所有模型中给出了最好的结果,这表明假期特征对大数据集的SVR的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Comparative Analysis of Machine Learning Models for Short-Term Load Forecasting
Load forecasting is challenging owing to the impact of unforeseeable and fuzzy variables such as temperature, humidity, wind speed, and other parameters on load prediction. In this work, algorithms such as Support Vector Machine, Decision Tree, Random Forest, and Linear Regression are used for load forecasting are analyzed for their efficacy. Using the case of each algorithm for a particular load forecasting problem is discussed and general parameter settings for an algorithm to obtain optimal load forecasting are reported. Models are implemented using Jupyter Notebook with Python 3 programming language. Data is used on the Panama Power system from March 2015 to June 2020. This data is further divided into three sizes as small, medium, and large datasets with increasing feature numbers. SVR comes out to be the best algorithm for small datasets. For medium dataset MLR, SVR with holiday feature and RF with holiday performs the best. For large dataset with all features RF perform the best. But the SVR and MLR performs better throughout for all features. Also with temperature feature with large dataset a sudden spike in MAPE from datetime increases by 160.4% for one city temperature, 67.27% for two city temperatures and 25.7% for three city temperatures but when holiday used the same model gives the best results among all model, which shows the importance of holiday feature for SVR in large dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信