Application of Ensemble Learning with Mean Shift Clustering for Output Profile Classification and Anomaly Detection in Energy Production of Grid-Tied Photovoltaic System
Justin D. de Guia, Ronnie S. Concepcion, Hilario A. Calinao, Sandy C. Lauguico, E. Dadios, R. R. Vicerra
{"title":"Application of Ensemble Learning with Mean Shift Clustering for Output Profile Classification and Anomaly Detection in Energy Production of Grid-Tied Photovoltaic System","authors":"Justin D. de Guia, Ronnie S. Concepcion, Hilario A. Calinao, Sandy C. Lauguico, E. Dadios, R. R. Vicerra","doi":"10.1109/ICITEE49829.2020.9271699","DOIUrl":null,"url":null,"abstract":"Fault detection and monitoring system in photovoltaic (PV) energy management system is important in achieving its optimal performance. An effective diagnostic system involves correct analysis of electrical parameters of a PV array on a given weather condition. In the study, mean-shift clustering was applied for pre-classification and anomaly detection of time-series data of electrical parameters from grid-tied inverter, and solar-irradiance. Classification and anomaly detection applied is based in ensemble learning, where its base learners are based from multilayer perceptron. A stacking ensemble is used in classification of energy production profile while bagging ensemble is used detecting anomalous trend in time-series data. A stacking ensemble got a highest accuracy value of 94% compared to single classifiers which have accuracy value of 85.25%, 84.14%, and 63.4%, respectively. The bagging ensemble autoencoders have the lowest mean squared error during model reconstruction compared to single autoencoder. It has a fair performance in classifying anomaly points from normal datapoints, having an AUC value of 0.795 and F1-score of 0.71, given that the hyperparameter is 0.5. Overall, ensemble learners improve the performance in classification and detection tasks.","PeriodicalId":245013,"journal":{"name":"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEE49829.2020.9271699","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Fault detection and monitoring system in photovoltaic (PV) energy management system is important in achieving its optimal performance. An effective diagnostic system involves correct analysis of electrical parameters of a PV array on a given weather condition. In the study, mean-shift clustering was applied for pre-classification and anomaly detection of time-series data of electrical parameters from grid-tied inverter, and solar-irradiance. Classification and anomaly detection applied is based in ensemble learning, where its base learners are based from multilayer perceptron. A stacking ensemble is used in classification of energy production profile while bagging ensemble is used detecting anomalous trend in time-series data. A stacking ensemble got a highest accuracy value of 94% compared to single classifiers which have accuracy value of 85.25%, 84.14%, and 63.4%, respectively. The bagging ensemble autoencoders have the lowest mean squared error during model reconstruction compared to single autoencoder. It has a fair performance in classifying anomaly points from normal datapoints, having an AUC value of 0.795 and F1-score of 0.71, given that the hyperparameter is 0.5. Overall, ensemble learners improve the performance in classification and detection tasks.