{"title":"Data-centric explainable artificial intelligence techniques for cyber-attack detection in microgrid networks","authors":"Rohit Trivedi, Sandipan Patra, Shafi Khadem","doi":"10.1016/j.egyr.2024.11.075","DOIUrl":null,"url":null,"abstract":"<div><div>This article presents a strategy for pre-processing data and interpreting the performance of advanced machine learning (ML) models to improve data quality and modify the models based on identified problems during the iterative development of classification models. The process consists of three main stages aimed at maximizing the performance of ML models. Stage-1 involves data collection and preprocessing. Stage-2 focuses on improving data quality and extracting features. The extracted features are further refined using hyperparameters in Stage-3. Additionally, the framework utilizes SHapley Additive exPlanations (SHAP) for explainable artificial intelligence (XAI) to elucidate the learning process of ML models and, therefore, make the model's behavior interpretable. As an example, this innovative process has been applied to cyber-attack detection in microgrid networks. In Stages-1, a CIGRE low-voltage microgrid network is simulated for data collection during cyber-attacks, followed by data pre-processing. In Stage-2, data augmentation is performed using SMOTE and ENN, and feature extraction is carried out using the Boruta Python package. In Stage-3, hyperparameters are tuned using the TPE algorithm. The results demonstrate the effectiveness of this approach, enabling the model to progressively enhance its predictive capacity through each stage.</div></div>","PeriodicalId":11798,"journal":{"name":"Energy Reports","volume":"13 ","pages":"Pages 217-229"},"PeriodicalIF":4.7000,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy Reports","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352484724007960","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
This article presents a strategy for pre-processing data and interpreting the performance of advanced machine learning (ML) models to improve data quality and modify the models based on identified problems during the iterative development of classification models. The process consists of three main stages aimed at maximizing the performance of ML models. Stage-1 involves data collection and preprocessing. Stage-2 focuses on improving data quality and extracting features. The extracted features are further refined using hyperparameters in Stage-3. Additionally, the framework utilizes SHapley Additive exPlanations (SHAP) for explainable artificial intelligence (XAI) to elucidate the learning process of ML models and, therefore, make the model's behavior interpretable. As an example, this innovative process has been applied to cyber-attack detection in microgrid networks. In Stages-1, a CIGRE low-voltage microgrid network is simulated for data collection during cyber-attacks, followed by data pre-processing. In Stage-2, data augmentation is performed using SMOTE and ENN, and feature extraction is carried out using the Boruta Python package. In Stage-3, hyperparameters are tuned using the TPE algorithm. The results demonstrate the effectiveness of this approach, enabling the model to progressively enhance its predictive capacity through each stage.
期刊介绍:
Energy Reports is a new online multidisciplinary open access journal which focuses on publishing new research in the area of Energy with a rapid review and publication time. Energy Reports will be open to direct submissions and also to submissions from other Elsevier Energy journals, whose Editors have determined that Energy Reports would be a better fit.