Chibueze D. Ukwuoma , Dongsheng Cai , Chiagoziem C. Ukwuoma , Chinedu I. Otuka , Qi Huang
{"title":"输电系统电气故障检测和分类的数据驱动模型的比较分析:可解释性、适用性和工业意义","authors":"Chibueze D. Ukwuoma , Dongsheng Cai , Chiagoziem C. Ukwuoma , Chinedu I. Otuka , Qi Huang","doi":"10.1016/j.aej.2025.04.097","DOIUrl":null,"url":null,"abstract":"<div><div>Most data-driven fault detection methods often face challenges in accuracy, adaptability, and real-time implementation, particularly in complex transmission networks. To address these issues, this study presents an in-depth comparative analysis of data-driven models, including machine learning, neural networks, and deep learning techniques, for detecting and classifying electrical faults in transmission lines. A few key gaps were identified to persist particularly in terms of the accuracy, computational complexity, explainability, applicability, and industrial implications of these models. Hence, a Quadratic Discriminant Analysis by Projection (QDA-P) model which effectively captures non-linear relationships between electrical fault features is proposed, enhancing classification accuracy, less computationally complex and explainable for both binary and multi-class scenarios. Using a publicly available simulated dataset generated through MATLAB Simulink to represent various fault types, the QDA-P model achieved binary classification scores of 0.988, 0.980, 0.982, 0.995, and 0.987 while recording a multi-class classification score of 0.982, 0.979, 0.982, 0.982, and 0.980 for accuracy, precision, specificity, recall, and F1-score, respectively. Feature importance analysis showed that voltage at the sending or receiving end of the transmission line for phase A was the most influential while the length of the transmission line in phase A had the lowest importance. In contrast, techniques such as SHAP, LIME, and PDP reveal that the length of the transmission line in Phases A & C significantly influenced Class 1 and Class 5 predictions, with the length of the transmission line in Phase C contributing positively and the length of the transmission line in phase A showing varied effects. An industrial applicability analysis using the Mahalanobis distance plot confirmed that the QDA-P model effectively captured underlying data patterns without significant outliers. These findings highlight the model's potential to improve fault detection accuracy and reliability, contributing to more efficient and secure energy transmission infrastructures.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"127 ","pages":"Pages 75-91"},"PeriodicalIF":6.2000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative analysis of data-driven models on detection and classification of electrical faults in transmission systems: Explainability, applicability and industrial implications\",\"authors\":\"Chibueze D. Ukwuoma , Dongsheng Cai , Chiagoziem C. Ukwuoma , Chinedu I. Otuka , Qi Huang\",\"doi\":\"10.1016/j.aej.2025.04.097\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Most data-driven fault detection methods often face challenges in accuracy, adaptability, and real-time implementation, particularly in complex transmission networks. To address these issues, this study presents an in-depth comparative analysis of data-driven models, including machine learning, neural networks, and deep learning techniques, for detecting and classifying electrical faults in transmission lines. A few key gaps were identified to persist particularly in terms of the accuracy, computational complexity, explainability, applicability, and industrial implications of these models. Hence, a Quadratic Discriminant Analysis by Projection (QDA-P) model which effectively captures non-linear relationships between electrical fault features is proposed, enhancing classification accuracy, less computationally complex and explainable for both binary and multi-class scenarios. Using a publicly available simulated dataset generated through MATLAB Simulink to represent various fault types, the QDA-P model achieved binary classification scores of 0.988, 0.980, 0.982, 0.995, and 0.987 while recording a multi-class classification score of 0.982, 0.979, 0.982, 0.982, and 0.980 for accuracy, precision, specificity, recall, and F1-score, respectively. Feature importance analysis showed that voltage at the sending or receiving end of the transmission line for phase A was the most influential while the length of the transmission line in phase A had the lowest importance. In contrast, techniques such as SHAP, LIME, and PDP reveal that the length of the transmission line in Phases A & C significantly influenced Class 1 and Class 5 predictions, with the length of the transmission line in Phase C contributing positively and the length of the transmission line in phase A showing varied effects. An industrial applicability analysis using the Mahalanobis distance plot confirmed that the QDA-P model effectively captured underlying data patterns without significant outliers. These findings highlight the model's potential to improve fault detection accuracy and reliability, contributing to more efficient and secure energy transmission infrastructures.</div></div>\",\"PeriodicalId\":7484,\"journal\":{\"name\":\"alexandria engineering journal\",\"volume\":\"127 \",\"pages\":\"Pages 75-91\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-05-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"alexandria engineering journal\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1110016825005848\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"alexandria engineering journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110016825005848","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
大多数数据驱动的故障检测方法往往面临准确性、适应性和实时性方面的挑战,特别是在复杂的输电网络中。为了解决这些问题,本研究对数据驱动模型进行了深入的比较分析,包括机器学习、神经网络和深度学习技术,用于检测和分类输电线路中的电气故障。在这些模型的准确性、计算复杂性、可解释性、适用性和工业意义方面,存在一些关键的差距。为此,提出了一种二次判别分析(Quadratic Discriminant Analysis by Projection, QDA-P)模型,该模型能有效地捕捉电气故障特征之间的非线性关系,提高了分类精度,减少了计算复杂度,并且可用于二类和多类场景。利用MATLAB Simulink生成的公开仿真数据集表示各种故障类型,QDA-P模型的二分类得分分别为0.988、0.980、0.982、0.995和0.987,多分类得分分别为0.982、0.979、0.982、0.982和0.980,准确率、精密度、特异性、召回率和f1得分分别为0.982、0.979、0.982、0.982和0.980。特征重要性分析表明,A相输电线路的发送端或接收端电压影响最大,而A相输电线路的长度影响最小。相比之下,诸如SHAP、LIME和PDP等技术揭示了A相输电线路的长度;C显著影响1类和5类预测,C相输电线路长度正向贡献,A相输电线路长度表现出不同的影响。使用马氏距离图的工业适用性分析证实,QDA-P模型有效地捕获了底层数据模式,没有显著的异常值。这些发现突出了该模型在提高故障检测准确性和可靠性方面的潜力,有助于提高能源传输基础设施的效率和安全性。
Comparative analysis of data-driven models on detection and classification of electrical faults in transmission systems: Explainability, applicability and industrial implications
Most data-driven fault detection methods often face challenges in accuracy, adaptability, and real-time implementation, particularly in complex transmission networks. To address these issues, this study presents an in-depth comparative analysis of data-driven models, including machine learning, neural networks, and deep learning techniques, for detecting and classifying electrical faults in transmission lines. A few key gaps were identified to persist particularly in terms of the accuracy, computational complexity, explainability, applicability, and industrial implications of these models. Hence, a Quadratic Discriminant Analysis by Projection (QDA-P) model which effectively captures non-linear relationships between electrical fault features is proposed, enhancing classification accuracy, less computationally complex and explainable for both binary and multi-class scenarios. Using a publicly available simulated dataset generated through MATLAB Simulink to represent various fault types, the QDA-P model achieved binary classification scores of 0.988, 0.980, 0.982, 0.995, and 0.987 while recording a multi-class classification score of 0.982, 0.979, 0.982, 0.982, and 0.980 for accuracy, precision, specificity, recall, and F1-score, respectively. Feature importance analysis showed that voltage at the sending or receiving end of the transmission line for phase A was the most influential while the length of the transmission line in phase A had the lowest importance. In contrast, techniques such as SHAP, LIME, and PDP reveal that the length of the transmission line in Phases A & C significantly influenced Class 1 and Class 5 predictions, with the length of the transmission line in Phase C contributing positively and the length of the transmission line in phase A showing varied effects. An industrial applicability analysis using the Mahalanobis distance plot confirmed that the QDA-P model effectively captured underlying data patterns without significant outliers. These findings highlight the model's potential to improve fault detection accuracy and reliability, contributing to more efficient and secure energy transmission infrastructures.
期刊介绍:
Alexandria Engineering Journal is an international journal devoted to publishing high quality papers in the field of engineering and applied science. Alexandria Engineering Journal is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). The papers published in Alexandria Engineering Journal are grouped into five sections, according to the following classification:
• Mechanical, Production, Marine and Textile Engineering
• Electrical Engineering, Computer Science and Nuclear Engineering
• Civil and Architecture Engineering
• Chemical Engineering and Applied Sciences
• Environmental Engineering