{"title":"利用机器学习诊断变压器故障:SHAP 特征选择与 LGBM 智能优化相结合的方法","authors":"Cheng Liu, Weiming Yang","doi":"10.1186/s42162-025-00519-3","DOIUrl":null,"url":null,"abstract":"<div><p>This paper proposes a novel approach for transformer fault diagnosis. Initially, a high-dimensional feature set comprising 19 features related to five gas concentrations is constructed to reflect the gas-fault relationship. Subsequently, the Shapley Additive Explanations (SHAP) method is employed to evaluate feature importance and select a subset that significantly influences model predictions, thereby simplifying the model and enhancing its interpretability. Following this, the bald eagle search (BES) intelligent optimization algorithm is utilized to optimize the hyperparameters of the light gradient boosting machine (LGBM) model, further improving its predictive capability. Comparative experiments with various traditional machine learning models validate the effectiveness of the proposed method. The SHAP-BES-LGBM model achieves the highest accuracy of 0.9509 and an f1 score of 0.9606 on the test set, with only 11 samples misclassified, demonstrating superior classification performance and underscoring the advantages of this integrated approach in transformer fault diagnosis.</p></div>","PeriodicalId":538,"journal":{"name":"Energy Informatics","volume":"8 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://energyinformatics.springeropen.com/counter/pdf/10.1186/s42162-025-00519-3","citationCount":"0","resultStr":"{\"title\":\"Transformer fault diagnosis using machine learning: a method combining SHAP feature selection and intelligent optimization of LGBM\",\"authors\":\"Cheng Liu, Weiming Yang\",\"doi\":\"10.1186/s42162-025-00519-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This paper proposes a novel approach for transformer fault diagnosis. Initially, a high-dimensional feature set comprising 19 features related to five gas concentrations is constructed to reflect the gas-fault relationship. Subsequently, the Shapley Additive Explanations (SHAP) method is employed to evaluate feature importance and select a subset that significantly influences model predictions, thereby simplifying the model and enhancing its interpretability. Following this, the bald eagle search (BES) intelligent optimization algorithm is utilized to optimize the hyperparameters of the light gradient boosting machine (LGBM) model, further improving its predictive capability. Comparative experiments with various traditional machine learning models validate the effectiveness of the proposed method. The SHAP-BES-LGBM model achieves the highest accuracy of 0.9509 and an f1 score of 0.9606 on the test set, with only 11 samples misclassified, demonstrating superior classification performance and underscoring the advantages of this integrated approach in transformer fault diagnosis.</p></div>\",\"PeriodicalId\":538,\"journal\":{\"name\":\"Energy Informatics\",\"volume\":\"8 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://energyinformatics.springeropen.com/counter/pdf/10.1186/s42162-025-00519-3\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Energy Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1186/s42162-025-00519-3\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Energy\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy Informatics","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1186/s42162-025-00519-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Energy","Score":null,"Total":0}
Transformer fault diagnosis using machine learning: a method combining SHAP feature selection and intelligent optimization of LGBM
This paper proposes a novel approach for transformer fault diagnosis. Initially, a high-dimensional feature set comprising 19 features related to five gas concentrations is constructed to reflect the gas-fault relationship. Subsequently, the Shapley Additive Explanations (SHAP) method is employed to evaluate feature importance and select a subset that significantly influences model predictions, thereby simplifying the model and enhancing its interpretability. Following this, the bald eagle search (BES) intelligent optimization algorithm is utilized to optimize the hyperparameters of the light gradient boosting machine (LGBM) model, further improving its predictive capability. Comparative experiments with various traditional machine learning models validate the effectiveness of the proposed method. The SHAP-BES-LGBM model achieves the highest accuracy of 0.9509 and an f1 score of 0.9606 on the test set, with only 11 samples misclassified, demonstrating superior classification performance and underscoring the advantages of this integrated approach in transformer fault diagnosis.