{"title":"Enhancing Partial Discharge Classification Through Augmented Fault Data Balancing","authors":"Saurabh Dutta;Shiyu Chen;Hazlee Azil Illias","doi":"10.1109/TDEI.2025.3585844","DOIUrl":null,"url":null,"abstract":"Partial discharge (PD) is a prevalent phenomenon in high-voltage (HV) equipment, and its accurate classification is crucial for ensuring the reliability of power systems. For in situ systems, different types of faults, such as corona, floating electrode, surface, and void discharge, exhibit varying occurrences, posing challenges to accurate classification. This research addresses the inherent issues of classification accuracy caused by unbalanced fault data. Employing z-score normalization and combined synthetic data generation using a random undersampling and synthetic minority oversampling technique (SMOTE) ensures a fair representation of different fault types, leading to more accurate classification results. Further, after applying grid-search to optimize the hyperparameters, k-nearest neighbor (KNN), random forest (RF), and gradient boosting (GB) have achieved accuracies of 98.43%, 95.29%, and 88.54% for balanced denoised, unbalanced denoised, and unbalanced noisy datasets, respectively. The presented results also demonstrate a significant statistical difference in classifier accuracies between the three datasets, as confirmed by the analysis of variance (ANOVA) test. This emphasizes the efficacy of balancing the denoised signal features for improved classification performance. The findings of this work contribute valuable insights into the optimization of PD classification models, paving the way for more reliable fault detection and classification in HV equipment.","PeriodicalId":13247,"journal":{"name":"IEEE Transactions on Dielectrics and Electrical Insulation","volume":"32 5","pages":"2948-2957"},"PeriodicalIF":3.1000,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Dielectrics and Electrical Insulation","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11071291/","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Partial discharge (PD) is a prevalent phenomenon in high-voltage (HV) equipment, and its accurate classification is crucial for ensuring the reliability of power systems. For in situ systems, different types of faults, such as corona, floating electrode, surface, and void discharge, exhibit varying occurrences, posing challenges to accurate classification. This research addresses the inherent issues of classification accuracy caused by unbalanced fault data. Employing z-score normalization and combined synthetic data generation using a random undersampling and synthetic minority oversampling technique (SMOTE) ensures a fair representation of different fault types, leading to more accurate classification results. Further, after applying grid-search to optimize the hyperparameters, k-nearest neighbor (KNN), random forest (RF), and gradient boosting (GB) have achieved accuracies of 98.43%, 95.29%, and 88.54% for balanced denoised, unbalanced denoised, and unbalanced noisy datasets, respectively. The presented results also demonstrate a significant statistical difference in classifier accuracies between the three datasets, as confirmed by the analysis of variance (ANOVA) test. This emphasizes the efficacy of balancing the denoised signal features for improved classification performance. The findings of this work contribute valuable insights into the optimization of PD classification models, paving the way for more reliable fault detection and classification in HV equipment.
期刊介绍:
Topics that are concerned with dielectric phenomena and measurements, with development and characterization of gaseous, vacuum, liquid and solid electrical insulating materials and systems; and with utilization of these materials in circuits and systems under condition of use.