{"title":"Clustering and Cleaning Method based on Practical Fault Data in Distribution Network","authors":"Jiekai Zhang, Xiao Ma, Liang Ding, Zhan-gang Yang","doi":"10.1109/CISCE58541.2023.10142572","DOIUrl":null,"url":null,"abstract":"The issue of large amounts of interference, errors, and invalid waveforms in the electrical parameter data collected after DTU (Data Terminal Unit) devices generate false remote control signals poses a huge threat to the safe and stable operation of the power grid. This paper proposes a clustering cleaning method based on Stacked Sparse Autoencoders (SSAE) to clean up faulty data. By using SSAE to extract features and reduce dimensionality of the waveform data uploaded by DTU devices, the clustering by fast search and find of density peaks (CFSFDP) algorithm is then used to cluster and clean up erroneous and faulty data. The network parameters are reasonably adjusted to ensure a high correct cleaning rate while maintaining a low erroneous cleaning rate. Comparative analysis proves the superiority of the CFSFDP algorithm in isolating interference, errors, and invalid waveforms. Effective filtering of erroneous and faulty data is achieved through pre-cleaning and complete cleaning, which improves the quality of the database data and provides technical support for subsequent data mining research and effective support for accurate fault diagnosis.","PeriodicalId":145263,"journal":{"name":"2023 5th International Conference on Communications, Information System and Computer Engineering (CISCE)","volume":"110 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 5th International Conference on Communications, Information System and Computer Engineering (CISCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISCE58541.2023.10142572","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The issue of large amounts of interference, errors, and invalid waveforms in the electrical parameter data collected after DTU (Data Terminal Unit) devices generate false remote control signals poses a huge threat to the safe and stable operation of the power grid. This paper proposes a clustering cleaning method based on Stacked Sparse Autoencoders (SSAE) to clean up faulty data. By using SSAE to extract features and reduce dimensionality of the waveform data uploaded by DTU devices, the clustering by fast search and find of density peaks (CFSFDP) algorithm is then used to cluster and clean up erroneous and faulty data. The network parameters are reasonably adjusted to ensure a high correct cleaning rate while maintaining a low erroneous cleaning rate. Comparative analysis proves the superiority of the CFSFDP algorithm in isolating interference, errors, and invalid waveforms. Effective filtering of erroneous and faulty data is achieved through pre-cleaning and complete cleaning, which improves the quality of the database data and provides technical support for subsequent data mining research and effective support for accurate fault diagnosis.