{"title":"Evaluation of Fault Detection Algorithms for Photovoltaic Array Using Distributed Machine Learning Platform","authors":"Apoorva Choumal, V. Yadav","doi":"10.1109/NPSC57038.2022.10069621","DOIUrl":null,"url":null,"abstract":"The protection of solar photovoltaic (PV) systems has become immensely important in the last several decades due to the exponential growth in the World’s PV power capacity. As a result, fault analysis in PV arrays has evolved into a crucial task for protecting PV modules from damage and minimizing the possibility of occupational hazards. However, PV systems are covered by standard protection mechanisms, although some faults with low mismatch and high fault impedance levels may go undetectable. It might be challenging to discern an anomaly from normal operation due to the often-minute changes in electrical signal magnitude caused by malfunctioning photovoltaic components during such faults. In such cases, data-driven machine learning methods give reliable detection and classification results. This paper presents a workflow to use a machine-learning library of a distributed computing framework, PySpark. PySpark, a Python API for Apache Spark, is a powerful computational engine that efficiently handles enormous data volumes. The key characteristics of I-V curves under various fault occurrences and standard conditions are extracted from a MATLAB simulation of the PV Module for fault detection on the dc side of the PV array. The ML library in PySpark is then used to examine these attributes and detect faults. A confusion matrix addressing soft accuracy, precision, recall, etc., is used in a comparative analysis of several classification methods.","PeriodicalId":162808,"journal":{"name":"2022 22nd National Power Systems Conference (NPSC)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 22nd National Power Systems Conference (NPSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NPSC57038.2022.10069621","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The protection of solar photovoltaic (PV) systems has become immensely important in the last several decades due to the exponential growth in the World’s PV power capacity. As a result, fault analysis in PV arrays has evolved into a crucial task for protecting PV modules from damage and minimizing the possibility of occupational hazards. However, PV systems are covered by standard protection mechanisms, although some faults with low mismatch and high fault impedance levels may go undetectable. It might be challenging to discern an anomaly from normal operation due to the often-minute changes in electrical signal magnitude caused by malfunctioning photovoltaic components during such faults. In such cases, data-driven machine learning methods give reliable detection and classification results. This paper presents a workflow to use a machine-learning library of a distributed computing framework, PySpark. PySpark, a Python API for Apache Spark, is a powerful computational engine that efficiently handles enormous data volumes. The key characteristics of I-V curves under various fault occurrences and standard conditions are extracted from a MATLAB simulation of the PV Module for fault detection on the dc side of the PV array. The ML library in PySpark is then used to examine these attributes and detect faults. A confusion matrix addressing soft accuracy, precision, recall, etc., is used in a comparative analysis of several classification methods.