Nanthiine Nair Ravi, Sulfeeza Mohd Drus, P. S. Krishnan, Nur Laila Abdul Ghani
{"title":"基于文本挖掘的变电站变压器故障分析","authors":"Nanthiine Nair Ravi, Sulfeeza Mohd Drus, P. S. Krishnan, Nur Laila Abdul Ghani","doi":"10.1109/ISCAIE.2019.8743719","DOIUrl":null,"url":null,"abstract":"Transformer failure could occur in terms of tripping that results in an unplanned or unseen outage. A good maintenance strategy is therefore an essential component in a power system to prevent unexpected failures. In this paper, the causes of transformer failure within the power transformer systems have been reviewed. Data is obtained from the transmission substation assets from the whole of Peninsular Malaysia for the past 5 years. However, the challenge is that the problem descriptions of the datasets are all in text formats. Thus, text mining approach is chosen for the data analysis using R. This paper covers the most common steps in R, from data preparation to analysis, and visualization through wordcloud generation. This study mainly focuses on bag-of-word text analysis approaches, which means that only word frequencies per text are used and word positions are ignored. Although this simplifies text content dramatically, research and many applications in the real world show that word frequencies alone contain adequate information for many types of analysis. As a result of analysis, keywords like \"leak\", \"lightning\", \"animal\", \"cable\" and \"temperature\" are identified as the main causes of transformer failures based on the number of word frequency in the tripping dataset. Further enhancement could be made in the future to predict the failure beforehand using predictive analytics approaches.","PeriodicalId":369098,"journal":{"name":"2019 IEEE 9th Symposium on Computer Applications & Industrial Electronics (ISCAIE)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Substation Transformer Failure Analysis Through Text Mining\",\"authors\":\"Nanthiine Nair Ravi, Sulfeeza Mohd Drus, P. S. Krishnan, Nur Laila Abdul Ghani\",\"doi\":\"10.1109/ISCAIE.2019.8743719\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Transformer failure could occur in terms of tripping that results in an unplanned or unseen outage. A good maintenance strategy is therefore an essential component in a power system to prevent unexpected failures. In this paper, the causes of transformer failure within the power transformer systems have been reviewed. Data is obtained from the transmission substation assets from the whole of Peninsular Malaysia for the past 5 years. However, the challenge is that the problem descriptions of the datasets are all in text formats. Thus, text mining approach is chosen for the data analysis using R. This paper covers the most common steps in R, from data preparation to analysis, and visualization through wordcloud generation. This study mainly focuses on bag-of-word text analysis approaches, which means that only word frequencies per text are used and word positions are ignored. Although this simplifies text content dramatically, research and many applications in the real world show that word frequencies alone contain adequate information for many types of analysis. As a result of analysis, keywords like \\\"leak\\\", \\\"lightning\\\", \\\"animal\\\", \\\"cable\\\" and \\\"temperature\\\" are identified as the main causes of transformer failures based on the number of word frequency in the tripping dataset. Further enhancement could be made in the future to predict the failure beforehand using predictive analytics approaches.\",\"PeriodicalId\":369098,\"journal\":{\"name\":\"2019 IEEE 9th Symposium on Computer Applications & Industrial Electronics (ISCAIE)\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 9th Symposium on Computer Applications & Industrial Electronics (ISCAIE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCAIE.2019.8743719\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 9th Symposium on Computer Applications & Industrial Electronics (ISCAIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCAIE.2019.8743719","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Substation Transformer Failure Analysis Through Text Mining
Transformer failure could occur in terms of tripping that results in an unplanned or unseen outage. A good maintenance strategy is therefore an essential component in a power system to prevent unexpected failures. In this paper, the causes of transformer failure within the power transformer systems have been reviewed. Data is obtained from the transmission substation assets from the whole of Peninsular Malaysia for the past 5 years. However, the challenge is that the problem descriptions of the datasets are all in text formats. Thus, text mining approach is chosen for the data analysis using R. This paper covers the most common steps in R, from data preparation to analysis, and visualization through wordcloud generation. This study mainly focuses on bag-of-word text analysis approaches, which means that only word frequencies per text are used and word positions are ignored. Although this simplifies text content dramatically, research and many applications in the real world show that word frequencies alone contain adequate information for many types of analysis. As a result of analysis, keywords like "leak", "lightning", "animal", "cable" and "temperature" are identified as the main causes of transformer failures based on the number of word frequency in the tripping dataset. Further enhancement could be made in the future to predict the failure beforehand using predictive analytics approaches.