Bo Zhang, Zesheng Xi, T. Zhang, Yuanyuan Ma, Zhipeng Shao, Hongfa Li
{"title":"电力系统海量I/O日志数据流降维研究","authors":"Bo Zhang, Zesheng Xi, T. Zhang, Yuanyuan Ma, Zhipeng Shao, Hongfa Li","doi":"10.1109/CISCE50729.2020.00045","DOIUrl":null,"url":null,"abstract":"Every day, the power system receives massive I/O logs. The amount of data in these logs is so large that it takes huge computational resources to analyze. Therefore, it is necessary to reduce the size of the massive I/O logs and only analyze the key log data, thereby reducing the workload of invalid analysis. This paper takes the I/O log of the substation as the research object, and studies the dimension reduction method of the massive I/O data flow log, which reduces the computational load brought by the high-dimensional I/O data flow log data and reduces the massive I/O data flow log. This paper proposes a method of secondary dimensionality reduction. Firstly, the high dimensional I/O log data stream is classified, so that the data is transformed from high-dimensional to low-dimensional. Then, the dimension is reduced again in each category, so that the most simplified massive I/O logs are achieved. Through theoretical analysis, we can come to the conclusion that the computational time complexity of the data after dimension reduction is reduced by more than 80%.","PeriodicalId":101777,"journal":{"name":"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Dimensionality Reduction of Massive I/O Log Data Flow in Power System\",\"authors\":\"Bo Zhang, Zesheng Xi, T. Zhang, Yuanyuan Ma, Zhipeng Shao, Hongfa Li\",\"doi\":\"10.1109/CISCE50729.2020.00045\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Every day, the power system receives massive I/O logs. The amount of data in these logs is so large that it takes huge computational resources to analyze. Therefore, it is necessary to reduce the size of the massive I/O logs and only analyze the key log data, thereby reducing the workload of invalid analysis. This paper takes the I/O log of the substation as the research object, and studies the dimension reduction method of the massive I/O data flow log, which reduces the computational load brought by the high-dimensional I/O data flow log data and reduces the massive I/O data flow log. This paper proposes a method of secondary dimensionality reduction. Firstly, the high dimensional I/O log data stream is classified, so that the data is transformed from high-dimensional to low-dimensional. Then, the dimension is reduced again in each category, so that the most simplified massive I/O logs are achieved. Through theoretical analysis, we can come to the conclusion that the computational time complexity of the data after dimension reduction is reduced by more than 80%.\",\"PeriodicalId\":101777,\"journal\":{\"name\":\"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CISCE50729.2020.00045\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Communications, Information System and Computer Engineering (CISCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISCE50729.2020.00045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dimensionality Reduction of Massive I/O Log Data Flow in Power System
Every day, the power system receives massive I/O logs. The amount of data in these logs is so large that it takes huge computational resources to analyze. Therefore, it is necessary to reduce the size of the massive I/O logs and only analyze the key log data, thereby reducing the workload of invalid analysis. This paper takes the I/O log of the substation as the research object, and studies the dimension reduction method of the massive I/O data flow log, which reduces the computational load brought by the high-dimensional I/O data flow log data and reduces the massive I/O data flow log. This paper proposes a method of secondary dimensionality reduction. Firstly, the high dimensional I/O log data stream is classified, so that the data is transformed from high-dimensional to low-dimensional. Then, the dimension is reduced again in each category, so that the most simplified massive I/O logs are achieved. Through theoretical analysis, we can come to the conclusion that the computational time complexity of the data after dimension reduction is reduced by more than 80%.