{"title":"支持大规模云存储系统多粒度数据融合的故障预测方法","authors":"Yongyang Cheng, T. Zhang, Jing Luo","doi":"10.1145/3569966.3570119","DOIUrl":null,"url":null,"abstract":"With the development of cloud computing and cloud storage technology, the data scale has grown rapidly. In order to store and process large-scale data, there are thousands of nodes and devices in the cloud storage center, resulting in a surge in the frequency of failures. In various types of failure events, storage device failure is the most important one. However, most cloud storage systems lack disk failure prediction mechanisms and could only replace disks after disk failures. It is particularly important to predict the potential risks in the system operation environment. In this paper, we propose a disk failure prediction approach that supports multi granularity data fusion, which solves problems of unbalanced samples, single data source, cross scenario model migration and insufficient generalization ability of prediction models in disk failure prediction. Through our proposed approach, the cloud storage system could accurately predict disk failures and actively push prediction results to users, so as to improve the pertinence and planning of the operation and maintenance work. The approach presented in this paper has been validated to be valid through a series of qualitative and quantitative experiments.","PeriodicalId":145580,"journal":{"name":"Proceedings of the 5th International Conference on Computer Science and Software Engineering","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Failure Prediction Approach Supporting Multi Granularity Data Fusion for Large-scale Cloud Storage Systems\",\"authors\":\"Yongyang Cheng, T. Zhang, Jing Luo\",\"doi\":\"10.1145/3569966.3570119\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of cloud computing and cloud storage technology, the data scale has grown rapidly. In order to store and process large-scale data, there are thousands of nodes and devices in the cloud storage center, resulting in a surge in the frequency of failures. In various types of failure events, storage device failure is the most important one. However, most cloud storage systems lack disk failure prediction mechanisms and could only replace disks after disk failures. It is particularly important to predict the potential risks in the system operation environment. In this paper, we propose a disk failure prediction approach that supports multi granularity data fusion, which solves problems of unbalanced samples, single data source, cross scenario model migration and insufficient generalization ability of prediction models in disk failure prediction. Through our proposed approach, the cloud storage system could accurately predict disk failures and actively push prediction results to users, so as to improve the pertinence and planning of the operation and maintenance work. The approach presented in this paper has been validated to be valid through a series of qualitative and quantitative experiments.\",\"PeriodicalId\":145580,\"journal\":{\"name\":\"Proceedings of the 5th International Conference on Computer Science and Software Engineering\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th International Conference on Computer Science and Software Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3569966.3570119\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Computer Science and Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3569966.3570119","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Failure Prediction Approach Supporting Multi Granularity Data Fusion for Large-scale Cloud Storage Systems
With the development of cloud computing and cloud storage technology, the data scale has grown rapidly. In order to store and process large-scale data, there are thousands of nodes and devices in the cloud storage center, resulting in a surge in the frequency of failures. In various types of failure events, storage device failure is the most important one. However, most cloud storage systems lack disk failure prediction mechanisms and could only replace disks after disk failures. It is particularly important to predict the potential risks in the system operation environment. In this paper, we propose a disk failure prediction approach that supports multi granularity data fusion, which solves problems of unbalanced samples, single data source, cross scenario model migration and insufficient generalization ability of prediction models in disk failure prediction. Through our proposed approach, the cloud storage system could accurately predict disk failures and actively push prediction results to users, so as to improve the pertinence and planning of the operation and maintenance work. The approach presented in this paper has been validated to be valid through a series of qualitative and quantitative experiments.