支持大规模云存储系统多粒度数据融合的故障预测方法

Yongyang Cheng, T. Zhang, Jing Luo
{"title":"支持大规模云存储系统多粒度数据融合的故障预测方法","authors":"Yongyang Cheng, T. Zhang, Jing Luo","doi":"10.1145/3569966.3570119","DOIUrl":null,"url":null,"abstract":"With the development of cloud computing and cloud storage technology, the data scale has grown rapidly. In order to store and process large-scale data, there are thousands of nodes and devices in the cloud storage center, resulting in a surge in the frequency of failures. In various types of failure events, storage device failure is the most important one. However, most cloud storage systems lack disk failure prediction mechanisms and could only replace disks after disk failures. It is particularly important to predict the potential risks in the system operation environment. In this paper, we propose a disk failure prediction approach that supports multi granularity data fusion, which solves problems of unbalanced samples, single data source, cross scenario model migration and insufficient generalization ability of prediction models in disk failure prediction. Through our proposed approach, the cloud storage system could accurately predict disk failures and actively push prediction results to users, so as to improve the pertinence and planning of the operation and maintenance work. The approach presented in this paper has been validated to be valid through a series of qualitative and quantitative experiments.","PeriodicalId":145580,"journal":{"name":"Proceedings of the 5th International Conference on Computer Science and Software Engineering","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Failure Prediction Approach Supporting Multi Granularity Data Fusion for Large-scale Cloud Storage Systems\",\"authors\":\"Yongyang Cheng, T. Zhang, Jing Luo\",\"doi\":\"10.1145/3569966.3570119\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of cloud computing and cloud storage technology, the data scale has grown rapidly. In order to store and process large-scale data, there are thousands of nodes and devices in the cloud storage center, resulting in a surge in the frequency of failures. In various types of failure events, storage device failure is the most important one. However, most cloud storage systems lack disk failure prediction mechanisms and could only replace disks after disk failures. It is particularly important to predict the potential risks in the system operation environment. In this paper, we propose a disk failure prediction approach that supports multi granularity data fusion, which solves problems of unbalanced samples, single data source, cross scenario model migration and insufficient generalization ability of prediction models in disk failure prediction. Through our proposed approach, the cloud storage system could accurately predict disk failures and actively push prediction results to users, so as to improve the pertinence and planning of the operation and maintenance work. The approach presented in this paper has been validated to be valid through a series of qualitative and quantitative experiments.\",\"PeriodicalId\":145580,\"journal\":{\"name\":\"Proceedings of the 5th International Conference on Computer Science and Software Engineering\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th International Conference on Computer Science and Software Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3569966.3570119\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Computer Science and Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3569966.3570119","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

随着云计算和云存储技术的发展,数据规模迅速增长。为了存储和处理大规模数据,云存储中心有成千上万的节点和设备,导致故障频率激增。在各种类型的故障事件中,存储设备故障是最重要的一类。然而,大多数云存储系统缺乏硬盘故障预测机制,只能在硬盘故障后进行更换。对系统运行环境的潜在风险进行预测尤为重要。本文提出了一种支持多粒度数据融合的磁盘故障预测方法,解决了磁盘故障预测中样本不平衡、数据源单一、跨场景模型迁移以及预测模型泛化能力不足等问题。通过我们提出的方法,云存储系统可以准确预测硬盘故障,并主动将预测结果推送给用户,从而提高运维工作的针对性和计划性。通过一系列定性和定量实验,验证了本文方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Failure Prediction Approach Supporting Multi Granularity Data Fusion for Large-scale Cloud Storage Systems
With the development of cloud computing and cloud storage technology, the data scale has grown rapidly. In order to store and process large-scale data, there are thousands of nodes and devices in the cloud storage center, resulting in a surge in the frequency of failures. In various types of failure events, storage device failure is the most important one. However, most cloud storage systems lack disk failure prediction mechanisms and could only replace disks after disk failures. It is particularly important to predict the potential risks in the system operation environment. In this paper, we propose a disk failure prediction approach that supports multi granularity data fusion, which solves problems of unbalanced samples, single data source, cross scenario model migration and insufficient generalization ability of prediction models in disk failure prediction. Through our proposed approach, the cloud storage system could accurately predict disk failures and actively push prediction results to users, so as to improve the pertinence and planning of the operation and maintenance work. The approach presented in this paper has been validated to be valid through a series of qualitative and quantitative experiments.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信