{"title":"基于机器学习的云中固态硬盘故障协作预测","authors":"Yuze Jiang, Ruiming Lu, Shuyue Zhou, Qiao Li","doi":"10.1109/ACDSA59508.2024.10467231","DOIUrl":null,"url":null,"abstract":"SSDs (Solid-State Drives) have become integral components in modern data centers. Under such massive deployment, ensuring their reliability, longevity, and optimal performance is crucial. Despite SSD technology and architecture advancements, accurately predicting their failures, particularly with imperfect real-world data, remains a pertinent research challenge. Dataset imbalances have led to suboptimal prediction accuracy in baseline models. This study introduces to incorporate model-wise class balancing, aiming to refine the data processing for improved accuracy in machine learning models for SSD failure detection. When tested on Alibaba’s dataset of 700k NVMe SSDs, this method yielded higher failure prediction accuracy, with the average recall increasing from 51% to 63% and precision scores rising from 59% to 78%. This improvement in recall and precision demonstrates the method’s potential to advance the field of SSD failure prediction.","PeriodicalId":518964,"journal":{"name":"2024 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA)","volume":"89 6","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning Based Collaborative Prediction of SSD Failures in the Cloud\",\"authors\":\"Yuze Jiang, Ruiming Lu, Shuyue Zhou, Qiao Li\",\"doi\":\"10.1109/ACDSA59508.2024.10467231\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"SSDs (Solid-State Drives) have become integral components in modern data centers. Under such massive deployment, ensuring their reliability, longevity, and optimal performance is crucial. Despite SSD technology and architecture advancements, accurately predicting their failures, particularly with imperfect real-world data, remains a pertinent research challenge. Dataset imbalances have led to suboptimal prediction accuracy in baseline models. This study introduces to incorporate model-wise class balancing, aiming to refine the data processing for improved accuracy in machine learning models for SSD failure detection. When tested on Alibaba’s dataset of 700k NVMe SSDs, this method yielded higher failure prediction accuracy, with the average recall increasing from 51% to 63% and precision scores rising from 59% to 78%. This improvement in recall and precision demonstrates the method’s potential to advance the field of SSD failure prediction.\",\"PeriodicalId\":518964,\"journal\":{\"name\":\"2024 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA)\",\"volume\":\"89 6\",\"pages\":\"1-6\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2024 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACDSA59508.2024.10467231\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACDSA59508.2024.10467231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Machine Learning Based Collaborative Prediction of SSD Failures in the Cloud
SSDs (Solid-State Drives) have become integral components in modern data centers. Under such massive deployment, ensuring their reliability, longevity, and optimal performance is crucial. Despite SSD technology and architecture advancements, accurately predicting their failures, particularly with imperfect real-world data, remains a pertinent research challenge. Dataset imbalances have led to suboptimal prediction accuracy in baseline models. This study introduces to incorporate model-wise class balancing, aiming to refine the data processing for improved accuracy in machine learning models for SSD failure detection. When tested on Alibaba’s dataset of 700k NVMe SSDs, this method yielded higher failure prediction accuracy, with the average recall increasing from 51% to 63% and precision scores rising from 59% to 78%. This improvement in recall and precision demonstrates the method’s potential to advance the field of SSD failure prediction.