{"title":"Machine Learning Based Collaborative Prediction of SSD Failures in the Cloud","authors":"Yuze Jiang, Ruiming Lu, Shuyue Zhou, Qiao Li","doi":"10.1109/ACDSA59508.2024.10467231","DOIUrl":null,"url":null,"abstract":"SSDs (Solid-State Drives) have become integral components in modern data centers. Under such massive deployment, ensuring their reliability, longevity, and optimal performance is crucial. Despite SSD technology and architecture advancements, accurately predicting their failures, particularly with imperfect real-world data, remains a pertinent research challenge. Dataset imbalances have led to suboptimal prediction accuracy in baseline models. This study introduces to incorporate model-wise class balancing, aiming to refine the data processing for improved accuracy in machine learning models for SSD failure detection. When tested on Alibaba’s dataset of 700k NVMe SSDs, this method yielded higher failure prediction accuracy, with the average recall increasing from 51% to 63% and precision scores rising from 59% to 78%. This improvement in recall and precision demonstrates the method’s potential to advance the field of SSD failure prediction.","PeriodicalId":518964,"journal":{"name":"2024 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA)","volume":"89 6","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACDSA59508.2024.10467231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
SSDs (Solid-State Drives) have become integral components in modern data centers. Under such massive deployment, ensuring their reliability, longevity, and optimal performance is crucial. Despite SSD technology and architecture advancements, accurately predicting their failures, particularly with imperfect real-world data, remains a pertinent research challenge. Dataset imbalances have led to suboptimal prediction accuracy in baseline models. This study introduces to incorporate model-wise class balancing, aiming to refine the data processing for improved accuracy in machine learning models for SSD failure detection. When tested on Alibaba’s dataset of 700k NVMe SSDs, this method yielded higher failure prediction accuracy, with the average recall increasing from 51% to 63% and precision scores rising from 59% to 78%. This improvement in recall and precision demonstrates the method’s potential to advance the field of SSD failure prediction.