一种基于机器学习的数据分类器以减少ssd中的写放大

Yi-Ying Lu, Chin-Hsien Wu, Ya-Shu Chen
{"title":"一种基于机器学习的数据分类器以减少ssd中的写放大","authors":"Yi-Ying Lu, Chin-Hsien Wu, Ya-Shu Chen","doi":"10.1145/3400286.3418239","DOIUrl":null,"url":null,"abstract":"Solid-state drives (SSDs) that consist of flash memory have the advantages of non-volatility, fast speed, shock resistance, low-power consumption, and small size. Two critical characteristics of flash memory are that it does not support in-place updates, and it must write data in units of a page and erase data in units of a block. Due to the two characteristics, when a block is selected as a victim block to erase, we need to copy the remaining valid pages from the victim block to another free block and the additional copy overhead is called write amplification (WA). Therefore, how to reduce the write amplification (WA) is a crucial issue for SSDs. By performing data classification, it is effective to concentrate the invalid pages in specific blocks and decrease the distribution of invalid pages in the flash memory. The advantage is that we can reduce the write amplification due to the valid pages copied. In the paper, we will propose a machine-learning-based data classifier to classify the written data. Data with similar characteristics will be eventually written in the same group of data blocks in flash memory. Through such a design, it can improve the performance of SSDs by concentrating the invalid pages in the same block and reduce the write amplification.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Machine-Learning-based Data Classifier to Reduce the Write Amplification in SSDs\",\"authors\":\"Yi-Ying Lu, Chin-Hsien Wu, Ya-Shu Chen\",\"doi\":\"10.1145/3400286.3418239\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Solid-state drives (SSDs) that consist of flash memory have the advantages of non-volatility, fast speed, shock resistance, low-power consumption, and small size. Two critical characteristics of flash memory are that it does not support in-place updates, and it must write data in units of a page and erase data in units of a block. Due to the two characteristics, when a block is selected as a victim block to erase, we need to copy the remaining valid pages from the victim block to another free block and the additional copy overhead is called write amplification (WA). Therefore, how to reduce the write amplification (WA) is a crucial issue for SSDs. By performing data classification, it is effective to concentrate the invalid pages in specific blocks and decrease the distribution of invalid pages in the flash memory. The advantage is that we can reduce the write amplification due to the valid pages copied. In the paper, we will propose a machine-learning-based data classifier to classify the written data. Data with similar characteristics will be eventually written in the same group of data blocks in flash memory. Through such a design, it can improve the performance of SSDs by concentrating the invalid pages in the same block and reduce the write amplification.\",\"PeriodicalId\":326100,\"journal\":{\"name\":\"Proceedings of the International Conference on Research in Adaptive and Convergent Systems\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the International Conference on Research in Adaptive and Convergent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3400286.3418239\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3400286.3418239","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

由闪存组成的固态硬盘(Solid-state drives, ssd)具有无易失性、速度快、耐冲击、功耗低、体积小等优点。闪存的两个关键特性是它不支持就地更新,它必须以页为单位写入数据,以块为单位擦除数据。由于这两个特征,当选择一个块作为要擦除的受害块时,我们需要将剩余的有效页面从受害块复制到另一个空闲块,额外的复制开销称为写放大(write amplification, WA)。因此,如何降低写放大(write amplification, WA)是ssd的关键问题。通过对数据进行分类,可以有效地将无效页集中到特定的块中,减少无效页在闪存中的分布。这样做的好处是,我们可以减少由于复制有效页面而导致的写放大。在本文中,我们将提出一个基于机器学习的数据分类器来对书面数据进行分类。具有相似特征的数据最终将写入闪存中的同一组数据块中。通过这样的设计,可以将无效页集中在同一个块中,从而提高ssd的性能,减少写放大。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Machine-Learning-based Data Classifier to Reduce the Write Amplification in SSDs
Solid-state drives (SSDs) that consist of flash memory have the advantages of non-volatility, fast speed, shock resistance, low-power consumption, and small size. Two critical characteristics of flash memory are that it does not support in-place updates, and it must write data in units of a page and erase data in units of a block. Due to the two characteristics, when a block is selected as a victim block to erase, we need to copy the remaining valid pages from the victim block to another free block and the additional copy overhead is called write amplification (WA). Therefore, how to reduce the write amplification (WA) is a crucial issue for SSDs. By performing data classification, it is effective to concentrate the invalid pages in specific blocks and decrease the distribution of invalid pages in the flash memory. The advantage is that we can reduce the write amplification due to the valid pages copied. In the paper, we will propose a machine-learning-based data classifier to classify the written data. Data with similar characteristics will be eventually written in the same group of data blocks in flash memory. Through such a design, it can improve the performance of SSDs by concentrating the invalid pages in the same block and reduce the write amplification.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信