Wei-ling Chang, Xiao-chun Yun, Binxing Fang, Shupeng Wang
The Block LZSS Compression Algorithm
2009 Data Compression Conference, published 2009-03-16. DOI: 10.1109/DCC.2009.9
Mainstream compression algorithms such as LZ, Huffman, and PPM have been extensively studied in recent years. However, much less attention has been paid to the block variants of those algorithms. This study therefore investigates block LZSS. We studied how the compression ratio of block LZSS depends on the number of bits allotted to the index and length fields. We found that the number of length bits has little effect on the compression performance of block LZSS, whereas the number of index bits has a significant effect on the compression ratio. Experimental results show that, to obtain better efficiency from block LZSS, a moderately sized block larger than 32 KiB may be optimal, and that the optimal block size does not depend on file type. We also investigated the factors that affect the optimal block size, using the mean block standard deviation (MBS) and locality of reference to characterize the compression ratio. We found that good data locality implies a large skew in the data distribution, and that the greater the skew in the data distribution or the MBS, the better the compression ratio.
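To make the roles of the index bits, length bits, and block size concrete, the following is a minimal sketch of a block LZSS size estimator. It is not the authors' implementation. The names (`lzss_tokens`, `block_lzss_bits`), the brute-force greedy match search, the 1-bit literal/match flag, and the minimum match length of 3 are all assumptions made for illustration. Each block is compressed independently, so matches never reach back across a block boundary.

```python
from typing import List, Tuple, Union

# A token is either a literal byte (int) or a back-reference (offset, length).
Token = Union[int, Tuple[int, int]]

MIN_MATCH = 3  # assumed threshold: shorter matches are emitted as literals


def lzss_tokens(block: bytes, index_bits: int, length_bits: int) -> List[Token]:
    """Greedy LZSS parse of one block (brute-force search, for clarity only)."""
    window = 1 << index_bits                      # largest offset the index field can hold
    max_len = MIN_MATCH + (1 << length_bits) - 1  # longest match the length field can hold
    tokens: List[Token] = []
    i, n = 0, len(block)
    while i < n:
        best_len, best_off = 0, 0
        for j in range(max(0, i - window), i):
            k = 0
            # The match may overlap the current position (j + k >= i is allowed).
            while k < max_len and i + k < n and block[j + k] == block[i + k]:
                k += 1
            if k > best_len:
                best_len, best_off = k, i - j
        if best_len >= MIN_MATCH:
            tokens.append((best_off, best_len))
            i += best_len
        else:
            tokens.append(block[i])
            i += 1
    return tokens


def lzss_decode(tokens: List[Token]) -> bytes:
    """Rebuild the original block from its tokens."""
    out = bytearray()
    for tok in tokens:
        if isinstance(tok, int):
            out.append(tok)
        else:
            offset, length = tok
            for _ in range(length):
                out.append(out[-offset])  # byte-by-byte copy handles overlapping matches
    return bytes(out)


def compressed_bits(tokens: List[Token], index_bits: int, length_bits: int) -> int:
    """Encoded size in bits: 1 flag bit plus 8 (literal) or index+length bits (match)."""
    return sum(1 + (8 if isinstance(t, int) else index_bits + length_bits) for t in tokens)


def block_lzss_bits(data: bytes, block_size: int, index_bits: int, length_bits: int) -> int:
    """Total encoded size when every block of `block_size` bytes is compressed separately."""
    total = 0
    for start in range(0, len(data), block_size):
        block = data[start:start + block_size]
        total += compressed_bits(lzss_tokens(block, index_bits, length_bits),
                                 index_bits, length_bits)
    return total
```

Calling `block_lzss_bits(data, block_size, index_bits, length_bits)` over a range of parameter values and dividing by `8 * len(data)` gives a rough compression ratio for each setting. The quadratic match search keeps the sketch short but makes it practical only for small inputs.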