{"title":"有界引用的数据压缩","authors":"M. Banikazemi","doi":"10.1109/DCC.2009.70","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a new compression/decompression algorithm called LZB which belongs to a class of algorithms related to Lempel-Ziv (LZ). The distinguishing characteristic of LZB is that it allows decompression from arbitrary points of compressed data. This is accomplished by setting a limit on how far back a reference in compressed data can directly or indirectly point to. We enforce this limit by using a sliding \"gate.\" During the compression, we keep track of the origin of each input symbol. The origin of a symbol is the earliest symbol in the input data that the symbol (directly or indirectly) refers to. By using this information we avoid using any reference which go beyond the gate boundary. We modified the gzip implementation of LZ77 to implement LZB. We then compared LZB with the alternative method in which data is segmented into smaller pieces and each piece is compressed separately by using the standard gzip. The results show that LZB improves the compression ratio by 10 to 50 percent for 1024 to 128 byte segment sizes.","PeriodicalId":377880,"journal":{"name":"2009 Data Compression Conference","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"LZB: Data Compression with Bounded References\",\"authors\":\"M. Banikazemi\",\"doi\":\"10.1109/DCC.2009.70\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a new compression/decompression algorithm called LZB which belongs to a class of algorithms related to Lempel-Ziv (LZ). The distinguishing characteristic of LZB is that it allows decompression from arbitrary points of compressed data. This is accomplished by setting a limit on how far back a reference in compressed data can directly or indirectly point to. We enforce this limit by using a sliding \\\"gate.\\\" During the compression, we keep track of the origin of each input symbol. The origin of a symbol is the earliest symbol in the input data that the symbol (directly or indirectly) refers to. By using this information we avoid using any reference which go beyond the gate boundary. We modified the gzip implementation of LZ77 to implement LZB. We then compared LZB with the alternative method in which data is segmented into smaller pieces and each piece is compressed separately by using the standard gzip. The results show that LZB improves the compression ratio by 10 to 50 percent for 1024 to 128 byte segment sizes.\",\"PeriodicalId\":377880,\"journal\":{\"name\":\"2009 Data Compression Conference\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-03-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 Data Compression Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DCC.2009.70\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Data Compression Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DCC.2009.70","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In this paper, we propose a new compression/decompression algorithm called LZB which belongs to a class of algorithms related to Lempel-Ziv (LZ). The distinguishing characteristic of LZB is that it allows decompression from arbitrary points of compressed data. This is accomplished by setting a limit on how far back a reference in compressed data can directly or indirectly point to. We enforce this limit by using a sliding "gate." During the compression, we keep track of the origin of each input symbol. The origin of a symbol is the earliest symbol in the input data that the symbol (directly or indirectly) refers to. By using this information we avoid using any reference which go beyond the gate boundary. We modified the gzip implementation of LZ77 to implement LZB. We then compared LZB with the alternative method in which data is segmented into smaller pieces and each piece is compressed separately by using the standard gzip. The results show that LZB improves the compression ratio by 10 to 50 percent for 1024 to 128 byte segment sizes.