Compression Performance Analysis of Different File Formats

Han Yang, Guangjun Qin, Yongqing Hu
{"title":"Compression Performance Analysis of Different File Formats","authors":"Han Yang, Guangjun Qin, Yongqing Hu","doi":"arxiv-2308.12275","DOIUrl":null,"url":null,"abstract":"In data storage and transmission, file compression is a common technique for\nreducing the volume of data, reducing data storage space and transmission time\nand bandwidth. However, there are significant differences in the compression\nperformance of different types of file formats, and the benefits vary. In this\npaper, 22 file formats with approximately 178GB of data were collected and the\nZlib algorithm was used for compression experiments to compare performance in\norder to investigate the compression gains of different file types. The\nexperimental results show that some file types are poorly compressed, with\nalmost constant file size and long compression time, resulting in lower gains;\nsome other file types are significantly reduced in file size and compression\ntime after compression, which can effectively reduce the data volume. Based on\nthe above experimental results, this paper will then selectively reduce the\ndata volume by compression in data storage and transmission for the file types\nin order to obtain the maximum compression yield.","PeriodicalId":501310,"journal":{"name":"arXiv - CS - Other Computer Science","volume":"9 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Other Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2308.12275","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In data storage and transmission, file compression is a common technique for reducing the volume of data, reducing data storage space and transmission time and bandwidth. However, there are significant differences in the compression performance of different types of file formats, and the benefits vary. In this paper, 22 file formats with approximately 178GB of data were collected and the Zlib algorithm was used for compression experiments to compare performance in order to investigate the compression gains of different file types. The experimental results show that some file types are poorly compressed, with almost constant file size and long compression time, resulting in lower gains; some other file types are significantly reduced in file size and compression time after compression, which can effectively reduce the data volume. Based on the above experimental results, this paper will then selectively reduce the data volume by compression in data storage and transmission for the file types in order to obtain the maximum compression yield.
不同文件格式的压缩性能分析
在数据存储和传输中,文件压缩是减少数据量、减少数据存储空间、减少传输时间和带宽的常用技术。但是,不同类型的文件格式在压缩性能上存在显著差异,其好处也各不相同。本文收集了22种文件格式约178GB的数据,并使用zlib算法进行压缩实验,比较性能,以研究不同文件类型的压缩收益。实验结果表明,有些文件类型压缩效果较差,文件大小几乎不变,压缩时间长,导致增益较低;有些文件类型压缩后文件大小和压缩时间明显减小,可以有效地减少数据量。在上述实验结果的基础上,本文将针对文件类型在数据存储和传输中选择性地压缩数据量,以获得最大的压缩产量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信