{"title":"The small genomics lab experience optimizing data cold storage.","authors":"Elisha D O Roberson","doi":"10.1101/2025.03.18.643355","DOIUrl":null,"url":null,"abstract":"<p><p>Translational research is often a collaborative enterprise that involves basic science researchers, clinicians, and experts in genomics and bioinformatics. While there are central university and industry cores to support data generation, long-term storage often falls to the individual investigators. We frequently fulfill the role of long-term FASTQ file storage for our collaborators. To reduce our cold storage space, we tested the space savings for gzip and zstandard algorithms on an old set of FASTQ files. We found that zstandard had a better overall compression ratio than the best gzip algorithm, amounting to more than 20% space savings overall compared to gzip. It may be worth transitioning to zstandard compression for small, collaborative genomics labs to minimize cold storage costs.</p>","PeriodicalId":519960,"journal":{"name":"bioRxiv : the preprint server for biology","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11956953/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv : the preprint server for biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.03.18.643355","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Translational research is often a collaborative enterprise that involves basic science researchers, clinicians, and experts in genomics and bioinformatics. While there are central university and industry cores to support data generation, long-term storage often falls to the individual investigators. We frequently fulfill the role of long-term FASTQ file storage for our collaborators. To reduce our cold storage space, we tested the space savings for gzip and zstandard algorithms on an old set of FASTQ files. We found that zstandard had a better overall compression ratio than the best gzip algorithm, amounting to more than 20% space savings overall compared to gzip. It may be worth transitioning to zstandard compression for small, collaborative genomics labs to minimize cold storage costs.