{"title":"Accelerated implementation of FQSqueezer novel genomic compression method","authors":"Monica Amich, P. D. Luca, S. Fiscale","doi":"10.1109/ISPDC51135.2020.00030","DOIUrl":null,"url":null,"abstract":"Biological data contain very important information for genoma analysis. In last decades, the size of these data is constantly growing. So the Next Generation Sequence (NGS) data has been introduced. These kind of data are represented by different data formats, such as FASTQ, FASTA, SAM, etc. In order to allow a good analysis and storing of them, due to large dimension of these data, several compressors have been performed. FQSqueezer is a novel genomic compressor for FASTQ data files. But several issues are present due to multithread version that runs on multi-core hardware. It is wellknown that the number of cores in a CPU is limited and very minor with respect to GPUs’ cores number. In order to increase the performance related to this compressor method, in this work we present a GPU-parallel implementation of cited compressor by exploiting CUDA framework. More precisely, a suitable domain decomposition is able to give an appreciable gain of performance in terms of time and reliability. Several execution tests confirm the gain of efficiency achieved by our parallel implementation.","PeriodicalId":426824,"journal":{"name":"2020 19th International Symposium on Parallel and Distributed Computing (ISPDC)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 19th International Symposium on Parallel and Distributed Computing (ISPDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPDC51135.2020.00030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Biological data contain very important information for genoma analysis. In last decades, the size of these data is constantly growing. So the Next Generation Sequence (NGS) data has been introduced. These kind of data are represented by different data formats, such as FASTQ, FASTA, SAM, etc. In order to allow a good analysis and storing of them, due to large dimension of these data, several compressors have been performed. FQSqueezer is a novel genomic compressor for FASTQ data files. But several issues are present due to multithread version that runs on multi-core hardware. It is wellknown that the number of cores in a CPU is limited and very minor with respect to GPUs’ cores number. In order to increase the performance related to this compressor method, in this work we present a GPU-parallel implementation of cited compressor by exploiting CUDA framework. More precisely, a suitable domain decomposition is able to give an appreciable gain of performance in terms of time and reliability. Several execution tests confirm the gain of efficiency achieved by our parallel implementation.