Aaron Russell Fajardo, Saira Kaye Manalili, Candace Claire Mercado, R. Zapanta, R. Uy
{"title":"Multiprocess Implementation of DNA Pre-alignment Filtering using the Bit Matrix Algorithm","authors":"Aaron Russell Fajardo, Saira Kaye Manalili, Candace Claire Mercado, R. Zapanta, R. Uy","doi":"10.1109/HNICEM51456.2020.9400049","DOIUrl":null,"url":null,"abstract":"With the emergence of more advance technologies in genomic research, it has becomes easier to answer biological and genomic questions. To be able to analyze advance genomic data, however, the process has to start with the determination of sequence variations between two genomic sequences. This process is called the read mapping process. This involves mapping reads to a similar reference genome which can be computationally intensive. Because of this, most works adapt the filter-verification paradigm. More recently, there are works that focus on improving the filtering phase by adding another layer to it called the pre-alignment filter. This manuscript focuses on the bit matrix pre-alignment filter with a goal of improving the implementation by using multiprocessing techniques that exploit multi-core CPUs. The results of this manuscript show that there is an improvement in using the parallel version over the sequential version by an average factor of 4.2 without sacrificing accuracy.","PeriodicalId":230810,"journal":{"name":"2020 IEEE 12th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 12th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM51456.2020.9400049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the emergence of more advance technologies in genomic research, it has becomes easier to answer biological and genomic questions. To be able to analyze advance genomic data, however, the process has to start with the determination of sequence variations between two genomic sequences. This process is called the read mapping process. This involves mapping reads to a similar reference genome which can be computationally intensive. Because of this, most works adapt the filter-verification paradigm. More recently, there are works that focus on improving the filtering phase by adding another layer to it called the pre-alignment filter. This manuscript focuses on the bit matrix pre-alignment filter with a goal of improving the implementation by using multiprocessing techniques that exploit multi-core CPUs. The results of this manuscript show that there is an improvement in using the parallel version over the sequential version by an average factor of 4.2 without sacrificing accuracy.