Jan Carlo G. Maghirang, Roger Luis Uy, Kaizen Vinz A. Borja, Joven L. Pernez
{"title":"QCKer-FPGA:用于DNA序列比对的Q克计数滤波器的FPGA实现","authors":"Jan Carlo G. Maghirang, Roger Luis Uy, Kaizen Vinz A. Borja, Joven L. Pernez","doi":"10.1109/HNICEM48295.2019.9072768","DOIUrl":null,"url":null,"abstract":"Read mapping is a process in which DNA reads are mapped to a reference genome through filtering and verification using a predefined metric. Filtering is done by quickly eliminating incorrect regions when a DNA read is compared to the reference genome. Verification on the other hand is responsible for verifying these candidate regions which require mathematical and theoretical approaches. Due to large amounts of data produced by Next Generation Sequencing (NGS) platforms, a filter is needed to reduce various computational challenges introduced by the verification process. FPGAs are special purpose processors that are designed to handle compute-intensive applications, having a highly customizable fabric. In this paper, the q-gram counting filter is implemented that takes advantage of the flexibility and capabilities of FPGAs in parallel applications using the ZedBoard development board. The paper discusses the results of the filter with varying sizes of q, number of reads with various lengths, and different reference sequences. The results show an average of 34.02% lesser clock cycles with a q-gram length of 4 and 53.58% for q-gram of 8 when compared to an implementation in C.","PeriodicalId":6733,"journal":{"name":"2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management ( HNICEM )","volume":"27 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"QCKer-FPGA: An FPGA Implementation of Q- gram Counting Filter for DNA Sequence Alignment\",\"authors\":\"Jan Carlo G. Maghirang, Roger Luis Uy, Kaizen Vinz A. Borja, Joven L. Pernez\",\"doi\":\"10.1109/HNICEM48295.2019.9072768\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Read mapping is a process in which DNA reads are mapped to a reference genome through filtering and verification using a predefined metric. Filtering is done by quickly eliminating incorrect regions when a DNA read is compared to the reference genome. Verification on the other hand is responsible for verifying these candidate regions which require mathematical and theoretical approaches. Due to large amounts of data produced by Next Generation Sequencing (NGS) platforms, a filter is needed to reduce various computational challenges introduced by the verification process. FPGAs are special purpose processors that are designed to handle compute-intensive applications, having a highly customizable fabric. In this paper, the q-gram counting filter is implemented that takes advantage of the flexibility and capabilities of FPGAs in parallel applications using the ZedBoard development board. The paper discusses the results of the filter with varying sizes of q, number of reads with various lengths, and different reference sequences. The results show an average of 34.02% lesser clock cycles with a q-gram length of 4 and 53.58% for q-gram of 8 when compared to an implementation in C.\",\"PeriodicalId\":6733,\"journal\":{\"name\":\"2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management ( HNICEM )\",\"volume\":\"27 1\",\"pages\":\"1-6\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management ( HNICEM )\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HNICEM48295.2019.9072768\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management ( HNICEM )","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM48295.2019.9072768","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
QCKer-FPGA: An FPGA Implementation of Q- gram Counting Filter for DNA Sequence Alignment
Read mapping is a process in which DNA reads are mapped to a reference genome through filtering and verification using a predefined metric. Filtering is done by quickly eliminating incorrect regions when a DNA read is compared to the reference genome. Verification on the other hand is responsible for verifying these candidate regions which require mathematical and theoretical approaches. Due to large amounts of data produced by Next Generation Sequencing (NGS) platforms, a filter is needed to reduce various computational challenges introduced by the verification process. FPGAs are special purpose processors that are designed to handle compute-intensive applications, having a highly customizable fabric. In this paper, the q-gram counting filter is implemented that takes advantage of the flexibility and capabilities of FPGAs in parallel applications using the ZedBoard development board. The paper discusses the results of the filter with varying sizes of q, number of reads with various lengths, and different reference sequences. The results show an average of 34.02% lesser clock cycles with a q-gram length of 4 and 53.58% for q-gram of 8 when compared to an implementation in C.