{"title":"基于多项式的DNA序列压缩与搜索表示","authors":"W. A. Khan, Aftab Khan","doi":"10.1109/PuneCon50868.2020.9362362","DOIUrl":null,"url":null,"abstract":"Data compression plays an important role in lesser expensive resource consumption. It is also cumbersome to send a huge amount of data through the network. The data compression techniques help in optimizing physical storage devices as well as it makes easy to transfer the compressed data or file faster through internet or network. Deoxyribonucleic acid or DNA is biological traits presents in human and almost all other living organisms. The DNA contains the genetic information of the living organisms. In this research, various compression algorithms for DNA sequences are studied and compared. The scheme proposed here for compression is the use of a burrows wheeler compression technique. Our proposed algorithm is the modification of the Burrows Wheeler Compression Algorithm (BWCA). The lexicographical sorting of the matrix is replaced by the polynomial base representation of the bases of DNA. This method reduces the time for compression positively. Validation was performed on various datasets that concur the efficiency of the proposed scheme.","PeriodicalId":368862,"journal":{"name":"2020 IEEE Pune Section International Conference (PuneCon)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Polynomial Based Representation for DNA Sequence Compression and Search\",\"authors\":\"W. A. Khan, Aftab Khan\",\"doi\":\"10.1109/PuneCon50868.2020.9362362\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data compression plays an important role in lesser expensive resource consumption. It is also cumbersome to send a huge amount of data through the network. The data compression techniques help in optimizing physical storage devices as well as it makes easy to transfer the compressed data or file faster through internet or network. Deoxyribonucleic acid or DNA is biological traits presents in human and almost all other living organisms. The DNA contains the genetic information of the living organisms. In this research, various compression algorithms for DNA sequences are studied and compared. The scheme proposed here for compression is the use of a burrows wheeler compression technique. Our proposed algorithm is the modification of the Burrows Wheeler Compression Algorithm (BWCA). The lexicographical sorting of the matrix is replaced by the polynomial base representation of the bases of DNA. This method reduces the time for compression positively. Validation was performed on various datasets that concur the efficiency of the proposed scheme.\",\"PeriodicalId\":368862,\"journal\":{\"name\":\"2020 IEEE Pune Section International Conference (PuneCon)\",\"volume\":\"59 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Pune Section International Conference (PuneCon)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PuneCon50868.2020.9362362\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Pune Section International Conference (PuneCon)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PuneCon50868.2020.9362362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Polynomial Based Representation for DNA Sequence Compression and Search
Data compression plays an important role in lesser expensive resource consumption. It is also cumbersome to send a huge amount of data through the network. The data compression techniques help in optimizing physical storage devices as well as it makes easy to transfer the compressed data or file faster through internet or network. Deoxyribonucleic acid or DNA is biological traits presents in human and almost all other living organisms. The DNA contains the genetic information of the living organisms. In this research, various compression algorithms for DNA sequences are studied and compared. The scheme proposed here for compression is the use of a burrows wheeler compression technique. Our proposed algorithm is the modification of the Burrows Wheeler Compression Algorithm (BWCA). The lexicographical sorting of the matrix is replaced by the polynomial base representation of the bases of DNA. This method reduces the time for compression positively. Validation was performed on various datasets that concur the efficiency of the proposed scheme.