{"title":"Similarity Detection in Biological Sequences using Parameterized Matching and Q-gram","authors":"Rama Singh, D. Rai, R. Prasad, Rajeev Singh","doi":"10.1109/RAETCS.2018.8443925","DOIUrl":null,"url":null,"abstract":"Whenever characterization of a new DNA sequence takes place then, database search is carried out to find whether homolog’s of gene is present or not. Various evolutions in this field have marked the shift from exact matching to a completely different concept, parameterized matching. Parameterized matching is detected by consistent renaming of text and pattern using bijective mapping. While finding matches between pattern and text, the PBMH-Hash algorithm results in frequent occurrence of false matches with large number of character comparison. This paper presents a new algorithm to detect similarity in biological sequences. The proposed algorithm is based on the concept of Berry-Ravindran algorithm and q-Gram. Analysis shows that our algorithm outperforms existing PBMH-Hash algorithm.","PeriodicalId":131311,"journal":{"name":"2018 Recent Advances on Engineering, Technology and Computational Sciences (RAETCS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Recent Advances on Engineering, Technology and Computational Sciences (RAETCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAETCS.2018.8443925","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Whenever characterization of a new DNA sequence takes place then, database search is carried out to find whether homolog’s of gene is present or not. Various evolutions in this field have marked the shift from exact matching to a completely different concept, parameterized matching. Parameterized matching is detected by consistent renaming of text and pattern using bijective mapping. While finding matches between pattern and text, the PBMH-Hash algorithm results in frequent occurrence of false matches with large number of character comparison. This paper presents a new algorithm to detect similarity in biological sequences. The proposed algorithm is based on the concept of Berry-Ravindran algorithm and q-Gram. Analysis shows that our algorithm outperforms existing PBMH-Hash algorithm.