{"title":"基于参数化匹配和Q-gram的生物序列相似性检测","authors":"Rama Singh, D. Rai, R. Prasad, Rajeev Singh","doi":"10.1109/RAETCS.2018.8443925","DOIUrl":null,"url":null,"abstract":"Whenever characterization of a new DNA sequence takes place then, database search is carried out to find whether homolog’s of gene is present or not. Various evolutions in this field have marked the shift from exact matching to a completely different concept, parameterized matching. Parameterized matching is detected by consistent renaming of text and pattern using bijective mapping. While finding matches between pattern and text, the PBMH-Hash algorithm results in frequent occurrence of false matches with large number of character comparison. This paper presents a new algorithm to detect similarity in biological sequences. The proposed algorithm is based on the concept of Berry-Ravindran algorithm and q-Gram. Analysis shows that our algorithm outperforms existing PBMH-Hash algorithm.","PeriodicalId":131311,"journal":{"name":"2018 Recent Advances on Engineering, Technology and Computational Sciences (RAETCS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Similarity Detection in Biological Sequences using Parameterized Matching and Q-gram\",\"authors\":\"Rama Singh, D. Rai, R. Prasad, Rajeev Singh\",\"doi\":\"10.1109/RAETCS.2018.8443925\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Whenever characterization of a new DNA sequence takes place then, database search is carried out to find whether homolog’s of gene is present or not. Various evolutions in this field have marked the shift from exact matching to a completely different concept, parameterized matching. Parameterized matching is detected by consistent renaming of text and pattern using bijective mapping. While finding matches between pattern and text, the PBMH-Hash algorithm results in frequent occurrence of false matches with large number of character comparison. This paper presents a new algorithm to detect similarity in biological sequences. The proposed algorithm is based on the concept of Berry-Ravindran algorithm and q-Gram. Analysis shows that our algorithm outperforms existing PBMH-Hash algorithm.\",\"PeriodicalId\":131311,\"journal\":{\"name\":\"2018 Recent Advances on Engineering, Technology and Computational Sciences (RAETCS)\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Recent Advances on Engineering, Technology and Computational Sciences (RAETCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RAETCS.2018.8443925\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Recent Advances on Engineering, Technology and Computational Sciences (RAETCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAETCS.2018.8443925","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Similarity Detection in Biological Sequences using Parameterized Matching and Q-gram
Whenever characterization of a new DNA sequence takes place then, database search is carried out to find whether homolog’s of gene is present or not. Various evolutions in this field have marked the shift from exact matching to a completely different concept, parameterized matching. Parameterized matching is detected by consistent renaming of text and pattern using bijective mapping. While finding matches between pattern and text, the PBMH-Hash algorithm results in frequent occurrence of false matches with large number of character comparison. This paper presents a new algorithm to detect similarity in biological sequences. The proposed algorithm is based on the concept of Berry-Ravindran algorithm and q-Gram. Analysis shows that our algorithm outperforms existing PBMH-Hash algorithm.