{"title":"Robust Algorithm for Finding Weak Motifs","authors":"X. Yang, Jagath Rajapakse","doi":"10.1109/CIBCB.2005.1594948","DOIUrl":null,"url":null,"abstract":"The Challenge Problem posed by Pevzner et al. showed that special algorithms are needed to detect weak motifs in bio-sequences, where the classical approaches, such as MEME and Gibbs Sampler, fail. Though several algorithms have since been developed to solve the weak motif recognition problem, their focus has been on exact datasets and their performances show poor tolerance to the noisy datasets, i.e., for datasets bearing sequences without any motif instances. We propose a novel approach to find weak motifs that is robust to noise in the datasets. The experiments with synthetic datasets show that our algorithm has less running time and higher accuracy in detecting weak motifs over the existing approaches and is more robust to the presense of noise. The application of the algorithm on some promoter datasets from yeast genomes found previously-proven binding sites.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2005.1594948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The Challenge Problem posed by Pevzner et al. showed that special algorithms are needed to detect weak motifs in bio-sequences, where the classical approaches, such as MEME and Gibbs Sampler, fail. Though several algorithms have since been developed to solve the weak motif recognition problem, their focus has been on exact datasets and their performances show poor tolerance to the noisy datasets, i.e., for datasets bearing sequences without any motif instances. We propose a novel approach to find weak motifs that is robust to noise in the datasets. The experiments with synthetic datasets show that our algorithm has less running time and higher accuracy in detecting weak motifs over the existing approaches and is more robust to the presense of noise. The application of the algorithm on some promoter datasets from yeast genomes found previously-proven binding sites.