Cun-Quan Zhang, Yunkai Liu, Elaine M. Eschen, Keqiang Wu
{"title":"用非统计近似方法识别dna序列中的调控信号","authors":"Cun-Quan Zhang, Yunkai Liu, Elaine M. Eschen, Keqiang Wu","doi":"10.1109/CSB.2003.1227417","DOIUrl":null,"url":null,"abstract":"The identification of regulatory signals is one of the most challenging tasks in bioinformatics. The development of gene-profiling technologies now makes it possible to obtain vast data on gene expression in a particular organism under various conditions. This has created the opportunity to identify and analyze the parts of the genome believed to be responsible for transcription control-the transcription factor DNA-binding motifs (TFBMs). Developing a practical and efficient computational tool to identify TFBMs will enable us to better understand the interplay among thousands of genes in a complex eukaryotic organism. This problem, which is mathematically formulated as the motif finding problem in computer science, has been studied extensively in recent years. We develop a new mathematical model and approximation technique for motif searching. Based on the graph theoretic and geometric properties of this approach, we propose a nonstatistical approximation algorithm to find motifs in a set of genome sequences.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying regulatory signals in DNA-sequences with a nonstatistical approximation approach\",\"authors\":\"Cun-Quan Zhang, Yunkai Liu, Elaine M. Eschen, Keqiang Wu\",\"doi\":\"10.1109/CSB.2003.1227417\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The identification of regulatory signals is one of the most challenging tasks in bioinformatics. The development of gene-profiling technologies now makes it possible to obtain vast data on gene expression in a particular organism under various conditions. This has created the opportunity to identify and analyze the parts of the genome believed to be responsible for transcription control-the transcription factor DNA-binding motifs (TFBMs). Developing a practical and efficient computational tool to identify TFBMs will enable us to better understand the interplay among thousands of genes in a complex eukaryotic organism. This problem, which is mathematically formulated as the motif finding problem in computer science, has been studied extensively in recent years. We develop a new mathematical model and approximation technique for motif searching. Based on the graph theoretic and geometric properties of this approach, we propose a nonstatistical approximation algorithm to find motifs in a set of genome sequences.\",\"PeriodicalId\":147883,\"journal\":{\"name\":\"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2003-08-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSB.2003.1227417\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSB.2003.1227417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Identifying regulatory signals in DNA-sequences with a nonstatistical approximation approach
The identification of regulatory signals is one of the most challenging tasks in bioinformatics. The development of gene-profiling technologies now makes it possible to obtain vast data on gene expression in a particular organism under various conditions. This has created the opportunity to identify and analyze the parts of the genome believed to be responsible for transcription control-the transcription factor DNA-binding motifs (TFBMs). Developing a practical and efficient computational tool to identify TFBMs will enable us to better understand the interplay among thousands of genes in a complex eukaryotic organism. This problem, which is mathematically formulated as the motif finding problem in computer science, has been studied extensively in recent years. We develop a new mathematical model and approximation technique for motif searching. Based on the graph theoretic and geometric properties of this approach, we propose a nonstatistical approximation algorithm to find motifs in a set of genome sequences.