{"title":"Genetic algorithm approach for the closest string problem","authors":"Holger Mauch, M. Melzer, John S. Hu","doi":"10.1109/CSB.2003.1227407","DOIUrl":null,"url":null,"abstract":"A fundamental aspect of post-transcriptional gene silencing (PTGS) or RNA interference (RNAi) is the requirement of sequence homology between the transgene and viral or messenger RNAs being targeted. For example, virus-resistant transgenic plants are resistant only to viruses that are closely related (i.e. high sequence homology) to the virus from which the transgene was derived. One idea for broadening this resistance is to devise an artificial sequence that incorporates the sequence variation found in a viral population. This requires an algorithm which can determine an artificial sequence with an optimal (or at least a 90-95% ) homology to all of the viral sequences in a population. The genetic algorithm (GA) presented in this paper serves this purpose. It should be of great value to all researchers who utilize PTGS or RNAi. In the context of coding theory, the task is to find the radius of a code S /spl sub/ {A, C, G, T} /sup n/. In computational biology this problem is commonly referred to as the closest string problem. Experimental results suggest that this NP-complete optimization problem can be approached well with a custom-built GA.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSB.2003.1227407","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 25
Abstract
A fundamental aspect of post-transcriptional gene silencing (PTGS) or RNA interference (RNAi) is the requirement of sequence homology between the transgene and viral or messenger RNAs being targeted. For example, virus-resistant transgenic plants are resistant only to viruses that are closely related (i.e. high sequence homology) to the virus from which the transgene was derived. One idea for broadening this resistance is to devise an artificial sequence that incorporates the sequence variation found in a viral population. This requires an algorithm which can determine an artificial sequence with an optimal (or at least a 90-95% ) homology to all of the viral sequences in a population. The genetic algorithm (GA) presented in this paper serves this purpose. It should be of great value to all researchers who utilize PTGS or RNAi. In the context of coding theory, the task is to find the radius of a code S /spl sub/ {A, C, G, T} /sup n/. In computational biology this problem is commonly referred to as the closest string problem. Experimental results suggest that this NP-complete optimization problem can be approached well with a custom-built GA.