{"title":"应用于基因设计的进化算法估计","authors":"A. Eremeev, A. Spirov","doi":"10.1109/CSGB.2018.8544837","DOIUrl":null,"url":null,"abstract":"The field of evolutionary algorithms (EAs) emerged in the area of computer science due to transfer of ideas from biology and developed independently for several decades, enriched with techniques from probability theory, complexity theory and optimization methods. Our aim is to consider how some recent results form EA theory may be transferred back into biology. It has been noted that the EAs optimizing Royal Road fitness functions may be considered as models of evolutionary search for the gene promoter sequences from scratch. Here we consider the design of synthetic promoters from the EAs methodology viewpoint. This problem asks for a tight cluster of supposedly unknown motifs from the initial random (or partially random) set of DNA sequences using SELEX approaches. We apply the upper bounds on the expected hitting time of a target area of genotypic space, the EA runtime, in order to upper-bound the expected time to finding a sufficiently fit series of motifs (e.g. binding sites for transcription factors) in a SELEX procedure. On the other hand, using the EA theory we propose an upper bound on expected proportion of the DNA sequences with sufficiently high fitness at a given iteration of a SELEX procedure. Both approaches are evaluated in computational experiment, using a Royal Road fitness function as a model of the SELEX procedure for regulatory FIS factor binding site. Our results suggest that some theoretically provable bounds for EA performance may be used, at least in principle, for a-priory estimation of efficiency of SELEX-based approaches.","PeriodicalId":230439,"journal":{"name":"2018 11th International Multiconference Bioinformatics of Genome Regulation and Structure\\Systems Biology (BGRS\\SB)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Estimates from Evolutionary Algorithms Theory Applied to Gene Design\",\"authors\":\"A. Eremeev, A. Spirov\",\"doi\":\"10.1109/CSGB.2018.8544837\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The field of evolutionary algorithms (EAs) emerged in the area of computer science due to transfer of ideas from biology and developed independently for several decades, enriched with techniques from probability theory, complexity theory and optimization methods. Our aim is to consider how some recent results form EA theory may be transferred back into biology. It has been noted that the EAs optimizing Royal Road fitness functions may be considered as models of evolutionary search for the gene promoter sequences from scratch. Here we consider the design of synthetic promoters from the EAs methodology viewpoint. This problem asks for a tight cluster of supposedly unknown motifs from the initial random (or partially random) set of DNA sequences using SELEX approaches. We apply the upper bounds on the expected hitting time of a target area of genotypic space, the EA runtime, in order to upper-bound the expected time to finding a sufficiently fit series of motifs (e.g. binding sites for transcription factors) in a SELEX procedure. On the other hand, using the EA theory we propose an upper bound on expected proportion of the DNA sequences with sufficiently high fitness at a given iteration of a SELEX procedure. Both approaches are evaluated in computational experiment, using a Royal Road fitness function as a model of the SELEX procedure for regulatory FIS factor binding site. Our results suggest that some theoretically provable bounds for EA performance may be used, at least in principle, for a-priory estimation of efficiency of SELEX-based approaches.\",\"PeriodicalId\":230439,\"journal\":{\"name\":\"2018 11th International Multiconference Bioinformatics of Genome Regulation and Structure\\\\Systems Biology (BGRS\\\\SB)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 11th International Multiconference Bioinformatics of Genome Regulation and Structure\\\\Systems Biology (BGRS\\\\SB)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSGB.2018.8544837\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 11th International Multiconference Bioinformatics of Genome Regulation and Structure\\Systems Biology (BGRS\\SB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSGB.2018.8544837","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Estimates from Evolutionary Algorithms Theory Applied to Gene Design
The field of evolutionary algorithms (EAs) emerged in the area of computer science due to transfer of ideas from biology and developed independently for several decades, enriched with techniques from probability theory, complexity theory and optimization methods. Our aim is to consider how some recent results form EA theory may be transferred back into biology. It has been noted that the EAs optimizing Royal Road fitness functions may be considered as models of evolutionary search for the gene promoter sequences from scratch. Here we consider the design of synthetic promoters from the EAs methodology viewpoint. This problem asks for a tight cluster of supposedly unknown motifs from the initial random (or partially random) set of DNA sequences using SELEX approaches. We apply the upper bounds on the expected hitting time of a target area of genotypic space, the EA runtime, in order to upper-bound the expected time to finding a sufficiently fit series of motifs (e.g. binding sites for transcription factors) in a SELEX procedure. On the other hand, using the EA theory we propose an upper bound on expected proportion of the DNA sequences with sufficiently high fitness at a given iteration of a SELEX procedure. Both approaches are evaluated in computational experiment, using a Royal Road fitness function as a model of the SELEX procedure for regulatory FIS factor binding site. Our results suggest that some theoretically provable bounds for EA performance may be used, at least in principle, for a-priory estimation of efficiency of SELEX-based approaches.