{"title":"基于遗传规划的上位性符号建模的掩模函数。","authors":"Ryan J Urbanowicz, Bill C White, Jason H Moore","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>The study of common, complex multifactorial diseases in genetic epidemiology is complicated by nonlinearity in the genotype-to-phenotype mapping relationship that is due, in part, to epistasis or gene-gene interactions. Symobolic discriminant analysis (SDA) is a flexible modeling approach which uses genetic programming (GP) to evolve an optimal predictive model using a predefined collection of mathematical functions, constants, and attributes. This has been shown to be an effective strategy for modeling epistasis. In the present study, we introduce the genetic \"mask\" as a novel building block which exploits expert knowledge in the form of a pre-constructed relationship between two attributes. The goal of this study was to determine whether the availability of \"mask\" building blocks improves SDA performance. The results of this study support the idea that pre-processing data improves GP performance.</p>","PeriodicalId":88876,"journal":{"name":"Genetic and Evolutionary Computation Conference : [proceedings]. Genetic and Evolutionary Computation Conference","volume":"2008 ","pages":"339-346"},"PeriodicalIF":0.0000,"publicationDate":"2008-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3457012/pdf/nihms107977.pdf","citationCount":"0","resultStr":"{\"title\":\"Mask Functions for the Symbolic Modeling of Epistasis Using Genetic Programming.\",\"authors\":\"Ryan J Urbanowicz, Bill C White, Jason H Moore\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The study of common, complex multifactorial diseases in genetic epidemiology is complicated by nonlinearity in the genotype-to-phenotype mapping relationship that is due, in part, to epistasis or gene-gene interactions. Symobolic discriminant analysis (SDA) is a flexible modeling approach which uses genetic programming (GP) to evolve an optimal predictive model using a predefined collection of mathematical functions, constants, and attributes. This has been shown to be an effective strategy for modeling epistasis. In the present study, we introduce the genetic \\\"mask\\\" as a novel building block which exploits expert knowledge in the form of a pre-constructed relationship between two attributes. The goal of this study was to determine whether the availability of \\\"mask\\\" building blocks improves SDA performance. The results of this study support the idea that pre-processing data improves GP performance.</p>\",\"PeriodicalId\":88876,\"journal\":{\"name\":\"Genetic and Evolutionary Computation Conference : [proceedings]. Genetic and Evolutionary Computation Conference\",\"volume\":\"2008 \",\"pages\":\"339-346\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-07-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3457012/pdf/nihms107977.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Genetic and Evolutionary Computation Conference : [proceedings]. Genetic and Evolutionary Computation Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetic and Evolutionary Computation Conference : [proceedings]. Genetic and Evolutionary Computation Conference","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Mask Functions for the Symbolic Modeling of Epistasis Using Genetic Programming.
The study of common, complex multifactorial diseases in genetic epidemiology is complicated by nonlinearity in the genotype-to-phenotype mapping relationship that is due, in part, to epistasis or gene-gene interactions. Symobolic discriminant analysis (SDA) is a flexible modeling approach which uses genetic programming (GP) to evolve an optimal predictive model using a predefined collection of mathematical functions, constants, and attributes. This has been shown to be an effective strategy for modeling epistasis. In the present study, we introduce the genetic "mask" as a novel building block which exploits expert knowledge in the form of a pre-constructed relationship between two attributes. The goal of this study was to determine whether the availability of "mask" building blocks improves SDA performance. The results of this study support the idea that pre-processing data improves GP performance.