{"title":"Linear and Nonlinear Regression for Combinatorial Optimization Problem of Multiple Transgenesis","authors":"D. Tominaga, Kazuki Mori, S. Aburatani","doi":"10.2197/IPSJTBIO.9.7","DOIUrl":null,"url":null,"abstract":"Combinatorial optimization problem is a difficult class of problems from which to obtain exact solutions, but such problems often arise in biotechnology, for example, it is often necessary to find optimal combinations of genes in transgenics to improve production of a useful compound by microorganisms. In the cases of 20 candidate genes for introduction into cells, the number of possible combinations of introduced genes is approximately 106. Testing all of their combinations by experimental observation is impossible practically. A few combinations are observed experimentally for large numbers of possible combinations generally. We tested two methods for the prediction of effects of transgenes: multivariate linear regression and the RBF (Radial Basis Function) network, with a simulated and an unpublished experimentally observed dataset of transgenic yeast. Results show that RBF network can detect a special gene (introduced gene) at the five percent significance level when the gene causes production values that are 1.5 times greater than other genes for the simulated dataset. Prediction by RBF network causes over-learning for larger numbers of learning data, however, it is superior than that by the linear regression model at the best condition.","PeriodicalId":38959,"journal":{"name":"IPSJ Transactions on Bioinformatics","volume":"6 1","pages":"7-11"},"PeriodicalIF":0.0000,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2197/IPSJTBIO.9.7","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IPSJ Transactions on Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2197/IPSJTBIO.9.7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 2
Abstract
Combinatorial optimization problem is a difficult class of problems from which to obtain exact solutions, but such problems often arise in biotechnology, for example, it is often necessary to find optimal combinations of genes in transgenics to improve production of a useful compound by microorganisms. In the cases of 20 candidate genes for introduction into cells, the number of possible combinations of introduced genes is approximately 106. Testing all of their combinations by experimental observation is impossible practically. A few combinations are observed experimentally for large numbers of possible combinations generally. We tested two methods for the prediction of effects of transgenes: multivariate linear regression and the RBF (Radial Basis Function) network, with a simulated and an unpublished experimentally observed dataset of transgenic yeast. Results show that RBF network can detect a special gene (introduced gene) at the five percent significance level when the gene causes production values that are 1.5 times greater than other genes for the simulated dataset. Prediction by RBF network causes over-learning for larger numbers of learning data, however, it is superior than that by the linear regression model at the best condition.