{"title":"基于遗传算法的多序列比对分解","authors":"F. Naznin, R. Sarker, D. Essam","doi":"10.1109/CIBCB.2010.5510595","DOIUrl":null,"url":null,"abstract":"Multiple sequence alignment is one of the most important issues in molecular biology as it plays an important role such as in life saving drug design. In this paper, we divide given sequences into two or more subsequences and then combine them together in order to find better multiple sequence alignments by applying a new GA based approach to the combined sequences. We also introduce new ways of generating an initial population and of applying the genetic operators. We have carried out experiments for the BAliBASE benchmark database using the sum of pair objective function with the PAM250 score matrix. To evaluate our proposed approach, we have compared with well known methods such as T-Coffee, MUSCLE, MAFFT and ProbCons. The experimental results show that better multiple sequence alignments may be obtained with higher number of divisions, however the computation time increases with the number of decompositions. The overall performance of the proposed Decomposition with GA (DGA) method is better than the existing methods and the GA method (without decompositions).","PeriodicalId":340637,"journal":{"name":"2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"DGA: Decomposition with genetic algorithm for multiple sequence alignment\",\"authors\":\"F. Naznin, R. Sarker, D. Essam\",\"doi\":\"10.1109/CIBCB.2010.5510595\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multiple sequence alignment is one of the most important issues in molecular biology as it plays an important role such as in life saving drug design. In this paper, we divide given sequences into two or more subsequences and then combine them together in order to find better multiple sequence alignments by applying a new GA based approach to the combined sequences. We also introduce new ways of generating an initial population and of applying the genetic operators. We have carried out experiments for the BAliBASE benchmark database using the sum of pair objective function with the PAM250 score matrix. To evaluate our proposed approach, we have compared with well known methods such as T-Coffee, MUSCLE, MAFFT and ProbCons. The experimental results show that better multiple sequence alignments may be obtained with higher number of divisions, however the computation time increases with the number of decompositions. The overall performance of the proposed Decomposition with GA (DGA) method is better than the existing methods and the GA method (without decompositions).\",\"PeriodicalId\":340637,\"journal\":{\"name\":\"2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-05-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIBCB.2010.5510595\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2010.5510595","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
DGA: Decomposition with genetic algorithm for multiple sequence alignment
Multiple sequence alignment is one of the most important issues in molecular biology as it plays an important role such as in life saving drug design. In this paper, we divide given sequences into two or more subsequences and then combine them together in order to find better multiple sequence alignments by applying a new GA based approach to the combined sequences. We also introduce new ways of generating an initial population and of applying the genetic operators. We have carried out experiments for the BAliBASE benchmark database using the sum of pair objective function with the PAM250 score matrix. To evaluate our proposed approach, we have compared with well known methods such as T-Coffee, MUSCLE, MAFFT and ProbCons. The experimental results show that better multiple sequence alignments may be obtained with higher number of divisions, however the computation time increases with the number of decompositions. The overall performance of the proposed Decomposition with GA (DGA) method is better than the existing methods and the GA method (without decompositions).