{"title":"Investigating Evolutionary Relationships between Species through the Light of Graph Theory Based on the Multiplet Structure of the Genetic Code","authors":"Antara Sengupta, J. Das, P. Choudhury","doi":"10.1109/IACC.2017.0175","DOIUrl":null,"url":null,"abstract":"Investigating evolutionary relationship between various species through similarity/dissimilarity analysis is a fundamental method. In this present work firstly 20 canonical amino acids and 3 stop codons (terminations) are classified into five different classes depending upon their frequency mapping with 64 codons of genetic table. Secondly, each DNA sequence is represented by a weighted directed multi graph based on that classification. Thirdly, the procedure has been implemented to find out the evolutionary relationship between various species of alpha globin and beta globin genes. Here a new mathematical tool has been constructed to derive similarity/dissimilarity matrix, to get suitable phylogenetic trees for each data set. It is completely alignment free approach and hence the time complexity is directly proportional to the sequence length, that is O(N). Moreover the classification rule will decrease the complexity of graph constructions.","PeriodicalId":248433,"journal":{"name":"2017 IEEE 7th International Advance Computing Conference (IACC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 7th International Advance Computing Conference (IACC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IACC.2017.0175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
Investigating evolutionary relationship between various species through similarity/dissimilarity analysis is a fundamental method. In this present work firstly 20 canonical amino acids and 3 stop codons (terminations) are classified into five different classes depending upon their frequency mapping with 64 codons of genetic table. Secondly, each DNA sequence is represented by a weighted directed multi graph based on that classification. Thirdly, the procedure has been implemented to find out the evolutionary relationship between various species of alpha globin and beta globin genes. Here a new mathematical tool has been constructed to derive similarity/dissimilarity matrix, to get suitable phylogenetic trees for each data set. It is completely alignment free approach and hence the time complexity is directly proportional to the sequence length, that is O(N). Moreover the classification rule will decrease the complexity of graph constructions.