{"title":"适应性分子进化","authors":"Ziheng Yang","doi":"10.1002/0470022620.BBC10","DOIUrl":null,"url":null,"abstract":"This chapter reviews statistical methods for detecting adaptive molecular evolution by comparing synonymous and nonsynonymous substitution rates in protein-coding DNA sequences. A Markov process model of codon substitution, which forms the basis for all later discussions in this chapter, is introduced first. The case of comparing two sequences to estimate the numbers of synonymous (dS) and nonsynonymous (dN ) substitutions per site is then considered. The maximum likelihood (ML) method and a number of ad hoc counting methods are evaluated. The rest of the chapter deals with joint analyses of multiple sequences on a phylogeny. Review is done on Markov models of codon substitution that allow the nonsynonymous/synonymous rate ratio (ω = dN /dS) to vary among branches in a phylogeny or among amino acid sites in a protein. Those models can be used to construct likelihood ratio tests (LRTs) to identify evolutionary lineages under episodic Darwinian selection or to infer critical amino acids in a protein under diversifying selection. Real-data examples are used to demonstrate the application of the methods. The chapter finishes with a discussion of the limitations of current methods.","PeriodicalId":216924,"journal":{"name":"Handbook of Statistical Genomics","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"Adaptive Molecular Evolution\",\"authors\":\"Ziheng Yang\",\"doi\":\"10.1002/0470022620.BBC10\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This chapter reviews statistical methods for detecting adaptive molecular evolution by comparing synonymous and nonsynonymous substitution rates in protein-coding DNA sequences. A Markov process model of codon substitution, which forms the basis for all later discussions in this chapter, is introduced first. The case of comparing two sequences to estimate the numbers of synonymous (dS) and nonsynonymous (dN ) substitutions per site is then considered. The maximum likelihood (ML) method and a number of ad hoc counting methods are evaluated. The rest of the chapter deals with joint analyses of multiple sequences on a phylogeny. Review is done on Markov models of codon substitution that allow the nonsynonymous/synonymous rate ratio (ω = dN /dS) to vary among branches in a phylogeny or among amino acid sites in a protein. Those models can be used to construct likelihood ratio tests (LRTs) to identify evolutionary lineages under episodic Darwinian selection or to infer critical amino acids in a protein under diversifying selection. Real-data examples are used to demonstrate the application of the methods. The chapter finishes with a discussion of the limitations of current methods.\",\"PeriodicalId\":216924,\"journal\":{\"name\":\"Handbook of Statistical Genomics\",\"volume\":\"67 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Handbook of Statistical Genomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/0470022620.BBC10\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Handbook of Statistical Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/0470022620.BBC10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This chapter reviews statistical methods for detecting adaptive molecular evolution by comparing synonymous and nonsynonymous substitution rates in protein-coding DNA sequences. A Markov process model of codon substitution, which forms the basis for all later discussions in this chapter, is introduced first. The case of comparing two sequences to estimate the numbers of synonymous (dS) and nonsynonymous (dN ) substitutions per site is then considered. The maximum likelihood (ML) method and a number of ad hoc counting methods are evaluated. The rest of the chapter deals with joint analyses of multiple sequences on a phylogeny. Review is done on Markov models of codon substitution that allow the nonsynonymous/synonymous rate ratio (ω = dN /dS) to vary among branches in a phylogeny or among amino acid sites in a protein. Those models can be used to construct likelihood ratio tests (LRTs) to identify evolutionary lineages under episodic Darwinian selection or to infer critical amino acids in a protein under diversifying selection. Real-data examples are used to demonstrate the application of the methods. The chapter finishes with a discussion of the limitations of current methods.