Co-evolution of alpha-helical transmembrane protein residues: large-scale variant profiling and complete mutational landscape of 2277 known PDB entries representing 504 unique human protein sequences.
{"title":"Co-evolution of alpha-helical transmembrane protein residues: large-scale variant profiling and complete mutational landscape of 2277 known PDB entries representing 504 unique human protein sequences.","authors":"Taner Karagöl, Alper Karagöl, Shuguang Zhang","doi":"10.1007/s00239-025-10262-8","DOIUrl":null,"url":null,"abstract":"<p><p>Membrane proteins play fundamental roles in cellular function, yet the evolutionary dynamics of their amino acid composition remain poorly understood. Our current study investigates the substitutional landscape and evolutionary patterns of hydrophilic and hydrophobic residues in membrane α-helical proteins, addressing a significant gap in our knowledge of protein evolution. We analyzed 2277 high-resolution protein structures from the RCSB Protein Data Bank corresponding to 458 unique PDB structures, 504 UniProt transmembrane entries and their AlphaMissense predicted mutational libraries including more than 5.8 million amino acid substitutions, focusing on known transmembrane α-helical proteins in Homo sapiens. Our analysis showed that the pathological outcome of the substitutions is diverse, as nonpolar to polar changes showed higher pathological scores in general. Notably, F <=> Y substitutions showed significantly lower pathological scores. Our further analysis revealed a significant asymmetry in the evolutionary frequencies of polar and nonpolar amino acids. We identified key residue pairs driving this asymmetry, with F <=> Y, A <=> T, V <=> T and A <=> S co-evolution diverging from the expected negative correlations (Spearman's rho > 0.20, p < 0.001). The V <=> T substitution via an alanine intermediate and the G <=> N substitution via a serine intermediate lower their statistical barrier, which would otherwise require two sequential base changes. We propose two evolutionary game theory (EGT) based models to explain their diversification, with partial correlation analysis on residue frequencies in homolog sequences. These mathematical insights suggest a previously unrecognized evolutionary pressure, potentially linked to functional diversification, which could be targeted to combat drug resistance. Our results offer insights into membrane protein evolution and may inform improved methods for protein structure prediction and design.</p>","PeriodicalId":16366,"journal":{"name":"Journal of Molecular Evolution","volume":" ","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Evolution","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s00239-025-10262-8","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Membrane proteins play fundamental roles in cellular function, yet the evolutionary dynamics of their amino acid composition remain poorly understood. Our current study investigates the substitutional landscape and evolutionary patterns of hydrophilic and hydrophobic residues in membrane α-helical proteins, addressing a significant gap in our knowledge of protein evolution. We analyzed 2277 high-resolution protein structures from the RCSB Protein Data Bank corresponding to 458 unique PDB structures, 504 UniProt transmembrane entries and their AlphaMissense predicted mutational libraries including more than 5.8 million amino acid substitutions, focusing on known transmembrane α-helical proteins in Homo sapiens. Our analysis showed that the pathological outcome of the substitutions is diverse, as nonpolar to polar changes showed higher pathological scores in general. Notably, F <=> Y substitutions showed significantly lower pathological scores. Our further analysis revealed a significant asymmetry in the evolutionary frequencies of polar and nonpolar amino acids. We identified key residue pairs driving this asymmetry, with F <=> Y, A <=> T, V <=> T and A <=> S co-evolution diverging from the expected negative correlations (Spearman's rho > 0.20, p < 0.001). The V <=> T substitution via an alanine intermediate and the G <=> N substitution via a serine intermediate lower their statistical barrier, which would otherwise require two sequential base changes. We propose two evolutionary game theory (EGT) based models to explain their diversification, with partial correlation analysis on residue frequencies in homolog sequences. These mathematical insights suggest a previously unrecognized evolutionary pressure, potentially linked to functional diversification, which could be targeted to combat drug resistance. Our results offer insights into membrane protein evolution and may inform improved methods for protein structure prediction and design.
期刊介绍:
Journal of Molecular Evolution covers experimental, computational, and theoretical work aimed at deciphering features of molecular evolution and the processes bearing on these features, from the initial formation of macromolecular systems through their evolution at the molecular level, the co-evolution of their functions in cellular and organismal systems, and their influence on organismal adaptation, speciation, and ecology. Topics addressed include the evolution of informational macromolecules and their relation to more complex levels of biological organization, including populations and taxa, as well as the molecular basis for the evolution of ecological interactions of species and the use of molecular data to infer fundamental processes in evolutionary ecology. This coverage accommodates such subfields as new genome sequences, comparative structural and functional genomics, population genetics, the molecular evolution of development, the evolution of gene regulation and gene interaction networks, and in vitro evolution of DNA and RNA, molecular evolutionary ecology, and the development of methods and theory that enable molecular evolutionary inference, including but not limited to, phylogenetic methods.