{"title":"序列依赖的内在无序蛋白质的构象景观揭示了不对称链压实。","authors":"Cong Wang,Bin Zhang","doi":"10.1021/acs.jctc.5c01329","DOIUrl":null,"url":null,"abstract":"Intrinsically disordered proteins (IDPs) exhibit highly dynamic and heterogeneous conformational ensembles that are strongly influenced by sequence features. While global properties such as chain compaction and scaling behavior have been widely studied, they often fail to resolve the fine-grained, sequence-specific structural variation that underlies IDP function. Here, we perform long-time scale atomistic simulations of 47 representative IDP sequences from the yeast proteome to systematically investigate the relationship between sequence composition and conformational ensemble. To analyze the high-dimensional structural data, we apply uniform manifold approximation and projection (UMAP), a nonlinear dimensionality reduction technique that preserves local structural relationships. The resulting low-dimensional embeddings effectively differentiate IDP ensembles and reveal a novel descriptor─local compactness asymmetry─that quantifies directional differences in chain organization. This metric, denoted γRg, captures conformational features orthogonal to traditional global measures such as radius of gyration and end-to-end distance. We show that γRg correlates with sequence-level asymmetries in charge and hydropathy, and that conformational dynamics preferentially occur in the more extended region of the chain. The simulation data set generated in this work also provides a valuable resource for training machine learning models and developing improved coarse-grained force fields for disordered proteins.","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"49 1","pages":""},"PeriodicalIF":5.5000,"publicationDate":"2025-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sequence-Dependent Conformational Landscapes of Intrinsically Disordered Proteins Reveal Asymmetric Chain Compaction.\",\"authors\":\"Cong Wang,Bin Zhang\",\"doi\":\"10.1021/acs.jctc.5c01329\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Intrinsically disordered proteins (IDPs) exhibit highly dynamic and heterogeneous conformational ensembles that are strongly influenced by sequence features. While global properties such as chain compaction and scaling behavior have been widely studied, they often fail to resolve the fine-grained, sequence-specific structural variation that underlies IDP function. Here, we perform long-time scale atomistic simulations of 47 representative IDP sequences from the yeast proteome to systematically investigate the relationship between sequence composition and conformational ensemble. To analyze the high-dimensional structural data, we apply uniform manifold approximation and projection (UMAP), a nonlinear dimensionality reduction technique that preserves local structural relationships. The resulting low-dimensional embeddings effectively differentiate IDP ensembles and reveal a novel descriptor─local compactness asymmetry─that quantifies directional differences in chain organization. This metric, denoted γRg, captures conformational features orthogonal to traditional global measures such as radius of gyration and end-to-end distance. We show that γRg correlates with sequence-level asymmetries in charge and hydropathy, and that conformational dynamics preferentially occur in the more extended region of the chain. The simulation data set generated in this work also provides a valuable resource for training machine learning models and developing improved coarse-grained force fields for disordered proteins.\",\"PeriodicalId\":45,\"journal\":{\"name\":\"Journal of Chemical Theory and Computation\",\"volume\":\"49 1\",\"pages\":\"\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Theory and Computation\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jctc.5c01329\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Theory and Computation","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jctc.5c01329","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
Intrinsically disordered proteins (IDPs) exhibit highly dynamic and heterogeneous conformational ensembles that are strongly influenced by sequence features. While global properties such as chain compaction and scaling behavior have been widely studied, they often fail to resolve the fine-grained, sequence-specific structural variation that underlies IDP function. Here, we perform long-time scale atomistic simulations of 47 representative IDP sequences from the yeast proteome to systematically investigate the relationship between sequence composition and conformational ensemble. To analyze the high-dimensional structural data, we apply uniform manifold approximation and projection (UMAP), a nonlinear dimensionality reduction technique that preserves local structural relationships. The resulting low-dimensional embeddings effectively differentiate IDP ensembles and reveal a novel descriptor─local compactness asymmetry─that quantifies directional differences in chain organization. This metric, denoted γRg, captures conformational features orthogonal to traditional global measures such as radius of gyration and end-to-end distance. We show that γRg correlates with sequence-level asymmetries in charge and hydropathy, and that conformational dynamics preferentially occur in the more extended region of the chain. The simulation data set generated in this work also provides a valuable resource for training machine learning models and developing improved coarse-grained force fields for disordered proteins.
期刊介绍:
The Journal of Chemical Theory and Computation invites new and original contributions with the understanding that, if accepted, they will not be published elsewhere. Papers reporting new theories, methodology, and/or important applications in quantum electronic structure, molecular dynamics, and statistical mechanics are appropriate for submission to this Journal. Specific topics include advances in or applications of ab initio quantum mechanics, density functional theory, design and properties of new materials, surface science, Monte Carlo simulations, solvation models, QM/MM calculations, biomolecular structure prediction, and molecular dynamics in the broadest sense including gas-phase dynamics, ab initio dynamics, biomolecular dynamics, and protein folding. The Journal does not consider papers that are straightforward applications of known methods including DFT and molecular dynamics. The Journal favors submissions that include advances in theory or methodology with applications to compelling problems.