{"title":"DR and SPIT: Statistical approaches for identifying transient structure in intrinsically disordered proteins via NMR chemical shifts.","authors":"Dániel Kovács, Andrea Bodor","doi":"10.1002/pro.70250","DOIUrl":null,"url":null,"abstract":"<p><p>Intrinsically disordered proteins (IDPs) play key roles in various biological processes; they are associated with liquid-liquid phase separation and are targets in disorder-based drug design. Efforts to identify their structural propensities-that can be linked to molecular recognition, malfunction, targeting-still lead to ambiguous results. Secondary structure is routinely assessed by NMR spectroscopy by calculating the secondary chemical shifts (SCSs). Focusing on a given environment in the polypeptide backbone, SCSs highlight the deviation from the \"random coil\" state. However, the analysis is dependent on which of the numerous random coil chemical shift (RCCS) predictors is applied in the calculations, resulting in an especially pronounced ambiguity for IDPs. To overcome this, we introduce two novel statistical tools that enable the sound identification of structural propensities. We propose the chemical shift discordance ratio (DR) for prefiltering RCCS predictors based on self-consistency. Further on, we introduce the Structural Propensity Identification by t-statistics (SPIT) approach for extracting maximum information from SCS data by using multiple RCCS predictors simultaneously. This way SCS patterns indicating structural propensities can be clearly distinguished from the \"noise\". The applicability of these methods is demonstrated for four proteins of varying degrees of disorder. Ubiquitin and α-synuclein are used as respective benchmarks for a globular and a disordered protein, while two proline-rich IDPs are included as especially challenging molecules in secondary structure analysis.</p>","PeriodicalId":20761,"journal":{"name":"Protein Science","volume":"34 9","pages":"e70250"},"PeriodicalIF":5.2000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12356139/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Protein Science","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/pro.70250","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Intrinsically disordered proteins (IDPs) play key roles in various biological processes; they are associated with liquid-liquid phase separation and are targets in disorder-based drug design. Efforts to identify their structural propensities-that can be linked to molecular recognition, malfunction, targeting-still lead to ambiguous results. Secondary structure is routinely assessed by NMR spectroscopy by calculating the secondary chemical shifts (SCSs). Focusing on a given environment in the polypeptide backbone, SCSs highlight the deviation from the "random coil" state. However, the analysis is dependent on which of the numerous random coil chemical shift (RCCS) predictors is applied in the calculations, resulting in an especially pronounced ambiguity for IDPs. To overcome this, we introduce two novel statistical tools that enable the sound identification of structural propensities. We propose the chemical shift discordance ratio (DR) for prefiltering RCCS predictors based on self-consistency. Further on, we introduce the Structural Propensity Identification by t-statistics (SPIT) approach for extracting maximum information from SCS data by using multiple RCCS predictors simultaneously. This way SCS patterns indicating structural propensities can be clearly distinguished from the "noise". The applicability of these methods is demonstrated for four proteins of varying degrees of disorder. Ubiquitin and α-synuclein are used as respective benchmarks for a globular and a disordered protein, while two proline-rich IDPs are included as especially challenging molecules in secondary structure analysis.
期刊介绍:
Protein Science, the flagship journal of The Protein Society, is a publication that focuses on advancing fundamental knowledge in the field of protein molecules. The journal welcomes original reports and review articles that contribute to our understanding of protein function, structure, folding, design, and evolution.
Additionally, Protein Science encourages papers that explore the applications of protein science in various areas such as therapeutics, protein-based biomaterials, bionanotechnology, synthetic biology, and bioelectronics.
The journal accepts manuscript submissions in any suitable format for review, with the requirement of converting the manuscript to journal-style format only upon acceptance for publication.
Protein Science is indexed and abstracted in numerous databases, including the Agricultural & Environmental Science Database (ProQuest), Biological Science Database (ProQuest), CAS: Chemical Abstracts Service (ACS), Embase (Elsevier), Health & Medical Collection (ProQuest), Health Research Premium Collection (ProQuest), Materials Science & Engineering Database (ProQuest), MEDLINE/PubMed (NLM), Natural Science Collection (ProQuest), and SciTech Premium Collection (ProQuest).