{"title":"通过二级结构聚类DNA和RNA分子动力学集成。","authors":"Swapnil Baral,Michael Zwolak","doi":"10.1016/j.bpj.2025.08.029","DOIUrl":null,"url":null,"abstract":"Macromolecular structure is central to biology. Yet, not all biomolecules have a well-defined fold. Intrinsically disordered regions are ubiquitous, conveying a versatility to function even in otherwise folded structures. For nucleic acids, entropic disorder is manifest in regions of incomplete base pairing (e.g., during transcription) and for long molecules (i.e., beyond the persistence length). To classify the resulting ensembles, we develop a method to cluster based on secondary structure, focusing specifically on DNA and RNA. The number of base pairs to reorganize furnishes a proper distance metric for structures of the same topology (e.g., without knots). This permits clustering of any type, from k-means to hierarchical to density-based methods. We demonstrate this by showing the broad distribution of secondary structure of a fragment of the M13 bacteriophage DNA and by revealing hidden order in a RNA Holliday junction. This clustering approach is connected to energy barriers from disrupting hybridization and recognizes structures that differ only by, e.g., internal reorientation as the same, compressing the vast free-energy landscape from entropic disorder.","PeriodicalId":8922,"journal":{"name":"Biophysical journal","volume":"11 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Clustering DNA and RNA molecular dynamics ensembles via secondary structure.\",\"authors\":\"Swapnil Baral,Michael Zwolak\",\"doi\":\"10.1016/j.bpj.2025.08.029\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Macromolecular structure is central to biology. Yet, not all biomolecules have a well-defined fold. Intrinsically disordered regions are ubiquitous, conveying a versatility to function even in otherwise folded structures. For nucleic acids, entropic disorder is manifest in regions of incomplete base pairing (e.g., during transcription) and for long molecules (i.e., beyond the persistence length). To classify the resulting ensembles, we develop a method to cluster based on secondary structure, focusing specifically on DNA and RNA. The number of base pairs to reorganize furnishes a proper distance metric for structures of the same topology (e.g., without knots). This permits clustering of any type, from k-means to hierarchical to density-based methods. We demonstrate this by showing the broad distribution of secondary structure of a fragment of the M13 bacteriophage DNA and by revealing hidden order in a RNA Holliday junction. This clustering approach is connected to energy barriers from disrupting hybridization and recognizes structures that differ only by, e.g., internal reorientation as the same, compressing the vast free-energy landscape from entropic disorder.\",\"PeriodicalId\":8922,\"journal\":{\"name\":\"Biophysical journal\",\"volume\":\"11 1\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biophysical journal\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1016/j.bpj.2025.08.029\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biophysical journal","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.bpj.2025.08.029","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOPHYSICS","Score":null,"Total":0}
Clustering DNA and RNA molecular dynamics ensembles via secondary structure.
Macromolecular structure is central to biology. Yet, not all biomolecules have a well-defined fold. Intrinsically disordered regions are ubiquitous, conveying a versatility to function even in otherwise folded structures. For nucleic acids, entropic disorder is manifest in regions of incomplete base pairing (e.g., during transcription) and for long molecules (i.e., beyond the persistence length). To classify the resulting ensembles, we develop a method to cluster based on secondary structure, focusing specifically on DNA and RNA. The number of base pairs to reorganize furnishes a proper distance metric for structures of the same topology (e.g., without knots). This permits clustering of any type, from k-means to hierarchical to density-based methods. We demonstrate this by showing the broad distribution of secondary structure of a fragment of the M13 bacteriophage DNA and by revealing hidden order in a RNA Holliday junction. This clustering approach is connected to energy barriers from disrupting hybridization and recognizes structures that differ only by, e.g., internal reorientation as the same, compressing the vast free-energy landscape from entropic disorder.
期刊介绍:
BJ publishes original articles, letters, and perspectives on important problems in modern biophysics. The papers should be written so as to be of interest to a broad community of biophysicists. BJ welcomes experimental studies that employ quantitative physical approaches for the study of biological systems, including or spanning scales from molecule to whole organism. Experimental studies of a purely descriptive or phenomenological nature, with no theoretical or mechanistic underpinning, are not appropriate for publication in BJ. Theoretical studies should offer new insights into the understanding ofexperimental results or suggest new experimentally testable hypotheses. Articles reporting significant methodological or technological advances, which have potential to open new areas of biophysical investigation, are also suitable for publication in BJ. Papers describing improvements in accuracy or speed of existing methods or extra detail within methods described previously are not suitable for BJ.