{"title":"Comparative analysis of predicted DNA secondary structures infers complex human centromere topology.","authors":"Sai Swaroop Chittoor, Simona Giunta","doi":"10.1016/j.ajhg.2024.10.016","DOIUrl":null,"url":null,"abstract":"<p><p>Secondary structures are non-canonical arrangements of nucleic acids due to intra-strand interactions, including base pairing, stacking, or other higher-order features that deviate from the standard double-helical conformation. While these structures are extensively studied in RNA, they can also form when DNA becomes single stranded, creating topological roadblocks that can impact essential DNA-based processes such as replication, transcription, and repair, ultimately affecting genome stability. The availability of a complete linear sequence of human genomes, including repetitive loci, enables the prediction of DNA secondary structures comparing across various regions. Here, we evaluate the intrinsic properties of linear single-stranded DNA sequences derived from sampling specialized human loci such as centromeres, pericentromeres, ribosomal DNA (rDNA), and coding regions from the CHM13 genome. Our comparative analysis of predicted secondary structures across human chromosomes revealed the heightened presence, complexity, and instability of secondary structures within the centromere, which gradually decreased toward the pericentromere onto chromosomes' arms, on average lowest in coding regions. Notably, centromeric repeats exhibited the highest level of topological complexity within both the active and divergent domains, even when compared to other repetitive tandem satellites, such as rDNA in acrocentric chromosomes. Our findings provide evidence of the intrinsic self-hybridizing properties of centromere repeats, which are capable of generating complex topological structures that may functionally correlate with chromosome missegregation, especially when centromeric chromatin is disrupted. Processes such as long non-coding RNA transcription, recombination, and other mechanisms that dechromatinize and unwind stretches of linear DNA in these regions create in vivo opportunities for the DNA acrobatics hereby predicted.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":""},"PeriodicalIF":8.1000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of human genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.ajhg.2024.10.016","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Secondary structures are non-canonical arrangements of nucleic acids due to intra-strand interactions, including base pairing, stacking, or other higher-order features that deviate from the standard double-helical conformation. While these structures are extensively studied in RNA, they can also form when DNA becomes single stranded, creating topological roadblocks that can impact essential DNA-based processes such as replication, transcription, and repair, ultimately affecting genome stability. The availability of a complete linear sequence of human genomes, including repetitive loci, enables the prediction of DNA secondary structures comparing across various regions. Here, we evaluate the intrinsic properties of linear single-stranded DNA sequences derived from sampling specialized human loci such as centromeres, pericentromeres, ribosomal DNA (rDNA), and coding regions from the CHM13 genome. Our comparative analysis of predicted secondary structures across human chromosomes revealed the heightened presence, complexity, and instability of secondary structures within the centromere, which gradually decreased toward the pericentromere onto chromosomes' arms, on average lowest in coding regions. Notably, centromeric repeats exhibited the highest level of topological complexity within both the active and divergent domains, even when compared to other repetitive tandem satellites, such as rDNA in acrocentric chromosomes. Our findings provide evidence of the intrinsic self-hybridizing properties of centromere repeats, which are capable of generating complex topological structures that may functionally correlate with chromosome missegregation, especially when centromeric chromatin is disrupted. Processes such as long non-coding RNA transcription, recombination, and other mechanisms that dechromatinize and unwind stretches of linear DNA in these regions create in vivo opportunities for the DNA acrobatics hereby predicted.
二级结构是核酸因链内相互作用(包括碱基配对、堆叠或其他偏离标准双螺旋构象的高阶特征)而形成的非规范排列。虽然这些结构在 RNA 中被广泛研究,但当 DNA 变为单链时也会形成这些结构,从而产生拓扑路障,影响以 DNA 为基础的基本过程,如复制、转录和修复,最终影响基因组的稳定性。人类基因组包括重复位点在内的完整线性序列的出现,使得对不同区域的 DNA 二级结构进行比较预测成为可能。在这里,我们评估了线性单链 DNA 序列的内在特性,这些序列来自于取样专门的人类基因座,如中心粒、周中心粒、核糖体 DNA (rDNA) 和 CHM13 基因组的编码区。我们对人类染色体上预测的二级结构进行了比较分析,发现中心粒内二级结构的存在性、复杂性和不稳定性都很高,向染色体臂的近中心粒方向逐渐降低,平均而言,编码区的二级结构最低。值得注意的是,即使与其他重复串联卫星(如非中心染色体中的 rDNA)相比,中心粒重复序列在活跃域和发散域内都表现出最高的拓扑复杂性。我们的研究结果提供了中心粒重复序列内在自杂交特性的证据,它能够产生复杂的拓扑结构,在功能上可能与染色体错分离相关,尤其是当中心粒染色质被破坏时。长非编码 RNA 转录、重组等过程,以及其他使这些区域的线性 DNA 片段脱染色质和解旋的机制,为此处预测的 DNA 杂技表演创造了活体机会。
期刊介绍:
The American Journal of Human Genetics (AJHG) is a monthly journal published by Cell Press, chosen by The American Society of Human Genetics (ASHG) as its premier publication starting from January 2008. AJHG represents Cell Press's first society-owned journal, and both ASHG and Cell Press anticipate significant synergies between AJHG content and that of other Cell Press titles.