人类染色体序列的多分形特性

J. P. Correia, R. Silva, D. H. A. L. Anselmo, M. S. Vasconcelos, L. R. da Silva
{"title":"人类染色体序列的多分形特性","authors":"J. P. Correia, R. Silva, D. H. A. L. Anselmo, M. S. Vasconcelos, L. R. da Silva","doi":"10.3390/fractalfract8060312","DOIUrl":null,"url":null,"abstract":"The intricacy and fractal properties of human DNA sequences are examined in this work. The core of this study is to discern whether complete DNA sequences present distinct complexity and fractal attributes compared with sequences containing exclusively exon regions. In this regard, the entire base pair sequences of DNA are extracted from the NCBI (National Center for Biotechnology Information) database. In order to create a time series representation for the base pair sequence {G,C,T,A}, we use the Chaos Game Representation (CGR) approach and a mapping rule f, which enables us to apply the metric known as the Complexity–Entropy Plane (CEP) and multifractal detrended fluctuation analysis (MF-DFA). To carry out our investigation, we divided human DNA into two groups: the first is composed of the 24 chromosomes, which comprises all the base pairs that form the DNA sequence, and another group that also includes the 24 chromosomes, but the DNA sequences rely only on the exons’ presence. The results show that both sets provide fractal patterns in their structure, as obtained by the CGR approach. Complete DNA sequences show a sharper visual fractal pattern than sequences composed only of exons. Moreover, the sequences occupy distinct areas of the complexity–entropy plane, and the complete DNA sequences lead to greater statistical complexity and lower entropy than the exon sequences. Also, we observed that different fractal parameters between chromosomes indicate diversity in genomic sequences. All these results occur in different scales for all chromosomes.","PeriodicalId":510138,"journal":{"name":"Fractal and Fractional","volume":"35 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multifractal Properties of Human Chromosome Sequences\",\"authors\":\"J. P. Correia, R. Silva, D. H. A. L. Anselmo, M. S. Vasconcelos, L. R. da Silva\",\"doi\":\"10.3390/fractalfract8060312\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The intricacy and fractal properties of human DNA sequences are examined in this work. The core of this study is to discern whether complete DNA sequences present distinct complexity and fractal attributes compared with sequences containing exclusively exon regions. In this regard, the entire base pair sequences of DNA are extracted from the NCBI (National Center for Biotechnology Information) database. In order to create a time series representation for the base pair sequence {G,C,T,A}, we use the Chaos Game Representation (CGR) approach and a mapping rule f, which enables us to apply the metric known as the Complexity–Entropy Plane (CEP) and multifractal detrended fluctuation analysis (MF-DFA). To carry out our investigation, we divided human DNA into two groups: the first is composed of the 24 chromosomes, which comprises all the base pairs that form the DNA sequence, and another group that also includes the 24 chromosomes, but the DNA sequences rely only on the exons’ presence. The results show that both sets provide fractal patterns in their structure, as obtained by the CGR approach. Complete DNA sequences show a sharper visual fractal pattern than sequences composed only of exons. Moreover, the sequences occupy distinct areas of the complexity–entropy plane, and the complete DNA sequences lead to greater statistical complexity and lower entropy than the exon sequences. Also, we observed that different fractal parameters between chromosomes indicate diversity in genomic sequences. All these results occur in different scales for all chromosomes.\",\"PeriodicalId\":510138,\"journal\":{\"name\":\"Fractal and Fractional\",\"volume\":\"35 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fractal and Fractional\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/fractalfract8060312\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fractal and Fractional","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/fractalfract8060312","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

这项研究探讨了人类 DNA 序列的复杂性和分形属性。这项研究的核心在于,与只包含外显子区域的序列相比,完整的 DNA 序列是否呈现出不同的复杂性和分形属性。为此,我们从 NCBI(美国国家生物技术信息中心)数据库中提取了 DNA 的整个碱基对序列。为了创建碱基对序列{G,C,T,A}的时间序列表示,我们使用了混沌博弈表示(CGR)方法和映射规则f,这使我们能够应用称为复杂性-熵平面(CEP)的度量和多分形去趋势波动分析(MF-DFA)。为了进行研究,我们将人类 DNA 分成两组:一组由 24 条染色体组成,包括构成 DNA 序列的所有碱基对;另一组也包括 24 条染色体,但 DNA 序列只依赖于外显子的存在。结果表明,这两组序列的结构都具有分形模式,正如 CGR 方法所获得的那样。与仅由外显子组成的序列相比,完整的 DNA 序列显示出更清晰的视觉分形模式。此外,这两种序列占据了复杂度-熵平面的不同区域,与外显子序列相比,完整的 DNA 序列具有更高的统计复杂度和更低的熵。我们还观察到,染色体之间不同的分形参数表明基因组序列具有多样性。所有这些结果出现在所有染色体的不同尺度上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multifractal Properties of Human Chromosome Sequences
The intricacy and fractal properties of human DNA sequences are examined in this work. The core of this study is to discern whether complete DNA sequences present distinct complexity and fractal attributes compared with sequences containing exclusively exon regions. In this regard, the entire base pair sequences of DNA are extracted from the NCBI (National Center for Biotechnology Information) database. In order to create a time series representation for the base pair sequence {G,C,T,A}, we use the Chaos Game Representation (CGR) approach and a mapping rule f, which enables us to apply the metric known as the Complexity–Entropy Plane (CEP) and multifractal detrended fluctuation analysis (MF-DFA). To carry out our investigation, we divided human DNA into two groups: the first is composed of the 24 chromosomes, which comprises all the base pairs that form the DNA sequence, and another group that also includes the 24 chromosomes, but the DNA sequences rely only on the exons’ presence. The results show that both sets provide fractal patterns in their structure, as obtained by the CGR approach. Complete DNA sequences show a sharper visual fractal pattern than sequences composed only of exons. Moreover, the sequences occupy distinct areas of the complexity–entropy plane, and the complete DNA sequences lead to greater statistical complexity and lower entropy than the exon sequences. Also, we observed that different fractal parameters between chromosomes indicate diversity in genomic sequences. All these results occur in different scales for all chromosomes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信