The grand biological universe: A comprehensive geometric construction of genome space.

IF 25.7 · Zone 1 (comprehensive journals) · JCR Q1, MULTIDISCIPLINARY SCIENCES
The Innovation Pub Date : 2025-04-30 eCollection Date: 2025-08-04 DOI:10.1016/j.xinn.2025.100937
Hongyu Yu, Nan Sun, Ruohan Ren, Tao Zhou, Mengcen Guan, Leqi Zhao, Stephen S-T Yau
The Innovation, vol. 6, no. 8, article 100937. Publication type: Journal Article. Full-text PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12347096/pdf/
Citations: 0

Abstract

Analyzing the geometric relationships among genomic sequences from a mathematical perspective and revealing the laws hidden within these relationships is a crucial challenge in bioinformatics. The natural vector method constructs a genome space by extracting statistical moments of k-mers to illuminate the relationships among genomes. This approach highlights a fundamental law in biology known as the convex hull principle, which states that natural vectors corresponding to different types of biological sequences form distinct, non-overlapping convex hulls. Previous studies have validated this important principle across various datasets. However, they often focused on specific kingdoms and did not thoroughly analyze the significance of the dimensions required for the convex hull separation. In this study, we integrate all reliable sequences from different kingdoms to construct the grand biological universe, within which we comprehensively validate the multi-level convex hull principle. We demonstrate that the separation of convex hulls arises from biological properties rather than mathematical characteristics of high-dimensional spaces. Furthermore, we develop suitable metrics within the grand biological universe to facilitate efficient sequence classification. This research advances the convex hull principle through both theoretical development and experimental validation, making significant contributions to the understanding of the geometric structure of genome space.
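
To make the idea of "statistical moments of k-mers" concrete, here is a minimal sketch of a natural-vector-style computation. The abstract does not give the paper's exact definitions, so the function name `natural_vector`, the 1-based positions, and the normalization of the second moment by (count × number of k-mer positions) are assumptions based on one common formulation of the natural vector, not the authors' implementation.

```python
from collections import defaultdict


def natural_vector(seq, k=1):
    """Return {kmer: (count, mean position, normalized 2nd central moment)}.

    Hypothetical helper for illustration only; the normalization used
    here is an assumption, not taken from the paper.
    """
    seq = seq.upper()
    positions = defaultdict(list)
    # Record the 1-based start position of every k-mer occurrence.
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i + 1)

    n = len(seq) - k + 1  # total number of k-mer positions in the sequence
    vector = {}
    for kmer, pos in sorted(positions.items()):
        n_k = len(pos)                       # zeroth moment: occurrence count
        mu_k = sum(pos) / n_k                # first moment: mean position
        # Second central moment, scaled by n_k * n (assumed normalization).
        d2_k = sum((p - mu_k) ** 2 for p in pos) / (n_k * n)
        vector[kmer] = (n_k, mu_k, d2_k)
    return vector


if __name__ == "__main__":
    for kmer, moments in natural_vector("ACGTA").items():
        print(kmer, moments)
```

Under these assumptions, each sequence becomes a point whose coordinates are the per-k-mer counts, mean positions, and higher moments. The convex hull principle described above is then a claim about the hulls spanned by such points for different groups of sequences.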

Source journal
The Innovation (MULTIDISCIPLINARY SCIENCES)
CiteScore: 38.30
Self-citation rate: 1.20%
Articles published: 134
Review time: 6 weeks
About the journal: The Innovation is an interdisciplinary journal that aims to promote scientific application. It publishes cutting-edge research and high-quality reviews in various scientific disciplines, including physics, chemistry, materials, nanotechnology, biology, translational medicine, geoscience, and engineering. The journal adheres to the peer review and publishing standards of Cell Press journals. The Innovation is committed to serving scientists and the public. It aims to publish significant advances promptly and provides a transparent exchange platform. The journal also strives to efficiently promote the translation from scientific discovery to technological achievements and rapidly disseminate scientific findings worldwide. The Innovation is indexed in Scopus, Directory of Open Access Journals (DOAJ), Web of Science, Emerging Sources Citation Index (ESCI), PubMed Central, Compendex (previously Ei index), INSPEC, and CABI A&I.