From nonexistence to novel applications: Nullomers and related k-mer based concepts in bioinformatics.

Advances in clinical chemistry Pub Date : 2025-01-01 Epub Date: 2025-07-11 DOI:10.1016/bs.acc.2025.06.009

Candace S Y Chan, Ilias Georgakopoulos-Soares

引用次数: 0

Abstract

Underrepresented k-mer sequences, provide insights into evolutionary constraints, molecular mechanisms, and organismal fitness. Analysis of these sequences have broad applications across genomics and proteomics, such as in biomarker development, cancer diagnostics, phylogenetic analysis, synthetic biology and novel drug discovery. Absent sequences (nullomers and neomers) show promise for cancer detection and tissue-of-origin identification using nucleic acids derived from liquid biopsies, while quasi-primes serve as genomic fingerprints that offer potential for evolutionary studies for understanding trait evolution, and in metagenomics, as biomarkers of organismal presence. The chapter also discusses computational challenges associated with analyzing absent sequences and highlights available k-mer based resources and databases. With the continuous expansion of genomic and proteomic data, absent sequences present an innovative framework for addressing fundamental biological questions and advancing applications in basic and translational research.

查看原文本刊更多论文

从不存在到新的应用：生物信息学中基于k-mer的零分子和相关概念。

未被充分代表的k-mer序列，提供了对进化约束，分子机制和有机体适应性的见解。这些序列的分析在基因组学和蛋白质组学中有着广泛的应用，例如生物标志物开发、癌症诊断、系统发育分析、合成生物学和新药发现。缺失序列（零聚物和新聚物）有望用于癌症检测和利用液体活检获得的核酸进行组织起源鉴定，而准引物作为基因组指纹，为理解性状进化的进化研究提供了潜力，在宏基因组学中，作为生物体存在的生物标志物。本章还讨论了与分析缺失序列相关的计算挑战，并强调了可用的基于k-mer的资源和数据库。随着基因组学和蛋白质组学数据的不断扩展，缺失序列为解决基础生物学问题和推进基础和转化研究中的应用提供了一个创新的框架。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Advances in clinical chemistry

自引率

0.00%

发文量