{"title":"From nonexistence to novel applications: Nullomers and related k-mer based concepts in bioinformatics.","authors":"Candace S Y Chan, Ilias Georgakopoulos-Soares","doi":"10.1016/bs.acc.2025.06.009","DOIUrl":null,"url":null,"abstract":"<p><p>Underrepresented k-mer sequences, provide insights into evolutionary constraints, molecular mechanisms, and organismal fitness. Analysis of these sequences have broad applications across genomics and proteomics, such as in biomarker development, cancer diagnostics, phylogenetic analysis, synthetic biology and novel drug discovery. Absent sequences (nullomers and neomers) show promise for cancer detection and tissue-of-origin identification using nucleic acids derived from liquid biopsies, while quasi-primes serve as genomic fingerprints that offer potential for evolutionary studies for understanding trait evolution, and in metagenomics, as biomarkers of organismal presence. The chapter also discusses computational challenges associated with analyzing absent sequences and highlights available k-mer based resources and databases. With the continuous expansion of genomic and proteomic data, absent sequences present an innovative framework for addressing fundamental biological questions and advancing applications in basic and translational research.</p>","PeriodicalId":101297,"journal":{"name":"Advances in clinical chemistry","volume":"129 ","pages":"191-206"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in clinical chemistry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/bs.acc.2025.06.009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/11 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Underrepresented k-mer sequences, provide insights into evolutionary constraints, molecular mechanisms, and organismal fitness. Analysis of these sequences have broad applications across genomics and proteomics, such as in biomarker development, cancer diagnostics, phylogenetic analysis, synthetic biology and novel drug discovery. Absent sequences (nullomers and neomers) show promise for cancer detection and tissue-of-origin identification using nucleic acids derived from liquid biopsies, while quasi-primes serve as genomic fingerprints that offer potential for evolutionary studies for understanding trait evolution, and in metagenomics, as biomarkers of organismal presence. The chapter also discusses computational challenges associated with analyzing absent sequences and highlights available k-mer based resources and databases. With the continuous expansion of genomic and proteomic data, absent sequences present an innovative framework for addressing fundamental biological questions and advancing applications in basic and translational research.