{"title":"Determining the minimum functional length of k-mers: Evidence for the 7-mer threshold from yeast genome","authors":"Deliang Zhou , Jianli Liu , Wen Jin","doi":"10.1016/j.genrep.2025.102309","DOIUrl":null,"url":null,"abstract":"<div><div>k-mers play an important role in the function and evolution of DNA sequences and should have a minimum functional length. Based on the analysis of (1) all core and linker DNA in yeast, and (2) all gene and intergenic sequences in yeast, the minimum functional length of k-mers was determined using the log-ratio of relative frequency. The following conclusions were obtained:<ul><li><span>(1)</span><span><div>7-mers represent the minimum functional length of the basic functional fragments of DNA sequences, while k-mers with <em>k</em> < 7 constitute the basic component fragments of DNA sequences;</div></span></li><li><span>(2)</span><span><div>7-mers are the minimum length k-mers that drive DNA sequences to produce sequence function and evolution;</div></span></li><li><span>(3)</span><span><div>7-mers are the minimum length k-mers that drive the separation of function and evolution among different types of sequences.</div></span></li></ul></div><div>These conclusions represent major theoretical breakthroughs and have significant implications: (1) They support the conclusion from Reference 24 that the minimum functional length of k-mers is 7, fully affirming the accuracy and reliability of this minimum functional length; (2) They elucidate the roles and characteristics of functional and component fragments of DNA sequences, revealing the driving force behind the generation and separation of DNA sequence function and evolution.</div></div>","PeriodicalId":12673,"journal":{"name":"Gene Reports","volume":"41 ","pages":"Article 102309"},"PeriodicalIF":0.9000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gene Reports","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2452014425001827","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
k-mers play an important role in the function and evolution of DNA sequences and should have a minimum functional length. Based on the analysis of (1) all core and linker DNA in yeast, and (2) all gene and intergenic sequences in yeast, the minimum functional length of k-mers was determined using the log-ratio of relative frequency. The following conclusions were obtained:
(1)
7-mers represent the minimum functional length of the basic functional fragments of DNA sequences, while k-mers with k < 7 constitute the basic component fragments of DNA sequences;
(2)
7-mers are the minimum length k-mers that drive DNA sequences to produce sequence function and evolution;
(3)
7-mers are the minimum length k-mers that drive the separation of function and evolution among different types of sequences.
These conclusions represent major theoretical breakthroughs and have significant implications: (1) They support the conclusion from Reference 24 that the minimum functional length of k-mers is 7, fully affirming the accuracy and reliability of this minimum functional length; (2) They elucidate the roles and characteristics of functional and component fragments of DNA sequences, revealing the driving force behind the generation and separation of DNA sequence function and evolution.
Gene ReportsBiochemistry, Genetics and Molecular Biology-Genetics
CiteScore
3.30
自引率
7.70%
发文量
246
审稿时长
49 days
期刊介绍:
Gene Reports publishes papers that focus on the regulation, expression, function and evolution of genes in all biological contexts, including all prokaryotic and eukaryotic organisms, as well as viruses. Gene Reports strives to be a very diverse journal and topics in all fields will be considered for publication. Although not limited to the following, some general topics include: DNA Organization, Replication & Evolution -Focus on genomic DNA (chromosomal organization, comparative genomics, DNA replication, DNA repair, mobile DNA, mitochondrial DNA, chloroplast DNA). Expression & Function - Focus on functional RNAs (microRNAs, tRNAs, rRNAs, mRNA splicing, alternative polyadenylation) Regulation - Focus on processes that mediate gene-read out (epigenetics, chromatin, histone code, transcription, translation, protein degradation). Cell Signaling - Focus on mechanisms that control information flow into the nucleus to control gene expression (kinase and phosphatase pathways controlled by extra-cellular ligands, Wnt, Notch, TGFbeta/BMPs, FGFs, IGFs etc.) Profiling of gene expression and genetic variation - Focus on high throughput approaches (e.g., DeepSeq, ChIP-Seq, Affymetrix microarrays, proteomics) that define gene regulatory circuitry, molecular pathways and protein/protein networks. Genetics - Focus on development in model organisms (e.g., mouse, frog, fruit fly, worm), human genetic variation, population genetics, as well as agricultural and veterinary genetics. Molecular Pathology & Regenerative Medicine - Focus on the deregulation of molecular processes in human diseases and mechanisms supporting regeneration of tissues through pluripotent or multipotent stem cells.