Genome BiologyPub Date : 2025-09-22DOI: 10.1186/s13059-025-03798-x
Bruce Budowle, Kristen Mittelman, David Mittelman
{"title":"Genomics will forever reshape forensic science and criminal justice","authors":"Bruce Budowle, Kristen Mittelman, David Mittelman","doi":"10.1186/s13059-025-03798-x","DOIUrl":"https://doi.org/10.1186/s13059-025-03798-x","url":null,"abstract":"Dense single nucleotide polymorphism testing has revolutionized forensic science, helping solve decadesold, current and future cases by overcoming the limitations of traditional short tandem repeat profiling. By embracing innovations from fields such as ancient DNA analysis, forensics can deliver long-awaited answers and justice to victims and their families.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"65 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145103776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-09-22DOI: 10.1186/s13059-025-03780-7
Pengxiao Li, Lin Li, Jingminjie Nan, Jiahuan Chen, Jielin Sun, Yanan Cao
{"title":"KEGNI: knowledge graph enhanced framework for gene regulatory network inference","authors":"Pengxiao Li, Lin Li, Jingminjie Nan, Jiahuan Chen, Jielin Sun, Yanan Cao","doi":"10.1186/s13059-025-03780-7","DOIUrl":"https://doi.org/10.1186/s13059-025-03780-7","url":null,"abstract":"Inference of cell type-specific gene regulatory networks (GRNs) is a fundamental step in investigating complex regulatory mechanisms. Here, we present KEGNI (Knowledge graph-Enhanced Gene regulatory Network Inference), a knowledge-guided framework that employs a graph autoencoder to capture gene regulatory relationships and incorporates a knowledge graph to infer GRNs based on scRNA-seq data. KEGNI shows superior performance compared to multiple methods using scRNA-seq data or paired scRNA-seq and scATAC-seq data. KEGNI can identify driver genes and elucidate the regulatory mechanisms underlying distinct cellular contexts. The modular design of KEGNI supports the integration of various knowledge graphs for context-specific tasks.\u0000","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"85 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145103777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deciphering the sequence basis and application of transcriptional initiation regulation in plant genomes through deep learning","authors":"Pengfei Gao, Lijie Lian, Wanjie Feng, Yuxue Ma, Jieni Lin, Liya Qin, Shanmeng Hao, Haonan Zhao, Xuantong Liu, Jing Yuan, Zongcheng Lin, Xia Li, Yuefeng Guan, Xutong Wang","doi":"10.1186/s13059-025-03782-5","DOIUrl":"https://doi.org/10.1186/s13059-025-03782-5","url":null,"abstract":"Transcription initiation is a key checkpoint in plant gene regulation, yet the DNA features that determine where and the frequency of the genes start transcription remain unclear. We develop GenoRetriever, an interpretable deep learning model trained on base pair resolution STRIPE-seq data from multiple crop genomes, to systematically reveal and quantify the sequence code that governs transcription start sites (TSSs). Using TSS profiles from 16 soybean tissues and six additional crops, GenoRetriever identifies 27 core promoter motifs, including canonical TATA box and initiator elements, that together dictate TSS choice and activity. Model interpretation shows how each motif modulates both initiation frequency and precise start site position; these effects are confirmed by in silico motif edits, saturation mutagenesis, and targeted promoter assays. A new telomere-to-telomere assembly of wild soybean, Glycine soja, reveals that 31.85% of natural promoter variants shift dominant motifs relative to cultivated soybean, explaining domestication-driven changes in transcriptional regulation. Cross-species comparisons further indicate that, although many motif functions are conserved, monocots and dicots display distinct motif frequencies and positional preferences. GenoRetriever provides an interpretable, cross species framework for decoding transcription initiation in plants. By linking specific sequence motifs to quantitative transcriptional outcomes and validating these links experimentally, our study advances fundamental knowledge of promoter architecture and supplies a practical platform for rational engineering of gene expression in crop improvement and functional genomics.\u0000","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"32 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145103796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-09-22DOI: 10.1186/s13059-025-03763-8
Jonas Scheid, Steffen Lemke, Naomi Hoenisch-Gravel, Anna Dengler, Timo Sachsenberg, Arthur Declerq, Ralf Gabriels, Jens Bauer, Marcel Wacker, Leon Bichmann, Lennart Martens, Marissa L. Dubbelaar, Sven Nahnsen, Juliane S. Walz
{"title":"MHCquant2 refines immunopeptidomics tumor antigen discovery","authors":"Jonas Scheid, Steffen Lemke, Naomi Hoenisch-Gravel, Anna Dengler, Timo Sachsenberg, Arthur Declerq, Ralf Gabriels, Jens Bauer, Marcel Wacker, Leon Bichmann, Lennart Martens, Marissa L. Dubbelaar, Sven Nahnsen, Juliane S. Walz","doi":"10.1186/s13059-025-03763-8","DOIUrl":"https://doi.org/10.1186/s13059-025-03763-8","url":null,"abstract":"Confident identification of human leukocyte antigen (HLA)-presented peptides is crucial for advancing cancer immunotherapy. We present MHCquant2, a scalable and modular Nextflow pipeline integrated into nf-core as a reproducible, portable, and standardized workflow for immunopeptidomics. This integration allows a community-driven, robust solution for high-throughput analyses across operating systems and cloud infrastructures. MHCquant2 integrates open-source tools including OpenMS, DeepLC, and MS2PIP, improving peptide identifications by up to 27% across diverse MS platforms, particularly enriching low-abundant peptides. MHCquant2 demonstrates state-of-the-art performance on our novel benignMHCquant2 dataset (n = 92) and expands the benign human immunopeptidome by over 160,000 unique naturally presented HLA peptides.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"21 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145103797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-09-22DOI: 10.1186/s13059-025-03768-3
Marion Pitz, Jutta A. Baldauf, Hans-Peter Piepho, Peng Yu, Heiko Schoof, Annaliese S. Mason, Guoliang Li, Frank Hochholdinger
{"title":"Regulation of heterosis-associated gene expression complementation in maize hybrids","authors":"Marion Pitz, Jutta A. Baldauf, Hans-Peter Piepho, Peng Yu, Heiko Schoof, Annaliese S. Mason, Guoliang Li, Frank Hochholdinger","doi":"10.1186/s13059-025-03768-3","DOIUrl":"https://doi.org/10.1186/s13059-025-03768-3","url":null,"abstract":"Classical genetic concepts to explain heterosis attribute the superiority of F1-hybrids over their homozygous parents to the complementation of unfavorable by beneficial alleles (dominance) or to heterozygote advantage (overdominance). Here we analyze 112 intermated B73xMo17 recombinant inbred lines of maize and their backcrosses to their original parents B73 and Mo17 to obtain hybrids with an average heterozygosity of ~ 50%. This genetic architecture allows studying the influence of homozygous and heterozygous genomic regions on gene expression in hybrids. We demonstrate that single parent expression (SPE) complementation explains between − 8% and 29% of the mid-parent heterotic variance in these hybrids. In this expression pattern, consistent with dominance, genes are active in only one parent and in the hybrid, thus increasing the number of expressed genes in hybrids. Furthermore, we establish that eQTL regulating SPE genes are predominantly located in heterozygous regions of the genome. Finally, we identify an SPE gene that regulates lateral root density in hybrids. Remarkably, the activity of this gene depends on the presence of a Mo17 allele in an eQTL that regulates this gene. Here we show that dominance of SPE genes influences the number of active genes in hybrids, while heterozygosity is instrumental for the regulation of these genes. This finding supports the notion that the genetic constitution of distant regulatory elements is instrumental for the activity of heterosis-associated genes. In summary, our results connect genetic variation at regulatory loci and the degree of heterozygosity with phenotypic variation of heterosis via SPE complementation.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"195 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145103684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SIGEL: a context-aware genomic representation learning framework for spatial genomics analysis","authors":"Wenlin Li, Maocheng Zhu, Yucheng Xu, Mengqian Huang, Ziyi Wang, Jing Chen, Hao Wu, Xiaobo Sun","doi":"10.1186/s13059-025-03748-7","DOIUrl":"https://doi.org/10.1186/s13059-025-03748-7","url":null,"abstract":"Spatial transcriptomics (ST) integrates spatial information into genomics, yet methods for generating spatially-informed gene representations are limited and computationally intensive. We present SIGEL, a cost-effective framework that derives gene manifolds from ST data by exploiting spatial genomic context. The resulting SIGEL-generated gene representations (SGRs) are context-aware, biologically meaningful, and robust across samples, making them highly effective for key downstream tasks, including imputing missing genes, detecting spatial expression patterns, identifying disease-related genes and interactions, and improving spatial clustering. Extensive experiments across diverse ST datasets validate SIGEL’s effectiveness and highlight its potential in advancing spatial genomics research.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"78 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145103801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-09-22DOI: 10.1186/s13059-025-03783-4
Linh Nguyen, Jae Young Choi
{"title":"Topsicle: a method for estimating telomere length from whole genome long-read sequencing data","authors":"Linh Nguyen, Jae Young Choi","doi":"10.1186/s13059-025-03783-4","DOIUrl":"https://doi.org/10.1186/s13059-025-03783-4","url":null,"abstract":"Telomeres protect chromosome ends and their length varies significantly between organisms. Because telomere length variation is associated with various biomedical and eco-evolutionary phenotypes, many biological fields are interested in understanding its biological significance. Here, we introduce Topsicle, a computational method that estimates telomere length from whole genome long-read sequencing data using k-mer and change-point detection analysis. Simulations show Topsicle is robust to sequencing errors and coverage. Application of Topsicle to plant and human cancer cells shows high accuracy and comparable results to direct telomere length measurements. We predict Topsicle will be a useful tool for studying telomere biology.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"2 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145103682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-09-22DOI: 10.1186/s13059-025-03749-6
Zheng Su, Mingyan Fang, Andrei Smolnikov, Marcel E. Dinger, Emily C. Oates, Fatemeh Vafaee
{"title":"GeneRAIN: multifaceted representation of genes via deep learning of gene expression networks","authors":"Zheng Su, Mingyan Fang, Andrei Smolnikov, Marcel E. Dinger, Emily C. Oates, Fatemeh Vafaee","doi":"10.1186/s13059-025-03749-6","DOIUrl":"https://doi.org/10.1186/s13059-025-03749-6","url":null,"abstract":"We develop GeneRAIN, a suite of Transformer-based models that learn gene expression relationships from 410 K human bulk RNA-seq samples. Featuring a novel Binning-By-Gene normalization technique, our models capture diverse biological information beyond expression. We introduce GeneRAIN-vec, a multifaceted vectorized gene representation that outperforms those from existing models. We demonstrate knowledge transfer from protein-coding genes to Make 62.5 million biological attribute predictions for 13,030 long noncoding RNAs. This work advances Transformer and self-supervised deep learning applications to expression data, enhancing biological exploration.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"18 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145103799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-09-22DOI: 10.1186/s13059-025-03758-5
Siyuan Wu, Ulf Schmitz
{"title":"ScIsoX: a multidimensional framework for measuring isoform-level transcriptomic complexity in single cells","authors":"Siyuan Wu, Ulf Schmitz","doi":"10.1186/s13059-025-03758-5","DOIUrl":"https://doi.org/10.1186/s13059-025-03758-5","url":null,"abstract":"Single-cell isoform analysis enables high-resolution characterization of transcript expression, yet analytical frameworks to systematically measure transcriptomic complexity are lacking. Here, we introduce ScIsoX, a computational framework that integrates a novel hierarchical data structure, a suite of complexity metrics, and dedicated visualization tools for isoform-level analysis. ScIsoX supports systematic exploration of global and cell-type-specific isoform expression patterns arising from alternative splicing, revealing multidimensional complexity signatures across diverse datasets—insights often missed by conventional gene-level approaches. We demonstrate the utility of ScIsoX across multiple real-world single-cell isoform sequencing datasets, showcasing its potential as a general framework for transcriptomic complexity analysis.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"48 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145103798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Systematic benchmarking of computational methods to identify spatially variable genes","authors":"Zhijian Li, Zain M.Patel, Dongyuan Song, Sai Nirmayi Yasa, Robrecht Cannoodt, Guanao Yan, Jingyi Jessica Li, Luca Pinello","doi":"10.1186/s13059-025-03731-2","DOIUrl":"https://doi.org/10.1186/s13059-025-03731-2","url":null,"abstract":"Spatially resolved transcriptomics offers unprecedented insight by enabling the profiling of gene expression within the intact spatial context of cells, effectively adding a new and essential dimension to data interpretation. To efficiently detect spatial structure of interest, an essential step in analyzing such data involves identifying spatially variable genes (SVGs). Despite researchers having developed several computational methods to accomplish this task, the lack of a comprehensive benchmark evaluating their performance remains a considerable gap in the field. Here, we systematically evaluate 14 methods using 96 spatial datasets and 6 metrics. We compare the methods regarding gene ranking and classification based on real spatial variation, statistical calibration, and computation scalability and investigate the impact of identified SVGs on downstream applications such as spatial domain detection. Finally, we explore the applicability of the methods to spatial ATAC-seq data to examine their effectiveness in identifying spatially variable peaks (SVPs). Overall, SPARK-X outperforms other benchmarked methods and Moran’s I achieves a competitive performance, representing a strong baseline for future method development. Moreover, our results reveal that most methods are poorly calibrated, and more specialized algorithms are needed to identify spatially variable peaks. Our benchmarking provides a detailed comparison of SVG detection methods and serves as a reference for both users and method developers.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"18 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145077421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}