Genome BiologyPub Date : 2025-06-13DOI: 10.1186/s13059-025-03557-y
James R. M. Black, Thomas P. Jones, Carlos Martínez-Ruiz, Maria Litovchenko, Clare Puttick, Charles Swanton, Nicholas McGranahan
{"title":"Cancer gene identification from RNA variant allelic frequencies using RVdriver","authors":"James R. M. Black, Thomas P. Jones, Carlos Martínez-Ruiz, Maria Litovchenko, Clare Puttick, Charles Swanton, Nicholas McGranahan","doi":"10.1186/s13059-025-03557-y","DOIUrl":"https://doi.org/10.1186/s13059-025-03557-y","url":null,"abstract":"Existing approaches to identifying cancer genes rely overwhelmingly on DNA sequencing data. Here, we introduce RVdriver, a computational tool that leverages paired bulk genomic and transcriptomic data to classify RNA variant allele frequencies (VAFs) of non-synonymous mutations relative to a synonymous mutation background. We analyze 7882 paired exomes and transcriptomes from 31 cancer types and identify novel, as well as known, cancer genes, complementing other DNA-based approaches. Furthermore, RNA VAFs of individual mutations are able to distinguish “driver” from “passenger” mutations within established cancer genes. This approach highlights the value of multi-omic approaches for cancer gene discovery.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"22 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144278663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-06-13DOI: 10.1186/s13059-025-03642-2
Tao Jiang, Shuqi Cao, Yadong Liu, Zhendong Zhang, Bo Liu, Ruibang Luo, Guohua Wang, Yadong Wang
{"title":"cuteFC: regenotyping structural variants through an accurate and efficient force-calling method","authors":"Tao Jiang, Shuqi Cao, Yadong Liu, Zhendong Zhang, Bo Liu, Ruibang Luo, Guohua Wang, Yadong Wang","doi":"10.1186/s13059-025-03642-2","DOIUrl":"https://doi.org/10.1186/s13059-025-03642-2","url":null,"abstract":"Long-read sequencing technologies have great potential for the comprehensive discovery of structural variations (SVs). However, accurate genotype assignment for SVs remains challenging due to unavoidable sequencing errors, limited coverage, and the complexity of SVs. Herein, we propose cuteFC, which employs self-adaptive clustering along with a multiallele-aware clustering to achieve accurate SV regenotyping through a force-calling approach. cuteFC also applies a Genome Position Scanner algorithm to improve its application efficiency. Benchmarking evaluations demonstrate that cuteFC outperforms state-of-the-art methods with 2–5% higher F1 scores and constructs a higher-quality genomic atlas with minimal computational resources. cuteFC is available at https://github.com/Meltpinkg/cuteFC and https://zenodo.org/records/14671406 .","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"91 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144278659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-06-12DOI: 10.1186/s13059-025-03640-4
Andrew Kristof, Krithika Karunakaran, Christopher Allen, Paula Mizote, Sophie Briggs, Zixin Jian, Patrick Nash, John Blazeck
{"title":"Engineering novel CRISPRi repressors for highly efficient mammalian gene regulation","authors":"Andrew Kristof, Krithika Karunakaran, Christopher Allen, Paula Mizote, Sophie Briggs, Zixin Jian, Patrick Nash, John Blazeck","doi":"10.1186/s13059-025-03640-4","DOIUrl":"https://doi.org/10.1186/s13059-025-03640-4","url":null,"abstract":"CRISPR interference (CRISPRi), the repurposing of the RNA-guided endonuclease dCas9 as a programmable transcriptional repressor, allows highly specific repression (knockdown) of gene expression. CRISPRi platforms can often have incomplete knockdown, performance variability across cell lines and gene targets, and inconsistencies dependent on the guide RNA sequence employed. Here, we explore the combination of novel repressor domains with strong Krüppel-associated box (KRAB) repressors, screening > 100 bipartite and tripartite fusion proteins for their ability to reduce gene expression as CRISPRi effectors. We show that these novel repressor fusions have reduced dependence on guide RNA sequences, better slow cell growth when used to knock down expression of essential genes, and function in either fusion or scaffold modalities. Furthermore, we isolate and characterize a particularly effective CRISPRi platform, dCas9-ZIM3(KRAB)-MeCP2(t), which shows improved gene repression of endogenous targets both at the transcript and protein level across several cell lines and when deployed in genome-wide screens. We posit that these novel repressor fusions can enhance the reproducibility and utility of CRISPRi in mammalian cells.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"92 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144269172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-06-11DOI: 10.1186/s13059-025-03603-9
Siyu Li, Yifan Huang, Shengquan Chen
{"title":"MINGLE: a mutual information-based interpretable framework for automatic cell type annotation in single-cell chromatin accessibility data","authors":"Siyu Li, Yifan Huang, Shengquan Chen","doi":"10.1186/s13059-025-03603-9","DOIUrl":"https://doi.org/10.1186/s13059-025-03603-9","url":null,"abstract":"Single-cell chromatin accessibility sequencing (scCAS) has proven invaluable for investigating the intricate landscape of epigenomic heterogeneity. We propose MINGLE, a mutual information-based interpretable framework that leverages cellular similarities and topological structures for accurate cell type annotation of scCAS data. Additionally, we introduce a convex hull-based strategy to effectively identify novel cell types. Extensive experiments demonstrate MINGLE’s superior annotation performance, particularly for rare and novel cell types, delivering valuable biological insights compared to existing methods. Moreover, MINGLE excels in cross-batch, cross-tissue, and cross-species scenarios, showing robustness to data imbalance and size, highlighting its versatility for complex annotation tasks.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"70 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144260248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-06-11DOI: 10.1186/s13059-025-03641-3
Avi Bograd, Yaara Oppenheimer-Shaanan, Asaf Levy
{"title":"Plasmids, prophages, and defense systems are depleted from plant microbiota genomes","authors":"Avi Bograd, Yaara Oppenheimer-Shaanan, Asaf Levy","doi":"10.1186/s13059-025-03641-3","DOIUrl":"https://doi.org/10.1186/s13059-025-03641-3","url":null,"abstract":"Plant-associated bacteria significantly impact plant growth and health. Understanding how bacterial genomes adapt to plants can provide insights into their growth promotion and virulence functions. Here, we compare 38,912 bacterial genomes and 6073 metagenomes to explore the distribution of mobile genetic elements and defense systems in plant-associated bacteria. We reveal a consistent taxon-independent depletion of prophages, plasmids, and defense systems in plant-associated bacteria, particularly in the phyllosphere, compared to other ecosystems. The mobilome depletion suggests the presence of unique ecological constraints or molecular mechanisms exerted by plants to control the bacterial mobilomes independently of bacterial immunity.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"585 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144260350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-06-11DOI: 10.1186/s13059-025-03635-1
Saswat K. Mohanty, Francesca Chiaromonte, Kateryna D. Makova
{"title":"Evolutionary dynamics of predicted G-quadruplexes in human and other great apes","authors":"Saswat K. Mohanty, Francesca Chiaromonte, Kateryna D. Makova","doi":"10.1186/s13059-025-03635-1","DOIUrl":"https://doi.org/10.1186/s13059-025-03635-1","url":null,"abstract":"G-quadruplexes (G4s) are non-canonical DNA structures that can form at approximately 1% of the human genome. They facilitate genomic instability by increasing point mutations and structural variation. Numerous G4s participate in telomere maintenance and regulating transcription and replication, and evolve under purifying selection. Despite these important functions, G4s have remained under-studied in human and ape genomes due to incomplete assemblies. Here, we conduct a comprehensive analysis of predicted G4s (pG4s) in the recently released, telomere-to-telomere (T2T) genomes of human, bonobo, chimpanzee, gorilla, Bornean orangutan, and Sumatran orangutan. We annotate 41,232–174,442 new pG4s in these T2T compared to previous ape genome assemblies (5%–21% increase). Analyzing inter-species whole-genome alignments, we identify pG4s shared across apes (approximately one-third of all pG4s) and thousands of species-specific pG4s. pG4s accumulate and diverge at rates consistent with divergence times between species, following molecular clock. pG4s shared across apes are enriched and hypomethylated at regulatory regions—enhancers, promoters, UTRs, and origins of replication—suggesting their conserved formation and functions. Species-specific pG4s (constituting 11–27% of all pG4s) are located in regulatory regions, potentially contributing to adaptations, and in repeats, likely driving genome expansions. Our findings illuminate the evolutionary dynamics of G4s, conservation of their role in gene regulation, and their contributions to ape genome evolution. Our study highlights the utility of high-resolution T2T genomes in revealing elusive yet likely functionally relevant genomic features previously hidden by incomplete assemblies.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"11 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144260592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-06-10DOI: 10.1186/s13059-025-03614-6
Xin Wei, Wenjing Ma, Zhijin Wu, Hao Wu
{"title":"TORC: Target-Oriented Reference Construction for supervised cell-type identification in scRNA-seq","authors":"Xin Wei, Wenjing Ma, Zhijin Wu, Hao Wu","doi":"10.1186/s13059-025-03614-6","DOIUrl":"https://doi.org/10.1186/s13059-025-03614-6","url":null,"abstract":"Cell-type identification is a crucial step in single cell RNA-seq (scRNA-seq) data analysis, for which supervised methods are preferred due to their accuracy and efficiency. Performance is highly dependent on the quality of the reference data, but there is no method for selecting and constructing reference data. We develop Target-Oriented Reference Construction (TORC), a widely applicable strategy for constructing reference data given a target dataset for scRNA-seq supervised cell-type identification. TORC alleviates the differences in data distribution and cell-type composition between reference and target. Extensive benchmarks on simulated and real data analyses demonstrate consistent improvements in cell-type identification from TORC.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"20 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144252292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-06-10DOI: 10.1186/s13059-025-03626-2
Francisco M. Martín-Zamora, Joby Cole, Rory D. Donnellan, Kero Guynes, Allan M. Carrillo-Baltodano, Mark J. Dickman, Paul J. Hurd, José M. Martín-Durán
{"title":"A dynamic histone-based chromatin regulatory toolkit underpins genome and developmental evolution in an invertebrate clade","authors":"Francisco M. Martín-Zamora, Joby Cole, Rory D. Donnellan, Kero Guynes, Allan M. Carrillo-Baltodano, Mark J. Dickman, Paul J. Hurd, José M. Martín-Durán","doi":"10.1186/s13059-025-03626-2","DOIUrl":"https://doi.org/10.1186/s13059-025-03626-2","url":null,"abstract":"The dynamic addition and removal of posttranslational modifications on eukaryotic histones define regulatory regions that play a central role in genome and chromatin biology. However, our understanding of these regulatory mechanisms in animals is primarily based on a few model systems, preventing a general understanding of how histone-based regulation directs and promotes phenotypic variation during animal embryogenesis. Here, we apply a comprehensive multi-omics approach to dissect the histone-based regulatory complement in Annelida, one of the largest invertebrate clades. Annelids exhibit a conserved histone repertoire organized in clusters of dynamically regulated, hyperaccessible chromatin. However, unlike other animals with reduced genomes, the worm Dimorphilus gyrociliatus shows a dramatically streamlined histone repertoire, revealing that genome compaction has lineage-specific effects on histone-based regulation. Notably, the annelid Owenia fusiformis has two H2A.X variants that co-occur in other animals, sometimes associate with fast cell divisions, and represent a unique case of widespread parallel evolution of a histone variant in Eukarya. Histone-modifying enzyme complements are largely conserved among annelids. Yet, temporal differences in the expression of a reduced set of histone modifiers correlate with distinct ontogenetic traits and variation in the adult landscapes of histone posttranslational modifications, as revealed by quantitative mass spectrometry in O. fusiformis and Capitella teleta. Our analysis of histone-based epigenetics within a non-model phylum informs the evolution of histone-based regulation, presenting a framework to explore how this fundamental genome regulatory layer generally contributes to developmental and morphological diversification in annelids and animals.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"586 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144252316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"spaMGCN: a graph convolutional network with autoencoder for spatial domain identification using multi-scale adaptation","authors":"Tianjiao Zhang, Hongfei Zhang, Zhongqian Zhao, Saihong Shao, Yucai Jiang, Xiang Zhang, Guohua Wang","doi":"10.1186/s13059-025-03637-z","DOIUrl":"https://doi.org/10.1186/s13059-025-03637-z","url":null,"abstract":"Spatial domain identification is crucial in spatial transcriptomics analysis. Existing methods excel with continuous and clustered distributions but struggle with discrete ones. We present spaMGCN, an innovative approach specifically designed for identifying spatial domains, especially in discrete tissue distributions. By integrating spatial transcriptomics and spatial epigenomic data through an autoencoder and a multi-scale adaptive graph convolutional network, spaMGCN outperforms baseline methods. Our evaluations demonstrate its effectiveness in recognizing discrete T cell zones in mouse spleens and follicular cells in human lymph nodes, as well as effectively distinguishing capsule structures from surrounding tissues.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"17 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144252295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2025-06-10DOI: 10.1186/s13059-025-03633-3
Jingjing He, Yi Yang, Rui Jiang, Yanying Zheng, Xianfa Yang, Xu Jiang, Xin Xue, Zhongzhou Yang, Naihe Jing, Hailong Cao, Zhuojuan Luo, Ke Wei, Peng Xie, Chengqi Lin
{"title":"Integration of single-cell and spatial transcriptomics by SEU-TCA reveals the spatial origin of early cardiac progenitors","authors":"Jingjing He, Yi Yang, Rui Jiang, Yanying Zheng, Xianfa Yang, Xu Jiang, Xin Xue, Zhongzhou Yang, Naihe Jing, Hailong Cao, Zhuojuan Luo, Ke Wei, Peng Xie, Chengqi Lin","doi":"10.1186/s13059-025-03633-3","DOIUrl":"https://doi.org/10.1186/s13059-025-03633-3","url":null,"abstract":"Obtaining single-cell spatial information remains a challenge in spatial transcriptomics. Here we develop SEU-TCA, a method that leverages transfer component analysis to improve single-cell spatial mapping accuracy. Application to multiple single-cell and spatial transcriptomic datasets shows superior performance in spatial deconvolution and cell mapping. Using SEU-TCA, we explore spatial gene expression and regulon activity during mouse gastrulation and identify anterior second heart field progenitors regulated by Irx1. Functional experiments reveal that Irx1 deletion disrupts anterior second heart field development and causes ventricular septal defects, underscoring SEU-TCA’s potential for advancing developmental biology research.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"80 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144252296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}