Genome BiologyPub Date : 2024-10-14DOI: 10.1186/s13059-024-03416-2
Yunqing Liu, Ningshan Li, Ji Qi, Gang Xu, Jiayi Zhao, Nating Wang, Xiayuan Huang, Wenhao Jiang, Huanhuan Wei, Aurélien Justet, Taylor S. Adams, Robert Homer, Amei Amei, Ivan O. Rosas, Naftali Kaminski, Zuoheng Wang, Xiting Yan
{"title":"SDePER: a hybrid machine learning and regression method for cell-type deconvolution of spatial barcoding-based transcriptomic data","authors":"Yunqing Liu, Ningshan Li, Ji Qi, Gang Xu, Jiayi Zhao, Nating Wang, Xiayuan Huang, Wenhao Jiang, Huanhuan Wei, Aurélien Justet, Taylor S. Adams, Robert Homer, Amei Amei, Ivan O. Rosas, Naftali Kaminski, Zuoheng Wang, Xiting Yan","doi":"10.1186/s13059-024-03416-2","DOIUrl":"https://doi.org/10.1186/s13059-024-03416-2","url":null,"abstract":"Spatial barcoding-based transcriptomic (ST) data require deconvolution for cellular-level downstream analysis. Here we present SDePER, a hybrid machine learning and regression method to deconvolve ST data using reference single-cell RNA sequencing (scRNA-seq) data. SDePER tackles platform effects between ST and scRNA-seq data, ensuring a linear relationship between them while addressing sparsity and spatial correlations in cell types across capture spots. SDePER estimates cell-type proportions, enabling enhanced resolution tissue mapping by imputing cell-type compositions and gene expressions at unmeasured locations. Applications to simulated data and four real datasets showed SDePER’s superior accuracy and robustness over existing methods.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"34 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142431654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2024-10-14DOI: 10.1186/s13059-024-03414-4
Malick Ndiaye, Silvia Prieto-Baños, Lucy M. Fitzgerald, Ali Yazdizadeh Kharrazi, Sergey Oreshkov, Christophe Dessimoz, Fritz J. Sedlazeck, Natasha Glover, Sina Majidian
{"title":"When less is more: sketching with minimizers in genomics","authors":"Malick Ndiaye, Silvia Prieto-Baños, Lucy M. Fitzgerald, Ali Yazdizadeh Kharrazi, Sergey Oreshkov, Christophe Dessimoz, Fritz J. Sedlazeck, Natasha Glover, Sina Majidian","doi":"10.1186/s13059-024-03414-4","DOIUrl":"https://doi.org/10.1186/s13059-024-03414-4","url":null,"abstract":"The exponential increase in sequencing data calls for conceptual and computational advances to extract useful biological insights. One such advance, minimizers, allows for reducing the quantity of data handled while maintaining some of its key properties. We provide a basic introduction to minimizers, cover recent methodological developments, and review the diverse applications of minimizers to analyze genomic data, including de novo genome assembly, metagenomics, read alignment, read correction, and pangenomes. We also touch on alternative data sketching techniques including universal hitting sets, syncmers, or strobemers. Minimizers and their alternatives have rapidly become indispensable tools for handling vast amounts of data.\u0000","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"83 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142431656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Drought-responsive dynamics of H3K9ac-marked 3D chromatin interactions are integrated by OsbZIP23-associated super-enhancer-like promoter regions in rice","authors":"Yu Chang, Jiahan Liu, Minrong Guo, Weizhi Ouyang, Jiapei Yan, Lizhong Xiong, Xingwang Li","doi":"10.1186/s13059-024-03408-2","DOIUrl":"https://doi.org/10.1186/s13059-024-03408-2","url":null,"abstract":"In response to drought stress (DS), plants undergo complex processes that entail significant transcriptome reprogramming. However, the intricate relationship between the dynamic alterations in the three-dimensional (3D) genome and the modulation of gene co-expression in drought responses remains a relatively unexplored area. In this study, we reconstruct high-resolution 3D genome maps based on genomic regions marked by H3K9ac, an active histone modification that dynamically responds to soil water variations in rice. We discover a genome-wide disconnection of 3D genome contact upon DS with over 10,000 chromatin loops lost, which are partially recovered in the subsequent re-watering. Loops integrating promoter–promoter interactions (PPI) contribute to gene expression in addition to basal H3K9ac modifications. Moreover, H3K9ac-marked promoter regions with high affinities in mediating PPIs, termed as super-promoter regions (SPRs), integrate spatially clustered PPIs in a super-enhancer-like manner. Interestingly, the knockout mutation of OsbZIP23, a well-defined DS-responsive transcription factor, leads to the disassociation of over 80% DS-specific PPIs and decreased expression of the corresponding genes under DS. As a case study, we show how OsbZIP23 integrates the PPI cluster formation and the co-expression of four dehydrin genes, RAB16A–D, through targeting the RAB16C SPR in a stress signaling-dependent manner. Our high-resolution 3D genome maps unveil the principles and details of dynamic genome folding in response to water supply variations and illustrate OsbZIP23 as an indispensable integrator of the yet unique 3D genome organization that is essential for gene co-expression under DS in rice.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"15 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142398276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2024-10-10DOI: 10.1186/s13059-024-03405-5
Susanna Sawyer, Pere Gelabert, Benjamin Yakir, Alejandro Llanos-Lizcano, Alessandra Sperduti, Luca Bondioli, Olivia Cheronet, Christine Neugebauer-Maresch, Maria Teschler-Nicola, Mario Novak, Ildikó Pap, Ildikó Szikossy, Tamás Hajdu, Vyacheslav Moiseyev, Andrey Gromov, Gunita Zariņa, Eran Meshorer, Liran Carmel, Ron Pinhasi
{"title":"Improved detection of methylation in ancient DNA","authors":"Susanna Sawyer, Pere Gelabert, Benjamin Yakir, Alejandro Llanos-Lizcano, Alessandra Sperduti, Luca Bondioli, Olivia Cheronet, Christine Neugebauer-Maresch, Maria Teschler-Nicola, Mario Novak, Ildikó Pap, Ildikó Szikossy, Tamás Hajdu, Vyacheslav Moiseyev, Andrey Gromov, Gunita Zariņa, Eran Meshorer, Liran Carmel, Ron Pinhasi","doi":"10.1186/s13059-024-03405-5","DOIUrl":"https://doi.org/10.1186/s13059-024-03405-5","url":null,"abstract":"Reconstructing premortem DNA methylation levels in ancient DNA has led to breakthrough studies such as the prediction of anatomical features of the Denisovan. These studies rely on computationally inferring methylation levels from damage signals in naturally deaminated cytosines, which requires expensive high-coverage genomes. Here, we test two methods for direct methylation measurement developed for modern DNA based on either bisulfite or enzymatic methylation treatments. Bisulfite treatment shows the least reduction in DNA yields as well as the least biases during methylation conversion, demonstrating that this method can be successfully applied to ancient DNA.\u0000","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"1 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142398277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2024-10-10DOI: 10.1186/s13059-024-03407-3
Varsha Thoppey Manoharan, Aly Abdelkareem, Gurveer Gill, Samuel Brown, Aaron Gillmor, Courtney Hall, Heewon Seo, Kiran Narta, Sean Grewal, Ngoc Ha Dang, Bo Young Ahn, Kata Osz, Xueqing Lun, Laura Mah, Franz Zemp, Douglas Mahoney, Donna L. Senger, Jennifer A. Chan, A. Sorana Morrissy
{"title":"Spatiotemporal modeling reveals high-resolution invasion states in glioblastoma","authors":"Varsha Thoppey Manoharan, Aly Abdelkareem, Gurveer Gill, Samuel Brown, Aaron Gillmor, Courtney Hall, Heewon Seo, Kiran Narta, Sean Grewal, Ngoc Ha Dang, Bo Young Ahn, Kata Osz, Xueqing Lun, Laura Mah, Franz Zemp, Douglas Mahoney, Donna L. Senger, Jennifer A. Chan, A. Sorana Morrissy","doi":"10.1186/s13059-024-03407-3","DOIUrl":"https://doi.org/10.1186/s13059-024-03407-3","url":null,"abstract":"Diffuse invasion of glioblastoma cells through normal brain tissue is a key contributor to tumor aggressiveness, resistance to conventional therapies, and dismal prognosis in patients. A deeper understanding of how components of the tumor microenvironment (TME) contribute to overall tumor organization and to programs of invasion may reveal opportunities for improved therapeutic strategies. Towards this goal, we apply a novel computational workflow to a spatiotemporally profiled GBM xenograft cohort, leveraging the ability to distinguish human tumor from mouse TME to overcome previous limitations in the analysis of diffuse invasion. Our analytic approach, based on unsupervised deconvolution, performs reference-free discovery of cell types and cell activities within the complete GBM ecosystem. We present a comprehensive catalogue of 15 tumor cell programs set within the spatiotemporal context of 90 mouse brain and TME cell types, cell activities, and anatomic structures. Distinct tumor programs related to invasion align with routes of perivascular, white matter, and parenchymal invasion. Furthermore, sub-modules of genes serving as program network hubs are highly prognostic in GBM patients. The compendium of programs presented here provides a basis for rational targeting of tumor and/or TME components. We anticipate that our approach will facilitate an ecosystem-level understanding of the immediate and long-term consequences of such perturbations, including the identification of compensatory programs that will inform improved combinatorial therapies.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"20 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142398214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Transipedia.org: k-mer-based exploration of large RNA sequencing datasets and application to cancer data","authors":"Chloé Bessière, Haoliang Xue, Benoit Guibert, Anthony Boureux, Florence Rufflé, Julien Viot, Rayan Chikhi, Mikaël Salson, Camille Marchet, Thérèse Commes, Daniel Gautheret","doi":"10.1186/s13059-024-03413-5","DOIUrl":"https://doi.org/10.1186/s13059-024-03413-5","url":null,"abstract":"Indexing techniques relying on k-mers have proven effective in searching for RNA sequences across thousands of RNA-seq libraries, but without enabling direct RNA quantification. We show here that arbitrary RNA sequences can be quantified in seconds through their decomposition into k-mers, with a precision akin to that of conventional RNA quantification methods. Using an index of the Cancer Cell Line Encyclopedia (CCLE) collection consisting of 1019 RNA-seq samples, we show that k-mer indexing offers a powerful means to reveal non-reference sequences, and variant RNAs induced by specific gene alterations, for instance in splicing factors.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"207 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142398212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2024-10-10DOI: 10.1186/s13059-024-03409-1
Mir Henglin, Maryam Ghareghani, William T. Harvey, David Porubsky, Sergey Koren, Evan E. Eichler, Peter Ebert, Tobias Marschall
{"title":"Graphasing: phasing diploid genome assembly graphs with single-cell strand sequencing","authors":"Mir Henglin, Maryam Ghareghani, William T. Harvey, David Porubsky, Sergey Koren, Evan E. Eichler, Peter Ebert, Tobias Marschall","doi":"10.1186/s13059-024-03409-1","DOIUrl":"https://doi.org/10.1186/s13059-024-03409-1","url":null,"abstract":"Haplotype information is crucial for biomedical and population genetics research. However, current strategies to produce de novo haplotype-resolved assemblies often require either difficult-to-acquire parental data or an intermediate haplotype-collapsed assembly. Here, we present Graphasing, a workflow which synthesizes the global phase signal of Strand-seq with assembly graph topology to produce chromosome-scale de novo haplotypes for diploid genomes. Graphasing readily integrates with any assembly workflow that both outputs an assembly graph and has a haplotype assembly mode. Graphasing performs comparably to trio phasing in contiguity, phasing accuracy, and assembly quality, outperforms Hi-C in phasing accuracy, and generates human assemblies with over 18 chromosome-spanning haplotypes.\u0000","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"9 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142398213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2024-10-10DOI: 10.1186/s13059-024-03415-3
Gerry A. Shipman, Reinnier Padilla, Cynthia Horth, Bo Hu, Eric Bareke, Francisca N. Vitorino, Joanna M. Gongora, Benjamin A. Garcia, Chao Lu, Jacek Majewski
{"title":"Systematic perturbations of SETD2, NSD1, NSD2, NSD3, and ASH1L reveal their distinct contributions to H3K36 methylation","authors":"Gerry A. Shipman, Reinnier Padilla, Cynthia Horth, Bo Hu, Eric Bareke, Francisca N. Vitorino, Joanna M. Gongora, Benjamin A. Garcia, Chao Lu, Jacek Majewski","doi":"10.1186/s13059-024-03415-3","DOIUrl":"https://doi.org/10.1186/s13059-024-03415-3","url":null,"abstract":"Methylation of histone 3 lysine 36 (H3K36me) has emerged as an essential epigenetic component for the faithful regulation of gene expression. Despite its importance in development and disease, how the molecular agents collectively shape the H3K36me landscape is unclear. We use mouse mesenchymal stem cells to perturb the H3K36me methyltransferases (K36MTs) and infer the activities of the five most prominent enzymes: SETD2, NSD1, NSD2, NSD3, and ASH1L. We find that H3K36me2 is the most abundant of the three methylation states and is predominantly deposited at intergenic regions by NSD1, and partly by NSD2. In contrast, H3K36me1/3 are most abundant within exons and are positively correlated with gene expression. We demonstrate that while SETD2 deposits most H3K36me3, it may also deposit H3K36me2 within transcribed genes. Additionally, loss of SETD2 results in an increase of exonic H3K36me1, suggesting other (K36MTs) prime gene bodies with lower methylation states ahead of transcription. While NSD1/2 establish broad intergenic H3K36me2 domains, NSD3 deposits H3K36me2 peaks on active promoters and enhancers. Meanwhile, the activity of ASH1L is restricted to the regulatory elements of developmentally relevant genes, and our analyses implicate PBX2 as a potential recruitment factor. Within genes, SETD2 primarily deposits H3K36me3, while the other K36MTs deposit H3K36me1/2 independently of SETD2 activity. For the deposition of H3K36me1/2, we find a hierarchy of K36MT activities where NSD1 > NSD2 > NSD3 > ASH1L. While NSD1 and NSD2 are responsible for most genome-wide propagation of H3K36me2, the activities of NSD3 and ASH1L are confined to active regulatory elements.\u0000","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"19 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142398275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2024-10-08DOI: 10.1186/s13059-024-03398-1
Hao Wang, William Torous, Boying Gong, Elizabeth Purdom
{"title":"Visualizing scRNA-Seq data at population scale with GloScope","authors":"Hao Wang, William Torous, Boying Gong, Elizabeth Purdom","doi":"10.1186/s13059-024-03398-1","DOIUrl":"https://doi.org/10.1186/s13059-024-03398-1","url":null,"abstract":"Increasingly, scRNA-Seq studies explore cell populations across different samples and the effect of sample heterogeneity on organism’s phenotype. However, relatively few bioinformatic methods have been developed which adequately address the variation between samples for such population-level analyses. We propose a framework for representing the entire single-cell profile of a sample, which we call a GloScope representation. We implement GloScope on scRNA-Seq datasets from study designs ranging from 12 to over 300 samples and demonstrate how GloScope allows researchers to perform essential bioinformatic tasks at the sample-level, in particular visualization and quality control assessment.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"55 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142384435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome BiologyPub Date : 2024-10-08DOI: 10.1186/s13059-024-03397-2
Zhi-Can Fu, Bao-Qing Gao, Fang Nan, Xu-Kai Ma, Li Yang
{"title":"DEMINING: A deep learning model embedded framework to distinguish RNA editing from DNA mutations in RNA sequencing data","authors":"Zhi-Can Fu, Bao-Qing Gao, Fang Nan, Xu-Kai Ma, Li Yang","doi":"10.1186/s13059-024-03397-2","DOIUrl":"https://doi.org/10.1186/s13059-024-03397-2","url":null,"abstract":"Precise calling of promiscuous adenosine-to-inosine RNA editing sites from transcriptomic datasets is hindered by DNA mutations and sequencing/mapping errors. Here, we present a stepwise computational framework, called DEMINING, to distinguish RNA editing and DNA mutations directly from RNA sequencing datasets, with an embedded deep learning model named DeepDDR. After transfer learning, DEMINING can also classify RNA editing sites and DNA mutations from non-primate sequencing samples. When applied in samples from acute myeloid leukemia patients, DEMINING uncovers previously underappreciated DNA mutation and RNA editing sites; some associated with the upregulated expression of host genes or the production of neoantigens.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"29 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142384438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}