Genome research最新文献

筛选
英文 中文
MCHelper automatically curates transposable element libraries across eukaryotic species MCHelper自动管理跨真核生物物种的转座因子库
IF 7 2区 生物学
Genome research Pub Date : 2024-12-09 DOI: 10.1101/gr.278821.123
Simon Orozco-Arias, Pío Sierra, Richard Durbin, Josefa González
{"title":"MCHelper automatically curates transposable element libraries across eukaryotic species","authors":"Simon Orozco-Arias, Pío Sierra, Richard Durbin, Josefa González","doi":"10.1101/gr.278821.123","DOIUrl":"https://doi.org/10.1101/gr.278821.123","url":null,"abstract":"The number of species with high-quality genome sequences continues to increase, in part due to the scaling up of multiple large-scale biodiversity sequencing projects. While the need to annotate genic sequences in these genomes is widely acknowledged, the parallel need to annotate transposable element (TE) sequences that have been shown to alter genome architecture, rewire gene regulatory networks, and contribute to the evolution of host traits is becoming ever more evident. However, accurate genome-wide annotation of TE sequences is still technically challenging. Several de novo TE identification tools are now available, but manual curation of the libraries produced by these tools is needed to generate high-quality genome annotations. Manual curation is time-consuming, and thus impractical for large-scale genomic studies, and lacks reproducibility. In this work, we present the Manual Curator Helper tool MCHelper, which automates the TE library curation process. By leveraging MCHelper's fully automated mode with the outputs from three de novo TE identification tools, RepeatModeler2, EDTA, and REPET, in the fruit fly, rice, hooded crow, zebrafish, maize, and human, we show a substantial improvement in the quality of the TE libraries and genome annotations. MCHelper libraries are less redundant, with up to 65% reduction in the number of consensus sequences, have up to 11.4% fewer false positive sequences, and up to ∼48% fewer “unclassified/unknown” TE consensus sequences. Genome-wide TE annotations are also improved, including larger unfragmented insertions. Moreover, MCHelper is an easy-to-install and easy-to-use tool.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"20 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142797141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Resolving the chromatin impact of mosaic variants with targeted Fiber-seq 利用靶向纤维序列分析镶嵌变异对染色质的影响
IF 7 2区 生物学
Genome research Pub Date : 2024-12-09 DOI: 10.1101/gr.279747.124
Stephanie C. Bohaczuk, Zachary J. Amador, Chang Li, Benjamin J. Mallory, Elliott G. Swanson, Jane Ranchalis, Mitchell R. Vollger, Katherine M. Munson, Tom Walsh, Morgan O. Hamm, Yizi Mao, Andre Lieber, Andrew B. Stergachis
{"title":"Resolving the chromatin impact of mosaic variants with targeted Fiber-seq","authors":"Stephanie C. Bohaczuk, Zachary J. Amador, Chang Li, Benjamin J. Mallory, Elliott G. Swanson, Jane Ranchalis, Mitchell R. Vollger, Katherine M. Munson, Tom Walsh, Morgan O. Hamm, Yizi Mao, Andre Lieber, Andrew B. Stergachis","doi":"10.1101/gr.279747.124","DOIUrl":"https://doi.org/10.1101/gr.279747.124","url":null,"abstract":"Accurately quantifying the functional consequences of noncoding mosaic variants requires the pairing of DNA sequences with both accessible and closed chromatin architectures along individual DNA molecules—a pairing that cannot be achieved using traditional fragmentation-based chromatin assays. We demonstrate that targeted single-molecule chromatin fiber sequencing (Fiber-seq) achieves this, permitting single-molecule, long-read genomic, and epigenomic profiling across targeted &gt;100 kb loci with ∼10-fold enrichment over untargeted sequencing. Targeted Fiber-seq reveals that pathogenic expansions of the <em>DMPK</em> CTG repeat that underlie Myotonic Dystrophy 1 are characterized by somatic instability and disruption of multiple nearby regulatory elements, both of which are repeat length-dependent. Furthermore, we reveal that therapeutic adenine base editing of the segmentally duplicated γ-globin (<em>HBG1</em>/<em>HBG2</em>) promoters in primary human hematopoietic cells induced toward an erythroblast lineage increases the accessibility of the <em>HBG1</em> promoter as well as neighboring regulatory elements. Overall, we find that these non–protein coding mosaic variants can have complex impacts on chromatin architectures, including extending beyond the regulatory element harboring the variant.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"1 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142797142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rearrangements of viral and human genomes at human papillomavirus integration events and their allele-specific impacts on cancer genome regulation 人乳头瘤病毒整合事件中病毒和人类基因组的重排及其等位基因特异性对癌症基因组调控的影响
IF 7 2区 生物学
Genome research Pub Date : 2024-12-05 DOI: 10.1101/gr.279041.124
Vanessa L. Porter, Michelle Ng, Kieran O'Neill, Signe MacLennan, Richard D. Corbett, Luka Culibrk, Zeid Hamadeh, Marissa Iden, Rachel Schmidt, Shirng-Wern Tsaih, Carolyn Nakisige, Martin Origa, Jackson Orem, Glenn Chang, Jeremy Fan, Ka Ming Nip, Vahid Akbari, Simon K. Chan, James Hopkins, Richard A. Moore, Eric Chuah, Karen L. Mungall, Andrew J. Mungall, Inanc Birol, Steven J.M. Jones, Janet S. Rader, Marco A. Marra
{"title":"Rearrangements of viral and human genomes at human papillomavirus integration events and their allele-specific impacts on cancer genome regulation","authors":"Vanessa L. Porter, Michelle Ng, Kieran O'Neill, Signe MacLennan, Richard D. Corbett, Luka Culibrk, Zeid Hamadeh, Marissa Iden, Rachel Schmidt, Shirng-Wern Tsaih, Carolyn Nakisige, Martin Origa, Jackson Orem, Glenn Chang, Jeremy Fan, Ka Ming Nip, Vahid Akbari, Simon K. Chan, James Hopkins, Richard A. Moore, Eric Chuah, Karen L. Mungall, Andrew J. Mungall, Inanc Birol, Steven J.M. Jones, Janet S. Rader, Marco A. Marra","doi":"10.1101/gr.279041.124","DOIUrl":"https://doi.org/10.1101/gr.279041.124","url":null,"abstract":"Human papillomavirus (HPV) integration has been implicated in transforming HPV infection into cancer. To resolve genome dysregulation associated with HPV integration, we performed Oxford Nanopore long-read sequencing on 72 cervical cancer genomes from an Ugandan dataset that was previously characterized using short-read sequencing. We found recurrent structural rearrangement patterns at HPV integration events, which we categorized as: del(etion)-like, dup(lication)-like, translocation, multibreakpoint, or repeat region integrations. Integrations involving amplified HPV-human concatemers, particularly multibreakpoint events, frequently harbored heterogeneous forms and copy numbers of the viral genome. Transcriptionally active integrants were characterized by unmethylated regions in both the viral and human genomes downstream from the viral transcription start site, resulting in HPV-human fusion transcripts. In contrast, integrants without evidence of expression lacked consistent methylation patterns. Furthermore, whereas transcriptional dysregulation was limited to genes within 200 kilobases of an HPV integrant, dysregulation of the human epigenome in the form of allelic differentially methylated regions affected megabase expanses of the genome, irrespective of the integrant's transcriptional status. By elucidating the structural, epigenetic, and allele-specific impacts of HPV integration, we provide insight into the role of integrated HPV in cervical cancer.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"68 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142783300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An integrative TAD catalog in lymphoblastoid cell lines discloses the functional impact of deletions and insertions in human genomes 淋巴母细胞细胞系的整合TAD目录揭示了人类基因组中缺失和插入的功能影响
IF 7 2区 生物学
Genome research Pub Date : 2024-12-05 DOI: 10.1101/gr.279419.124
Chong Li, Marc Jan Bonder, Sabriya Syed, Matthew Jensen, Human Genome Structural Variation Consortium (HGSVC), HGSVC Functional Analysis Working Group, Mark B. Gerstein, Michael C. Zody, Mark J.P. Chaisson, Michael E. Talkowski, Tobias Marschall, Jan O. Korbel, Evan E. Eichler, Charles Lee, Xinghua Shi
{"title":"An integrative TAD catalog in lymphoblastoid cell lines discloses the functional impact of deletions and insertions in human genomes","authors":"Chong Li, Marc Jan Bonder, Sabriya Syed, Matthew Jensen, Human Genome Structural Variation Consortium (HGSVC), HGSVC Functional Analysis Working Group, Mark B. Gerstein, Michael C. Zody, Mark J.P. Chaisson, Michael E. Talkowski, Tobias Marschall, Jan O. Korbel, Evan E. Eichler, Charles Lee, Xinghua Shi","doi":"10.1101/gr.279419.124","DOIUrl":"https://doi.org/10.1101/gr.279419.124","url":null,"abstract":"The human genome is packaged within a three-dimensional (3D) nucleus and organized into structural units known as compartments, topologically associating domains (TADs), and loops. TAD boundaries, separating adjacent TADs, have been found to be well conserved across mammalian species and more evolutionarily constrained than TADs themselves. Recent studies show that structural variants (SVs) can modify 3D genomes through the disruption of TADs, which play an essential role in insulating genes from outside regulatory elements’ aberrant regulation. However, how SV affects the 3D genome structure and their association among different aspects of gene regulation and candidate <em>cis</em>-regulatory elements (cCREs) have rarely been studied systematically. Here, we assess the impact of SVs intersecting with TAD boundaries by developing an integrative Hi-C analysis pipeline, which enables the generation of an in-depth catalog of TADs and TAD boundaries in human lymphoblastoid cell lines (LCLs) to fill the gap of limited resources. Our catalog contains 18,865 TADs, including 4596 sub-TADs, with 185 SVs (TAD–SVs) that alter chromatin architecture. By leveraging the ENCODE registry of cCREs in humans, we determine that 34 of 185 TAD–SVs intersect with cCREs and observe significant enrichment of TAD–SVs within cCREs. This study provides a database of TADs and TAD–SVs in the human genome that will facilitate future investigations of the impact of SVs on chromatin structure and gene regulation in health and disease.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"199 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142783298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hydra has mammal-like mutation rates facilitating fast adaptation despite its nonaging phenotype 九头蛇具有类似哺乳动物的突变率,促进快速适应,尽管它的表型不老化
IF 7 2区 生物学
Genome research Pub Date : 2024-12-04 DOI: 10.1101/gr.279025.124
Arne Sahm, Konstantin Riege, Marco Groth, Martin Bens, Johann Kraus, Martin Fischer, Hans Kestler, Christoph Englert, Ralf Schaible, Matthias Platzer, Steve Hoffmann
{"title":"Hydra has mammal-like mutation rates facilitating fast adaptation despite its nonaging phenotype","authors":"Arne Sahm, Konstantin Riege, Marco Groth, Martin Bens, Johann Kraus, Martin Fischer, Hans Kestler, Christoph Englert, Ralf Schaible, Matthias Platzer, Steve Hoffmann","doi":"10.1101/gr.279025.124","DOIUrl":"https://doi.org/10.1101/gr.279025.124","url":null,"abstract":"Growing evidence suggests that somatic mutations may be a major cause of the aging process. However, it remains to be tested whether the predictions of the theory also apply to species with longer life spans than humans. <em>Hydra</em> is a genus of freshwater polyps with remarkable regeneration abilities and a potentially unlimited life span under laboratory conditions. By genome sequencing of single cells and whole animals, we found that the mutation rates in <em>Hydra</em>’s stem cells are even slightly higher than in humans or mice. A potential explanation for this deviation from the prediction of the theory may lie in the adaptability offered by a higher mutation rate, as we were able to show that the genome of the widely studied <em>Hydra magnipapillata s</em>train 105 has undergone a process of strong positive selection since the strain's cultivation 50 years ago. This most likely represents a rapid adaptation to the drastically altered environmental conditions associated with the transition from the wild to laboratory conditions. Processes under positive selection in captive animals include pathways associated with <em>Hydra</em>’s simple nervous system, its nucleic acid metabolic process, cell migration, and hydrolase activity.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"27 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142777005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Characterization of DNA methylation reader proteins of Arabidopsis thaliana 拟南芥DNA甲基化解读蛋白的研究
IF 7 2区 生物学
Genome research Pub Date : 2024-12-04 DOI: 10.1101/gr.279379.124
Jonathan Cahn, James P.B. Lloyd, Ino D. Karemaker, Pascal W.T.C. Jansen, Jahnvi Pflueger, Owen Duncan, Jakob Petereit, Ozren Bogdanovic, A. Harvey Millar, Michiel Vermeulen, Ryan Lister
{"title":"Characterization of DNA methylation reader proteins of Arabidopsis thaliana","authors":"Jonathan Cahn, James P.B. Lloyd, Ino D. Karemaker, Pascal W.T.C. Jansen, Jahnvi Pflueger, Owen Duncan, Jakob Petereit, Ozren Bogdanovic, A. Harvey Millar, Michiel Vermeulen, Ryan Lister","doi":"10.1101/gr.279379.124","DOIUrl":"https://doi.org/10.1101/gr.279379.124","url":null,"abstract":"In plants, cytosine DNA methylation (mC) is largely associated with transcriptional repression of transposable elements, but it can also be found in the body of expressed genes, referred to as gene body methylation (gbM). gbM is correlated with ubiquitously expressed genes; however, its function, or absence thereof, is highly debated. The different outputs that mC can have raise questions as to how it is interpreted—or read—differently in these sequence and genomic contexts. To screen for potential mC-binding proteins, we performed an unbiased DNA affinity pull-down assay combined with quantitative mass spectrometry using methylated DNA probes for each DNA sequence context. All mC readers known to date preferentially bind to the methylated probes, along with a range of new mC-binding protein candidates. Functional characterization of these mC readers, focused on the MBD and SUVH families, was undertaken by ChIP-seq mapping of genome-wide binding sites, their protein interactors, and the impact of high-order mutations on transcriptomic and epigenomic profiles. Together, these results highlight specific context preferences for these proteins, and in particular the ability of MBD2 to bind predominantly to gbM. This comprehensive analysis of <em>Arabidopsis</em> mC readers emphasizes the complexity and interconnectivity between DNA methylation and chromatin remodeling processes in plants.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"28 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142776758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structure-optimized sgRNA selection with PlatinumCRISPr for efficient Cas9 generation of knockouts 利用PlatinumCRISPr进行结构优化的sgRNA选择,实现Cas9基因敲除的高效产生
IF 7 2区 生物学
Genome research Pub Date : 2024-12-03 DOI: 10.1101/gr.279479.124
Irmgard U. Haussmann, Thomas C. Dix, David W.J. McQuarrie, Veronica Dezi, Abdullah I. Hans, Roland Arnold, Matthias Soller
{"title":"Structure-optimized sgRNA selection with PlatinumCRISPr for efficient Cas9 generation of knockouts","authors":"Irmgard U. Haussmann, Thomas C. Dix, David W.J. McQuarrie, Veronica Dezi, Abdullah I. Hans, Roland Arnold, Matthias Soller","doi":"10.1101/gr.279479.124","DOIUrl":"https://doi.org/10.1101/gr.279479.124","url":null,"abstract":"A single guide RNA (sgRNA) directs Cas9 nuclease for gene-specific scission of double-stranded DNA. High Cas9 activity is essential for efficient gene editing to generate gene deletions and gene replacements by homologous recombination. However, cleavage efficiency is below 50% for more than half of randomly selected sgRNA sequences in human cell culture screens or model organisms. We used in vitro assays to determine intrinsic molecular parameters for maximal sgRNA activity including correct folding of sgRNAs and Cas9 structural information. From the comparison of over 10 data sets, we find major constraints in sgRNA design originating from defective secondary structure of the sgRNA, sequence context of the seed region, GC context, and detrimental motifs, but we also find considerable variation among different prediction tools when applied to different data sets. To aid selection of efficient sgRNAs, we developed web-based PlatinumCRISPr, an sgRNA design tool to evaluate base-pairing and sequence composition parameters for optimal design of highly efficient sgRNAs for Cas9 genome editing. We applied this tool to select sgRNAs to efficiently generate gene deletions in <em>Drosophila Ythdc1</em> and <em>Ythdf</em>, that bind to <em>N</em><sup>6</sup> methylated adenosines (m<sup>6</sup>A) in mRNA. However, we discovered that generating small deletions with sgRNAs and Cas9 leads to ectopic reinsertion of the deleted DNA fragment elsewhere in the genome. These insertions can be removed by standard genetic recombination and chromosome exchange. These new insights into sgRNA design and the mechanisms of CRISPR–Cas9 genome editing advance the efficient use of this technique for safer applications in humans.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"2 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142763467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The rate and spectrum of new mutations in mice inferred by long-read sequencing 通过长读测序推断小鼠新突变的速率和谱
IF 7 2区 生物学
Genome research Pub Date : 2024-12-02 DOI: 10.1101/gr.279982.124
Eugenio López-Cortegano, Jobran Chebib, Anika Jonas, Anastasia Vock, Sven Künzel, Peter D. Keightley, Diethard Tautz
{"title":"The rate and spectrum of new mutations in mice inferred by long-read sequencing","authors":"Eugenio López-Cortegano, Jobran Chebib, Anika Jonas, Anastasia Vock, Sven Künzel, Peter D. Keightley, Diethard Tautz","doi":"10.1101/gr.279982.124","DOIUrl":"https://doi.org/10.1101/gr.279982.124","url":null,"abstract":"All forms of genetic variation originate from new mutations, making it crucial to understand their rates and mechanisms. Here, we use long-read PacBio sequencing to investigate de novo mutations that accumulated in 12 inbred mouse lines derived from three commonly used inbred strains (C3H, C57BL/6, and FVB) maintained for 8-15 generations in a mutation accumulation (MA) experiment. We built chromosome-level genome assemblies based on the MA line founders' genomes, and then employed a combination of read and assembly-based methods to call the complete spectrum of new mutations. On average, there are ~45 mutations per haploid genome per generation, about half of which (54%) are insertions and deletions shorter than 50 bp (indels). The remainder are single nucleotide mutations (SNMs, 44%) and large structural mutations (SMs, 2%). We found that the degree of DNA repetitiveness is positively correlated with SNM and indel rates, and that a substantial fraction of SMs can be explained by homology-dependent mechanisms associated with repeat sequences. Most (90%) indels can be attributed to microsatellite contractions and expansions, and there is a marked bias towards 4 bp indels. Among the different types of SMs, tandem repeat mutations have the highest mutation rate, followed by insertions of transposable elements (TEs). We uncover a rich landscape of active TEs, and notable differences in their spectrum among MA lines and strains, and a high rate of gene retroposition. Our study offers novel insights into mammalian genome evolution, and highlights the importance of repetitive elements in shaping genomic diversity.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"45 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142760310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inferring disease progressive stages in single-cell transcriptomics using a weakly-supervised deep learning approach 使用弱监督深度学习方法推断单细胞转录组学中的疾病进展阶段
IF 7 2区 生物学
Genome research Pub Date : 2024-12-02 DOI: 10.1101/gr.278812.123
Fabien Wehbe, Levi Adams, Jordan Babadoudou, Samantha Yuen, Yoon-Seong Kim, Yoshiaki Tanaka
{"title":"Inferring disease progressive stages in single-cell transcriptomics using a weakly-supervised deep learning approach","authors":"Fabien Wehbe, Levi Adams, Jordan Babadoudou, Samantha Yuen, Yoon-Seong Kim, Yoshiaki Tanaka","doi":"10.1101/gr.278812.123","DOIUrl":"https://doi.org/10.1101/gr.278812.123","url":null,"abstract":"Application of single-cell/nucleus genomic sequencing to patient-derived tissues offers potential solutions to delineate disease mechanisms in human. However, individual cells in patient-derived tissues are in different pathological stages, and hence such cellular variability impedes subsequent differential gene expression analyses. To overcome such heterogeneity issue, we present a novel deep learning approach, scIDST, that infers disease progressive levels of individual cells with weak supervision framework. The inferred disease progressive cells displayed significant differential expression of disease-relevant genes, which could not be detected by comparative analysis between patients and healthy donors. In addition, we demonstrated that pretrained models by scIDST are applicable to multiple independent data resources, and advantageous to infer cells related to certain disease risks and comorbidities. Taken together, scIDST offers a new strategy of single-cell sequencing analysis to identify bona fide disease-associated molecular features.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"32 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142760320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A low-abundance class of Dicer-dependent siRNAs produced from a variety of features in C. elegans 一种低丰度的dicer依赖性sirna,由秀丽隐杆线虫的多种特征产生
IF 7 2区 生物学
Genome research Pub Date : 2024-12-02 DOI: 10.1101/gr.279083.124
Thiago L. Knittel, Brooke E. Montgomery, Alex J. Tate, Ennis W. Deihl, Anastasia S. Nawrocki, Frederic J. Hoerndli, Taiowa A. Montgomery
{"title":"A low-abundance class of Dicer-dependent siRNAs produced from a variety of features in C. elegans","authors":"Thiago L. Knittel, Brooke E. Montgomery, Alex J. Tate, Ennis W. Deihl, Anastasia S. Nawrocki, Frederic J. Hoerndli, Taiowa A. Montgomery","doi":"10.1101/gr.279083.124","DOIUrl":"https://doi.org/10.1101/gr.279083.124","url":null,"abstract":"Canonical small interfering RNAs (siRNAs) are processed from double-stranded RNA (dsRNA) by Dicer and associate with Argonautes to direct RNA silencing. In <em>Caenorhabditis elegans</em>, 22G-RNAs and 26G-RNAs are often referred to as siRNAs but display distinct characteristics. For example, 22G-RNAs do not originate from dsRNA and do not depend on Dicer, whereas 26G-RNAs require Dicer but derive from an atypical RNA duplex and are produced exclusively antisense to their messenger RNA (mRNA) templates. To identify canonical siRNAs in <em>C. elegans</em>, we first characterized the siRNAs produced via the exogenous RNA interference (RNAi) pathway. During RNAi, dsRNA is processed into ∼23 nt duplexes with ∼2 nt, 3′-overhangs, ultimately yielding siRNAs devoid of 5′G-containing sequences that bind with high affinity to the Argonaute RDE-1, but also to the microRNA (miRNA) pathway Argonaute, ALG-1. Using these characteristics, we searched for their endogenous counterparts and identified thousands of endogenous loci representing dozens of unique elements that give rise to mostly low to moderate levels of siRNAs, called 23H-RNAs. These loci include repetitive elements, putative coding genes, pseudogenes, noncoding RNAs, and unannotated features, many of which adopt hairpin (hp) structures reminiscent of the hpRNA/RNAi pathway in flies and mice. RDE-1 competes with other Argonautes for binding to 23H-RNAs. When RDE-1 is depleted, these siRNAs are enriched in ALG-1 and ALG-2 complexes. Our results expand the known repertoire of <em>C. elegans</em> small RNAs and their Argonaute interactors, and demonstrate that key features of the endogenous siRNA pathway are relatively unchanged in animals.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"45 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142760361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信