通过直接RNA测序分析罕见的C-to-U编辑事件。

4区生物学 Q3 Biochemistry, Genetics and Molecular Biology

Methods in enzymology Pub Date : 2025-01-01 Epub Date: 2025-01-21 DOI:10.1016/bs.mie.2024.11.040

Adriano Fonzino, Pietro Luca Mazzacuva, Graziano Pesole, Ernesto Picardi

{"title":"通过直接RNA测序分析罕见的C-to-U编辑事件。","authors":"Adriano Fonzino, Pietro Luca Mazzacuva, Graziano Pesole, Ernesto Picardi","doi":"10.1016/bs.mie.2024.11.040","DOIUrl":null,"url":null,"abstract":"In mammals, RNA editing involves the hydrolytic deamination of adenosine (A) to inosine (I) or of cytosine (C) to uracil (U) by the ADAR and APOBEC families of enzymes, respectively. Direct RNA (dRNA) sequencing by Oxford Nanopore Technology (ONT) allows the detection of Us and, thus, facilitates the unveiling of edited Cs avoiding Reverse Transcription and PCR amplification steps. However, dRNA data are noisy, and very rare events such as C-to-U conversions cannot be easily distinguished from background noise or mutation errors. To overcome this issue, we developed a novel machine-learning strategy based on the Isolation Forest (iForest) algorithm to denoise the signal deriving from dRNA highly-informative ONT data. Here we present a step-by-step protocol illustrating the usage of the C-to-U-Classifier package and how to apply its pretrained iForest models for ameliorating the detection of C-to-U events in mammalian transcriptomes. As an example, we show here the whole pipeline in action on data deriving from wild-type (WT) and APOBEC1 knock-out (KO) macrophagic cell lines. Additionally, the polishing power of our algorithm is proved through a synthetic in-vitro transcribed (IVT) sample in which C-to-U events are not present.","PeriodicalId":18662,"journal":{"name":"Methods in enzymology","volume":"713 ","pages":"221-254"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Profiling rare C-to-U editing events via direct RNA sequencing.\",\"authors\":\"Adriano Fonzino, Pietro Luca Mazzacuva, Graziano Pesole, Ernesto Picardi\",\"doi\":\"10.1016/bs.mie.2024.11.040\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In mammals, RNA editing involves the hydrolytic deamination of adenosine (A) to inosine (I) or of cytosine (C) to uracil (U) by the ADAR and APOBEC families of enzymes, respectively. Direct RNA (dRNA) sequencing by Oxford Nanopore Technology (ONT) allows the detection of Us and, thus, facilitates the unveiling of edited Cs avoiding Reverse Transcription and PCR amplification steps. However, dRNA data are noisy, and very rare events such as C-to-U conversions cannot be easily distinguished from background noise or mutation errors. To overcome this issue, we developed a novel machine-learning strategy based on the Isolation Forest (iForest) algorithm to denoise the signal deriving from dRNA highly-informative ONT data. Here we present a step-by-step protocol illustrating the usage of the C-to-U-Classifier package and how to apply its pretrained iForest models for ameliorating the detection of C-to-U events in mammalian transcriptomes. As an example, we show here the whole pipeline in action on data deriving from wild-type (WT) and APOBEC1 knock-out (KO) macrophagic cell lines. Additionally, the polishing power of our algorithm is proved through a synthetic in-vitro transcribed (IVT) sample in which C-to-U events are not present.\",\"PeriodicalId\":18662,\"journal\":{\"name\":\"Methods in enzymology\",\"volume\":\"713 \",\"pages\":\"221-254\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Methods in enzymology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1016/bs.mie.2024.11.040\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/21 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods in enzymology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/bs.mie.2024.11.040","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/21 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}

引用次数: 0

摘要

在哺乳动物中，RNA编辑涉及ADAR和APOBEC酶家族分别将腺苷(A)水解为肌苷(I)或将胞嘧啶(C)水解为尿嘧啶(U)。牛津纳米孔技术（ONT）的直接RNA （dRNA）测序允许检测Us，从而有助于揭示编辑过的Cs，避免了逆转录和PCR扩增步骤。然而，dna数据是有噪声的，非常罕见的事件，如C-to-U转换，很难与背景噪声或突变错误区分开来。为了克服这个问题，我们开发了一种基于隔离森林（ifforest）算法的新型机器学习策略，以对来自dRNA高信息量ONT数据的信号进行降噪。在这里，我们提出了一个循序渐进的方案，说明了C-to-U- classifier包的使用，以及如何应用其预训练的ifforest模型来改进哺乳动物转录组中C-to-U事件的检测。作为一个例子，我们在这里展示了整个管道在野生型（WT）和APOBEC1敲除（KO）巨噬细胞系数据上的作用。此外，通过不存在C-to-U事件的合成体外转录（IVT）样本证明了我们算法的抛光能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Profiling rare C-to-U editing events via direct RNA sequencing.

In mammals, RNA editing involves the hydrolytic deamination of adenosine (A) to inosine (I) or of cytosine (C) to uracil (U) by the ADAR and APOBEC families of enzymes, respectively. Direct RNA (dRNA) sequencing by Oxford Nanopore Technology (ONT) allows the detection of Us and, thus, facilitates the unveiling of edited Cs avoiding Reverse Transcription and PCR amplification steps. However, dRNA data are noisy, and very rare events such as C-to-U conversions cannot be easily distinguished from background noise or mutation errors. To overcome this issue, we developed a novel machine-learning strategy based on the Isolation Forest (iForest) algorithm to denoise the signal deriving from dRNA highly-informative ONT data. Here we present a step-by-step protocol illustrating the usage of the C-to-U-Classifier package and how to apply its pretrained iForest models for ameliorating the detection of C-to-U events in mammalian transcriptomes. As an example, we show here the whole pipeline in action on data deriving from wild-type (WT) and APOBEC1 knock-out (KO) macrophagic cell lines. Additionally, the polishing power of our algorithm is proved through a synthetic in-vitro transcribed (IVT) sample in which C-to-U events are not present.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Methods in enzymology 生物-生化研究方法

CiteScore

2.90

自引率

0.00%

发文量

308

审稿时长

3-6 weeks

期刊介绍： The critically acclaimed laboratory standard for almost 50 years, Methods in Enzymology is one of the most highly respected publications in the field of biochemistry. Each volume is eagerly awaited, frequently consulted, and praised by researchers and reviewers alike. Now with over 500 volumes the series contains much material still relevant today and is truly an essential publication for researchers in all fields of life sciences, including microbiology, biochemistry, cancer research and genetics-just to name a few. Five of the 2013 Nobel Laureates have edited or contributed to volumes of MIE.