Introductory Analysis and Validation of CUT&RUN Sequencing Data.

IF 1.2 4区综合性期刊 Q3 MULTIDISCIPLINARY SCIENCES

Jove-Journal of Visualized Experiments Pub Date : 2024-12-13 DOI:10.3791/67359

Junwoo Lee, Biji Chatterjee, Nakyung Oh, Dhurjhoti Saha, Yue Lu, Blaine Bartholomew, Charles A Ishak

{"title":"Introductory Analysis and Validation of CUT&RUN Sequencing Data.","authors":"Junwoo Lee, Biji Chatterjee, Nakyung Oh, Dhurjhoti Saha, Yue Lu, Blaine Bartholomew, Charles A Ishak","doi":"10.3791/67359","DOIUrl":null,"url":null,"abstract":"<p><p>The CUT&RUN technique facilitates detection of protein-DNA interactions across the genome. Typical applications of CUT&RUN include profiling changes in histone tail modifications or mapping transcription factor chromatin occupancy. Widespread adoption of CUT&RUN is driven, in part, by technical advantages over conventional ChIP-seq that include lower cell input requirements, lower sequencing depth requirements, and increased sensitivity with reduced background signal due to a lack of cross-linking agents that otherwise mask antibody epitopes. Widespread adoption of CUT&RUN has also been achieved through the generous sharing of reagents by the Henikoff lab and the development of commercial kits to accelerate adoption for beginners. As technical adoption of CUT&RUN increases, CUT&RUN sequencing analysis and validation become critical bottlenecks that must be surmounted to enable complete adoption by predominantly wet lab teams. CUT&RUN analysis typically begins with quality control checks on raw sequencing reads to assess sequencing depth, read quality, and potential biases. Reads are then aligned to a reference genome sequence assembly, and several bioinformatics tools are subsequently employed to annotate genomic regions of protein enrichment, confirm data interpretability, and draw biological conclusions. Although multiple in silico analysis pipelines have been developed to support CUT&RUN data analysis, their complex multi-module structure and usage of multiple programming languages render the platforms difficult for bioinformatics beginners who may lack familiarity with multiple programming languages but wish to understand the CUT&RUN analysis procedure and customize their analysis pipelines. Here, we provide a single-language step-by-step CUT&RUN analysis pipeline protocol designed for users with any level of bioinformatics experience. This protocol includes completing critical quality checks to validate that the sequencing data is suitable for biological interpretation. We expect that following the introductory protocol provided in this article combined with downstream peak annotation will allow users to draw biological insights from their own CUT&RUN datasets.</p>","PeriodicalId":48787,"journal":{"name":"Jove-Journal of Visualized Experiments","volume":" 214","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jove-Journal of Visualized Experiments","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.3791/67359","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

The CUT&RUN technique facilitates detection of protein-DNA interactions across the genome. Typical applications of CUT&RUN include profiling changes in histone tail modifications or mapping transcription factor chromatin occupancy. Widespread adoption of CUT&RUN is driven, in part, by technical advantages over conventional ChIP-seq that include lower cell input requirements, lower sequencing depth requirements, and increased sensitivity with reduced background signal due to a lack of cross-linking agents that otherwise mask antibody epitopes. Widespread adoption of CUT&RUN has also been achieved through the generous sharing of reagents by the Henikoff lab and the development of commercial kits to accelerate adoption for beginners. As technical adoption of CUT&RUN increases, CUT&RUN sequencing analysis and validation become critical bottlenecks that must be surmounted to enable complete adoption by predominantly wet lab teams. CUT&RUN analysis typically begins with quality control checks on raw sequencing reads to assess sequencing depth, read quality, and potential biases. Reads are then aligned to a reference genome sequence assembly, and several bioinformatics tools are subsequently employed to annotate genomic regions of protein enrichment, confirm data interpretability, and draw biological conclusions. Although multiple in silico analysis pipelines have been developed to support CUT&RUN data analysis, their complex multi-module structure and usage of multiple programming languages render the platforms difficult for bioinformatics beginners who may lack familiarity with multiple programming languages but wish to understand the CUT&RUN analysis procedure and customize their analysis pipelines. Here, we provide a single-language step-by-step CUT&RUN analysis pipeline protocol designed for users with any level of bioinformatics experience. This protocol includes completing critical quality checks to validate that the sequencing data is suitable for biological interpretation. We expect that following the introductory protocol provided in this article combined with downstream peak annotation will allow users to draw biological insights from their own CUT&RUN datasets.

查看原文本刊更多论文

CUT&RUN测序数据的分析与验证。

CUT&RUN技术促进了整个基因组中蛋白质- dna相互作用的检测。CUT&RUN的典型应用包括分析组蛋白尾部修饰的变化或定位转录因子染色质占用。与传统ChIP-seq相比，CUT&RUN的广泛采用在一定程度上是由于其技术优势，包括较低的细胞输入要求、较低的测序深度要求，以及由于缺乏掩盖抗体表位的交联剂，在降低背景信号的情况下提高灵敏度。通过Henikoff实验室慷慨分享试剂和商业试剂盒的开发，以加速初学者的采用，CUT&RUN也得到了广泛采用。随着CUT&RUN技术采用的增加，CUT&RUN测序分析和验证成为关键的瓶颈，必须被克服，才能使主要的湿实验室团队完全采用。CUT&RUN分析通常从对原始测序读取的质量控制检查开始，以评估测序深度、读取质量和潜在偏差。然后将Reads与参考基因组序列组合对齐，随后使用几种生物信息学工具来注释蛋白质富集的基因组区域，确认数据的可解释性，并得出生物学结论。尽管已经开发了多个芯片分析管道来支持CUT&RUN数据分析，但其复杂的多模块结构和多种编程语言的使用使生物信息学初学者难以理解多种编程语言，但希望了解CUT&RUN分析过程并自定义其分析管道。在这里，我们提供了一个单语言一步一步的CUT&RUN分析管道协议，专为具有任何生物信息学经验的用户设计。该方案包括完成关键的质量检查，以验证测序数据适合生物学解释。我们期望，遵循本文提供的介绍性协议，结合下游峰值注释，将允许用户从他们自己的CUT&RUN数据集中获得生物学见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Jove-Journal of Visualized Experiments MULTIDISCIPLINARY SCIENCES-

CiteScore

2.10

自引率

0.00%

发文量

992

期刊介绍： JoVE, the Journal of Visualized Experiments, is the world''s first peer reviewed scientific video journal. Established in 2006, JoVE is devoted to publishing scientific research in a visual format to help researchers overcome two of the biggest challenges facing the scientific research community today; poor reproducibility and the time and labor intensive nature of learning new experimental techniques.