Inference of transcriptional regulation from STARR-seq data.

IF 2.2 3区 物理与天体物理 Q2 PHYSICS, FLUIDS & PLASMAS
Amin Safaeesirat, Hoda Taeb, Emirhan Tekoglu, Tunc Morova, Nathan A Lack, Eldon Emberly
{"title":"Inference of transcriptional regulation from STARR-seq data.","authors":"Amin Safaeesirat, Hoda Taeb, Emirhan Tekoglu, Tunc Morova, Nathan A Lack, Eldon Emberly","doi":"10.1103/PhysRevE.111.024402","DOIUrl":null,"url":null,"abstract":"<p><p>One of the primary regulatory processes in cells is transcription, during which RNA polymerase II (Pol-II) transcribes DNA into RNA. The binding of Pol-II to its site is regulated through interactions with transcription factors (TFs) that bind to DNA at enhancer cis-regulatory elements. Measuring the enhancer activity of large libraries of distinct DNA sequences is now possible using massively parallel reporter assays (MPRAs), and computational methods have been developed to identify the dominant statistical patterns of TF binding within these large datasets. Such methods are global in their approach and may overlook important regulatory sites that function only within the local context. Here we introduce a method for inferring functional regulatory sites (their number, location, and width) within an enhancer sequence based on measurements of its transcriptional activity from an MPRA method such as STARR-seq. The model is based on a mean-field thermodynamic description of Pol-II binding that includes interactions with bound TFs. Our method applied to simulated STARR-seq data for a variety of enhancer architectures shows how data quality impacts the inference and also how it can find local regulatory sites that may be missed in a global approach. We also apply the method to recently measured STARR-seq data on androgen receptor (AR) bound sequences, a TF that plays an important role in the regulation of prostate cancer. The method identifies key regulatory sites within these sequences, which are found to overlap with binding sites of known coregulators of AR.</p>","PeriodicalId":48698,"journal":{"name":"Physical Review E","volume":"111 2-1","pages":"024402"},"PeriodicalIF":2.2000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical Review E","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1103/PhysRevE.111.024402","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, FLUIDS & PLASMAS","Score":null,"Total":0}
引用次数: 0

Abstract

One of the primary regulatory processes in cells is transcription, during which RNA polymerase II (Pol-II) transcribes DNA into RNA. The binding of Pol-II to its site is regulated through interactions with transcription factors (TFs) that bind to DNA at enhancer cis-regulatory elements. Measuring the enhancer activity of large libraries of distinct DNA sequences is now possible using massively parallel reporter assays (MPRAs), and computational methods have been developed to identify the dominant statistical patterns of TF binding within these large datasets. Such methods are global in their approach and may overlook important regulatory sites that function only within the local context. Here we introduce a method for inferring functional regulatory sites (their number, location, and width) within an enhancer sequence based on measurements of its transcriptional activity from an MPRA method such as STARR-seq. The model is based on a mean-field thermodynamic description of Pol-II binding that includes interactions with bound TFs. Our method applied to simulated STARR-seq data for a variety of enhancer architectures shows how data quality impacts the inference and also how it can find local regulatory sites that may be missed in a global approach. We also apply the method to recently measured STARR-seq data on androgen receptor (AR) bound sequences, a TF that plays an important role in the regulation of prostate cancer. The method identifies key regulatory sites within these sequences, which are found to overlap with binding sites of known coregulators of AR.

从STARR-seq数据推断转录调控。
细胞的主要调控过程之一是转录,在此过程中,RNA聚合酶II (Pol-II)将DNA转录成RNA。Pol-II与其位点的结合是通过与转录因子(tf)的相互作用来调节的,转录因子与DNA的增强顺式调控元件结合。现在可以使用大规模平行报告基因测定(MPRAs)来测量不同DNA序列的大型文库的增强子活性,并且已经开发出计算方法来确定这些大型数据集中TF结合的主要统计模式。这些方法的方法是全球性的,可能会忽略仅在当地环境中起作用的重要调控位点。在这里,我们介绍了一种基于MPRA方法(如STARR-seq)对增强子序列转录活性的测量来推断其功能调控位点(它们的数量、位置和宽度)的方法。该模型基于Pol-II结合的平均场热力学描述,包括与结合的tf的相互作用。我们的方法应用于模拟各种增强子架构的STARR-seq数据,显示了数据质量如何影响推断,以及它如何找到全局方法中可能遗漏的局部调控位点。我们还将该方法应用于最近测量的雄激素受体(AR)结合序列的STARR-seq数据,雄激素受体(AR)结合序列是一种在前列腺癌调节中起重要作用的TF。该方法确定了这些序列中的关键调控位点,发现这些位点与已知AR共调控因子的结合位点重叠。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Physical Review E
Physical Review E PHYSICS, FLUIDS & PLASMASPHYSICS, MATHEMAT-PHYSICS, MATHEMATICAL
CiteScore
4.50
自引率
16.70%
发文量
2110
期刊介绍: Physical Review E (PRE), broad and interdisciplinary in scope, focuses on collective phenomena of many-body systems, with statistical physics and nonlinear dynamics as the central themes of the journal. Physical Review E publishes recent developments in biological and soft matter physics including granular materials, colloids, complex fluids, liquid crystals, and polymers. The journal covers fluid dynamics and plasma physics and includes sections on computational and interdisciplinary physics, for example, complex networks.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信