{"title":"Multiple overlapping binding sites determine transcription factor occupancy","authors":"Shubham Khetan, Brent S. Carroll, Martha L. Bulyk","doi":"10.1038/s41586-025-09472-3","DOIUrl":null,"url":null,"abstract":"Transcription factors (TFs) regulate gene expression by interacting with DNA in a sequence-specific manner. High-throughput in vitro technologies, such as protein-binding microarrays1–6 and HT-SELEX (high-throughput systematic evolution of ligands by exponential enrichment)7,8, have revealed the DNA-binding specificities of hundreds of TFs. However, they have limited ability to reliably identify lower-affinity DNA binding sites, which are increasingly recognized as important for precise spatiotemporal control of gene expression9–19. Here, to address this limitation, we developed protein affinity to DNA by in vitro transcription and RNA sequencing (PADIT-seq), with which we comprehensively assayed the binding preferences of six TFs to all possible ten-base-pair DNA sequences, detecting hundreds of novel, lower-affinity binding sites. The expanded repertoire of lower-affinity binding sites revealed that nucleotides flanking high-affinity DNA binding sites create overlapping lower-affinity sites that together modulate TF genomic occupancy in vivo. We propose a model in which TF binding is not determined by individual binding sites, but rather by the sum of multiple, overlapping binding sites. The overlapping binding model explains how competition between paralogous TFs for shared high-affinity binding sites is determined by flanking nucleotides that create differential numbers of overlapping, lower-affinity binding sites. Critically, the model transforms our understanding of noncoding-variant effects, revealing how single nucleotide changes simultaneously alter multiple overlapping sites to additively influence gene expression and human traits, including diseases. A new method enables comprehensive screening and identification of low-affinity DNA binding sites for transcription factors, and reveals that nucleotides flanking high-affinity binding sites create overlapping low-affinity binding sites that modulate transcription factor binding in vivo.","PeriodicalId":18787,"journal":{"name":"Nature","volume":"646 8086","pages":"1001-1011"},"PeriodicalIF":48.5000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature","FirstCategoryId":"103","ListUrlMain":"https://www.nature.com/articles/s41586-025-09472-3","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Transcription factors (TFs) regulate gene expression by interacting with DNA in a sequence-specific manner. High-throughput in vitro technologies, such as protein-binding microarrays1–6 and HT-SELEX (high-throughput systematic evolution of ligands by exponential enrichment)7,8, have revealed the DNA-binding specificities of hundreds of TFs. However, they have limited ability to reliably identify lower-affinity DNA binding sites, which are increasingly recognized as important for precise spatiotemporal control of gene expression9–19. Here, to address this limitation, we developed protein affinity to DNA by in vitro transcription and RNA sequencing (PADIT-seq), with which we comprehensively assayed the binding preferences of six TFs to all possible ten-base-pair DNA sequences, detecting hundreds of novel, lower-affinity binding sites. The expanded repertoire of lower-affinity binding sites revealed that nucleotides flanking high-affinity DNA binding sites create overlapping lower-affinity sites that together modulate TF genomic occupancy in vivo. We propose a model in which TF binding is not determined by individual binding sites, but rather by the sum of multiple, overlapping binding sites. The overlapping binding model explains how competition between paralogous TFs for shared high-affinity binding sites is determined by flanking nucleotides that create differential numbers of overlapping, lower-affinity binding sites. Critically, the model transforms our understanding of noncoding-variant effects, revealing how single nucleotide changes simultaneously alter multiple overlapping sites to additively influence gene expression and human traits, including diseases. A new method enables comprehensive screening and identification of low-affinity DNA binding sites for transcription factors, and reveals that nucleotides flanking high-affinity binding sites create overlapping low-affinity binding sites that modulate transcription factor binding in vivo.
期刊介绍:
Nature is a prestigious international journal that publishes peer-reviewed research in various scientific and technological fields. The selection of articles is based on criteria such as originality, importance, interdisciplinary relevance, timeliness, accessibility, elegance, and surprising conclusions. In addition to showcasing significant scientific advances, Nature delivers rapid, authoritative, insightful news, and interpretation of current and upcoming trends impacting science, scientists, and the broader public. The journal serves a dual purpose: firstly, to promptly share noteworthy scientific advances and foster discussions among scientists, and secondly, to ensure the swift dissemination of scientific results globally, emphasizing their significance for knowledge, culture, and daily life.