{"title":"Prediction of target genes and functional types of cis-regulatory modules in the human genome reveals their distinct properties.","authors":"Sisi Yuan, Pengyu Ni, Zhengchang Su","doi":"10.1186/s12915-025-02313-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Cis-regulatory modules (CRMs) such as enhancers and silencers play critical roles in virtually all biological processes by enhancing and repressing, respectively, the transcription of their target genes in specific cell types. Although numerous CRMs have been predicted in genomes, identifying their target genes remains a challenge due to low quality of the predicted CRMs and the fact that CRMs often do not regulate their closest genes.</p><p><strong>Results: </strong>We developed a method - correlation and physical proximity (CAPP) by leveraging our recently predicted 1.2 M CRMs in the human genome. CAPP is able to not only predict the CRMs' target genes but also their functional types using only chromatin accessibility (CA) and RNA-seq data in a panel of cell/tissue types plus Hi-C data in a few cell types. Applying CAPP to a panel of only 107 cell/tissue types with CA and RNA-seq data available, we predict target genes for 14.3% of the 1.2 M CRMs, of which 1.4% are predicted as both enhancers and silencers (dual functional CRMs), 98.2% as exclusive enhancers, and 0.4% as exclusive silencers. Dual functional CRMs tend to regulate more distant genes than exclusive enhancers and silencers. Enhancers tend to cooperate with other enhancers, whereas silencers typically act independently. Silencers preferentially regulate genes expressed across many cell/tissue types, while enhancers are prone to regulate genes expressed in fewer cell/tissue types.</p><p><strong>Conclusions: </strong>CAPP represents a significant advancement in predicting target genes and functional types of CRMs, especially dual functional CRMs, and different types of CRMs show distinct properties.</p>","PeriodicalId":9339,"journal":{"name":"BMC Biology","volume":"23 1","pages":"211"},"PeriodicalIF":4.4000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12261737/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12915-025-02313-9","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Cis-regulatory modules (CRMs) such as enhancers and silencers play critical roles in virtually all biological processes by enhancing and repressing, respectively, the transcription of their target genes in specific cell types. Although numerous CRMs have been predicted in genomes, identifying their target genes remains a challenge due to low quality of the predicted CRMs and the fact that CRMs often do not regulate their closest genes.
Results: We developed a method - correlation and physical proximity (CAPP) by leveraging our recently predicted 1.2 M CRMs in the human genome. CAPP is able to not only predict the CRMs' target genes but also their functional types using only chromatin accessibility (CA) and RNA-seq data in a panel of cell/tissue types plus Hi-C data in a few cell types. Applying CAPP to a panel of only 107 cell/tissue types with CA and RNA-seq data available, we predict target genes for 14.3% of the 1.2 M CRMs, of which 1.4% are predicted as both enhancers and silencers (dual functional CRMs), 98.2% as exclusive enhancers, and 0.4% as exclusive silencers. Dual functional CRMs tend to regulate more distant genes than exclusive enhancers and silencers. Enhancers tend to cooperate with other enhancers, whereas silencers typically act independently. Silencers preferentially regulate genes expressed across many cell/tissue types, while enhancers are prone to regulate genes expressed in fewer cell/tissue types.
Conclusions: CAPP represents a significant advancement in predicting target genes and functional types of CRMs, especially dual functional CRMs, and different types of CRMs show distinct properties.
背景:顺式调控模块(CRMs),如增强子和沉默子,通过在特定细胞类型中分别增强和抑制其靶基因的转录,在几乎所有的生物过程中发挥关键作用。尽管已经在基因组中预测了许多CRMs,但由于预测的CRMs质量较低,而且CRMs通常不调节其最接近的基因,因此确定其靶基因仍然是一个挑战。结果:通过利用我们最近预测的人类基因组中的1.2 M crm,我们开发了一种方法-相关性和物理接近性(CAPP)。CAPP不仅能够预测crm的靶基因,还能够预测其功能类型,仅使用细胞/组织类型面板中的染色质可及性(CA)和RNA-seq数据以及少数细胞类型中的Hi-C数据。将CAPP应用于仅107种具有CA和RNA-seq数据的细胞/组织类型,我们预测了1.2 M crm中14.3%的靶基因,其中1.4%被预测为增强子和沉默子(双功能crm), 98.2%为独家增强子,0.4%为独家沉默子。双功能的基因调节因子往往比排他性的增强子和沉默子调节更远的基因。增强者倾向于与其他增强者合作,而沉默者通常独立行动。沉默子优先调节在许多细胞/组织类型中表达的基因,而增强子倾向于调节在较少细胞/组织类型中表达的基因。结论:CAPP在预测CRMs靶基因和功能类型,特别是双功能CRMs方面取得了重大进展,不同类型的CRMs表现出不同的特性。
期刊介绍:
BMC Biology is a broad scope journal covering all areas of biology. Our content includes research articles, new methods and tools. BMC Biology also publishes reviews, Q&A, and commentaries.