CREdb: A comprehensive database of Cis-Regulatory Elements and their activity in human cells and tissues.

IF 4.2 2区 生物学 Q1 GENETICS & HEREDITY
Chris Hartl, Jiali Zhuang, Aaron Tyler, Bing Zhou, Emily Wong, David Merberg, Brad Farrell, Chris DeBoever, Julie Bryant, Dorothée Diogo
{"title":"CREdb: A comprehensive database of Cis-Regulatory Elements and their activity in human cells and tissues.","authors":"Chris Hartl, Jiali Zhuang, Aaron Tyler, Bing Zhou, Emily Wong, David Merberg, Brad Farrell, Chris DeBoever, Julie Bryant, Dorothée Diogo","doi":"10.1186/s13072-024-00545-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Cis-regulatory elements (CREs) play a pivotal role in gene expression regulation, allowing cells to serve diverse functions and respond to external stimuli. Understanding CREs is essential for personalized medicine and disease research, as an increasing number of genetic variants associated with phenotypes and diseases overlap with CREs. However, existing databases often focus on subsets of regulatory elements and present each identified instance of element individually, confounding the effort to obtain a comprehensive view. To address this gap, we have created CREdb, a comprehensive database with over 10 million human regulatory elements across 1,058 cell types and 315 tissues harmonized from different data sources. We curated and aligned the cell types and tissues to standard ontologies for efficient data query.</p><p><strong>Results: </strong>Data from 11 sources were curated and mapped to standard ontological terms. 11,223,434 combined elements are present in the final database, and these were merged into 5,666,240 consensus elements representing the combined ranges of the individual elements informed by their overlap. Each consensus element contains curated metadata including the number of elements supporting it and a hash linking to the source databases. The inferred activity of each consensus element in various cell-type and tissue context is also provided. Examples presented here show the potential utility of CREdb in annotating non-coding genetic variants and informing chromatin accessibility profiling analysis.</p><p><strong>Conclusions: </strong>We developed CREdb, a comprehensive database of CREs, to simplify the analysis of CREs by providing a unified framework for researchers. CREdb compiles consensus ranges for each element by integrating the information from all instances identified across various source databases. This unified database facilitates the functional annotation of non-coding genetic variants and complements chromatin accessibility profiling analysis. CREdb will serve as an important resource in expanding our knowledge of the epigenome and its role in human diseases.</p>","PeriodicalId":49253,"journal":{"name":"Epigenetics & Chromatin","volume":null,"pages":null},"PeriodicalIF":4.2000,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11253421/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epigenetics & Chromatin","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13072-024-00545-7","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Cis-regulatory elements (CREs) play a pivotal role in gene expression regulation, allowing cells to serve diverse functions and respond to external stimuli. Understanding CREs is essential for personalized medicine and disease research, as an increasing number of genetic variants associated with phenotypes and diseases overlap with CREs. However, existing databases often focus on subsets of regulatory elements and present each identified instance of element individually, confounding the effort to obtain a comprehensive view. To address this gap, we have created CREdb, a comprehensive database with over 10 million human regulatory elements across 1,058 cell types and 315 tissues harmonized from different data sources. We curated and aligned the cell types and tissues to standard ontologies for efficient data query.

Results: Data from 11 sources were curated and mapped to standard ontological terms. 11,223,434 combined elements are present in the final database, and these were merged into 5,666,240 consensus elements representing the combined ranges of the individual elements informed by their overlap. Each consensus element contains curated metadata including the number of elements supporting it and a hash linking to the source databases. The inferred activity of each consensus element in various cell-type and tissue context is also provided. Examples presented here show the potential utility of CREdb in annotating non-coding genetic variants and informing chromatin accessibility profiling analysis.

Conclusions: We developed CREdb, a comprehensive database of CREs, to simplify the analysis of CREs by providing a unified framework for researchers. CREdb compiles consensus ranges for each element by integrating the information from all instances identified across various source databases. This unified database facilitates the functional annotation of non-coding genetic variants and complements chromatin accessibility profiling analysis. CREdb will serve as an important resource in expanding our knowledge of the epigenome and its role in human diseases.

CREdb:顺式调控元件及其在人体细胞和组织中的活性的综合数据库。
背景:顺式调控元件(CREs)在基因表达调控中发挥着关键作用,使细胞能够发挥多种功能并对外部刺激做出反应。由于越来越多与表型和疾病相关的基因变异与 CREs 重叠,因此了解 CREs 对个性化医疗和疾病研究至关重要。然而,现有的数据库通常只关注调控元件的子集,并单独呈现每个已识别的元件实例,从而影响了获得全面观点的努力。为了填补这一空白,我们创建了 CREdb,这是一个全面的数据库,包含 1,058 种细胞类型和 315 种组织中的 1,000 多万个人类调控元件,由不同的数据源协调而成。我们根据标准本体对细胞类型和组织进行了整理和对齐,以实现高效的数据查询:我们对来自 11 个数据源的数据进行了整理,并将其映射到标准本体术语。最终数据库中有 11,223,434 个组合元素,这些元素被合并为 5,666,240 个共识元素,代表了各个元素重叠后的组合范围。每个共识元素都包含经过整理的元数据,包括支持该元素的元素数量和链接到源数据库的哈希值。此外,还提供了每个共识元素在不同细胞类型和组织背景下的推断活性。这里介绍的例子显示了 CREdb 在注释非编码基因变异和为染色质可及性剖析分析提供信息方面的潜在用途:我们开发的 CREdb 是一个全面的 CREs 数据库,它为研究人员提供了一个统一的框架,从而简化了对 CREs 的分析。CREdb 通过整合在各种来源数据库中发现的所有实例的信息,为每个元素编制了共识范围。这个统一的数据库有助于对非编码基因变异进行功能注释,并补充染色质可及性分析。CREdb 将成为拓展我们对表观基因组及其在人类疾病中作用的认识的重要资源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Epigenetics & Chromatin
Epigenetics & Chromatin GENETICS & HEREDITY-
CiteScore
7.00
自引率
0.00%
发文量
35
审稿时长
1 months
期刊介绍: Epigenetics & Chromatin is a peer-reviewed, open access, online journal that publishes research, and reviews, providing novel insights into epigenetic inheritance and chromatin-based interactions. The journal aims to understand how gene and chromosomal elements are regulated and their activities maintained during processes such as cell division, differentiation and environmental alteration.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信