InSilico-ChIP: A Coregulation and Evolutionary Conservation Based Transcription Factor and Target Gene Predictor

M. Munoz, A. Zambon
{"title":"InSilico-ChIP: A Coregulation and Evolutionary Conservation Based Transcription Factor and Target Gene Predictor","authors":"M. Munoz, A. Zambon","doi":"10.1109/HISB.2012.47","DOIUrl":null,"url":null,"abstract":"Chromatin Immunoprecipitation followed by high-content sequencing (ChIP-seq) is a powerful approach for identifying bonafide transcription factor (TF) binding sites, however these studies can be difficult, time-consuming and costly. They require ChIP-validated antibodies and a priori knowledge of which TF to pull down. Moreover, to gain further mechanistic insights, transcriptomic data is required to determine if TF binding alters proximal gene expression. Determining regulatory pathways from expression data is not an easy task. The typical result of a gene expression experiment, a set of co-regulated genes, may not include the relevant TF at all. Many such factors become active through mechanisms other than a change in their level of expression. To understand how such a set of genes is co-regulated, it is necessary to find evidence of shared TF binding sites (TFBS). This approach presents its own set of problems. TFBS are identified by short sequences that can exist by chance without being biologically functional. An evolutionary perspective is required to consider only those functionally important sites that are conserved between the promoters of the genes in question and those of their orthologs in related species. Furthermore, the common occurrence of short TFBS makes it necessary to consider only those TFs that are significantly more common in the co-regulated genes than in the genome (or microarray) as a whole. InSilico-ChIP is a precomputed database of evolutionarily conserved TFBS for various species, which accepts a set of genes and quickly returns the conserved TFs that are statistically over-represented in the proximal promoter regions of those genes. It allows new species data to be created using only a whole genome alignment with a related species and gene locations. Several methods of identifying TF binding sites can be used, varying in alignment type, location, conservation restrictions, and TFBS matrices used to analyze the promoter regions.","PeriodicalId":375089,"journal":{"name":"2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HISB.2012.47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Chromatin Immunoprecipitation followed by high-content sequencing (ChIP-seq) is a powerful approach for identifying bonafide transcription factor (TF) binding sites, however these studies can be difficult, time-consuming and costly. They require ChIP-validated antibodies and a priori knowledge of which TF to pull down. Moreover, to gain further mechanistic insights, transcriptomic data is required to determine if TF binding alters proximal gene expression. Determining regulatory pathways from expression data is not an easy task. The typical result of a gene expression experiment, a set of co-regulated genes, may not include the relevant TF at all. Many such factors become active through mechanisms other than a change in their level of expression. To understand how such a set of genes is co-regulated, it is necessary to find evidence of shared TF binding sites (TFBS). This approach presents its own set of problems. TFBS are identified by short sequences that can exist by chance without being biologically functional. An evolutionary perspective is required to consider only those functionally important sites that are conserved between the promoters of the genes in question and those of their orthologs in related species. Furthermore, the common occurrence of short TFBS makes it necessary to consider only those TFs that are significantly more common in the co-regulated genes than in the genome (or microarray) as a whole. InSilico-ChIP is a precomputed database of evolutionarily conserved TFBS for various species, which accepts a set of genes and quickly returns the conserved TFs that are statistically over-represented in the proximal promoter regions of those genes. It allows new species data to be created using only a whole genome alignment with a related species and gene locations. Several methods of identifying TF binding sites can be used, varying in alignment type, location, conservation restrictions, and TFBS matrices used to analyze the promoter regions.
InSilico-ChIP:一个基于协同调节和进化保护的转录因子和靶基因预测因子
染色质免疫沉淀与高含量测序(ChIP-seq)是鉴定真实转录因子(TF)结合位点的一种有效方法,然而这些研究可能困难,耗时且昂贵。它们需要经过chip验证的抗体和要拉下哪个TF的先验知识。此外,为了获得进一步的机制见解,需要转录组学数据来确定TF结合是否改变近端基因表达。从表达数据中确定调控途径并不是一件容易的事。基因表达实验的典型结果是一组共调控基因,可能根本不包括相关的TF。许多这样的因素不是通过表达水平的变化,而是通过机制变得活跃起来。为了了解这组基因是如何协同调节的,有必要找到共享TF结合位点(TFBS)的证据。这种方法有其自身的问题。TFBS是通过短序列识别的,这些短序列可以偶然存在,而不具有生物学功能。从进化的角度来看,只考虑那些在有问题的基因启动子和相关物种的同源基因启动子之间保守的功能重要位点。此外,短TFBS的普遍存在使得我们有必要只考虑那些在共调控基因中比在整个基因组(或微阵列)中更常见的TFs。insilicon - chip是一个预先计算的各种物种的进化保守TFBS数据库,它接受一组基因,并迅速返回在这些基因的近端启动子区域统计上过度代表的保守tffs。它允许只使用与相关物种和基因位置的全基因组比对就可以创建新的物种数据。可以使用几种识别TF结合位点的方法,这些方法在排列类型、位置、保护限制和用于分析启动子区域的TFBS矩阵方面有所不同。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信