来自下一代测序数据的SNP注释

2011 11th International Conference on Intelligent Systems Design and Applications Pub Date : 2011-11-01 DOI:10.1109/ISDA.2011.6121821

J. A. Morente-Molinera, J. M. Martin, C. Cano, M. C. Celorrio, A. Blanco

{"title":"来自下一代测序数据的SNP注释","authors":"J. A. Morente-Molinera, J. M. Martin, C. Cano, M. C. Celorrio, A. Blanco","doi":"10.1109/ISDA.2011.6121821","DOIUrl":null,"url":null,"abstract":"Massive sequencing technologies are producing an increasing amount of whole genome data, which need to be explored and analyzed. New computational tools are thus required to deal with the dimensionality and complexity of these data. Single Nucleotide Polymorphisms (SNPs) are the most common human genome variation and can be involved in disease conditions. Identifying SNPs and annotating its functional and clinical role in whole human genomes is a challenging task, which requires expert curation. There are several software tools that assist researchers in the SNP calling and SNP annotation processes. However, these tools do not focus on the association of SNPs to regulatory regions such as Transcription Factor Binding Sites (TFBSs). This paper proposes a methodology to assist the annotation of SNPs in whole genome sequences, including not only genes but also known TFBSs. Our main contribution is that we use an intuitionistic-based similarity measure (SCintuit [1]), based on fuzzy technology and intuitionistic sets, to perform accurate comparisons between DNA sequences and identify TFBSs affected by a SNP.","PeriodicalId":433207,"journal":{"name":"2011 11th International Conference on Intelligent Systems Design and Applications","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"SNP annotation from next generation sequencing data\",\"authors\":\"J. A. Morente-Molinera, J. M. Martin, C. Cano, M. C. Celorrio, A. Blanco\",\"doi\":\"10.1109/ISDA.2011.6121821\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Massive sequencing technologies are producing an increasing amount of whole genome data, which need to be explored and analyzed. New computational tools are thus required to deal with the dimensionality and complexity of these data. Single Nucleotide Polymorphisms (SNPs) are the most common human genome variation and can be involved in disease conditions. Identifying SNPs and annotating its functional and clinical role in whole human genomes is a challenging task, which requires expert curation. There are several software tools that assist researchers in the SNP calling and SNP annotation processes. However, these tools do not focus on the association of SNPs to regulatory regions such as Transcription Factor Binding Sites (TFBSs). This paper proposes a methodology to assist the annotation of SNPs in whole genome sequences, including not only genes but also known TFBSs. Our main contribution is that we use an intuitionistic-based similarity measure (SCintuit [1]), based on fuzzy technology and intuitionistic sets, to perform accurate comparisons between DNA sequences and identify TFBSs affected by a SNP.\",\"PeriodicalId\":433207,\"journal\":{\"name\":\"2011 11th International Conference on Intelligent Systems Design and Applications\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 11th International Conference on Intelligent Systems Design and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISDA.2011.6121821\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 11th International Conference on Intelligent Systems Design and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISDA.2011.6121821","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

大规模测序技术正在产生越来越多的全基因组数据，这些数据需要探索和分析。因此需要新的计算工具来处理这些数据的维度和复杂性。单核苷酸多态性(snp)是最常见的人类基因组变异，可能与疾病状况有关。鉴定snp并注释其在整个人类基因组中的功能和临床作用是一项具有挑战性的任务，这需要专家的管理。有几个软件工具可以帮助研究人员进行SNP调用和SNP注释过程。然而，这些工具并不关注snp与转录因子结合位点(TFBSs)等调控区域的关联。本文提出了一种辅助全基因组序列snp注释的方法，该方法不仅包括基因，还包括已知的TFBSs。我们的主要贡献是使用基于直觉的相似性度量(SCintuit[1])，基于模糊技术和直觉集，在DNA序列之间进行准确的比较，并识别受SNP影响的TFBSs。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

SNP annotation from next generation sequencing data

Massive sequencing technologies are producing an increasing amount of whole genome data, which need to be explored and analyzed. New computational tools are thus required to deal with the dimensionality and complexity of these data. Single Nucleotide Polymorphisms (SNPs) are the most common human genome variation and can be involved in disease conditions. Identifying SNPs and annotating its functional and clinical role in whole human genomes is a challenging task, which requires expert curation. There are several software tools that assist researchers in the SNP calling and SNP annotation processes. However, these tools do not focus on the association of SNPs to regulatory regions such as Transcription Factor Binding Sites (TFBSs). This paper proposes a methodology to assist the annotation of SNPs in whole genome sequences, including not only genes but also known TFBSs. Our main contribution is that we use an intuitionistic-based similarity measure (SCintuit [1]), based on fuzzy technology and intuitionistic sets, to perform accurate comparisons between DNA sequences and identify TFBSs affected by a SNP.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2011 11th International Conference on Intelligent Systems Design and Applications

自引率

0.00%

发文量