改进的分支结合算法检测乳腺癌中SNP-SNP相互作用。

Li-Yeh Chuang, Hsueh-Wei Chang, Ming-Cheng Lin, Cheng-Hong Yang
{"title":"改进的分支结合算法检测乳腺癌中SNP-SNP相互作用。","authors":"Li-Yeh Chuang,&nbsp;Hsueh-Wei Chang,&nbsp;Ming-Cheng Lin,&nbsp;Cheng-Hong Yang","doi":"10.1186/2043-9113-3-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Single nucleotide polymorphisms (SNPs) in genes derived from distinct pathways are associated with a breast cancer risk. Identifying possible SNP-SNP interactions in genome-wide case-control studies is an important task when investigating genetic factors that influence common complex traits; the effects of SNP-SNP interaction need to be characterized. Furthermore, observations of the complex interplay (interactions) between SNPs for high-dimensional combinations are still computationally and methodologically challenging. An improved branch and bound algorithm with feature selection (IBBFS) is introduced to identify SNP combinations with a maximal difference of allele frequencies between the case and control groups in breast cancer, i.e., the high/low risk combinations of SNPs.</p><p><strong>Results: </strong>A total of 220 real case and 334 real control breast cancer data are used to test IBBFS and identify significant SNP combinations. We used the odds ratio (OR) as a quantitative measure to estimate the associated cancer risk of multiple SNP combinations to identify the complex biological relationships underlying the progression of breast cancer, i.e., the most likely SNP combinations. Experimental results show the estimated odds ratio of the best SNP combination with genotypes is significantly smaller than 1 (between 0.165 and 0.657) for specific SNP combinations of the tested SNPs in the low risk groups. In the high risk groups, predicted SNP combinations with genotypes are significantly greater than 1 (between 2.384 and 6.167) for specific SNP combinations of the tested SNPs.</p><p><strong>Conclusions: </strong>This study proposes an effective high-speed method to analyze SNP-SNP interactions in breast cancer association studies. A number of important SNPs are found to be significant for the high/low risk group. They can thus be considered a potential predictor for breast cancer association.</p>","PeriodicalId":73663,"journal":{"name":"Journal of clinical bioinformatics","volume":" ","pages":"4"},"PeriodicalIF":0.0000,"publicationDate":"2013-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/2043-9113-3-4","citationCount":"18","resultStr":"{\"title\":\"Improved branch and bound algorithm for detecting SNP-SNP interactions in breast cancer.\",\"authors\":\"Li-Yeh Chuang,&nbsp;Hsueh-Wei Chang,&nbsp;Ming-Cheng Lin,&nbsp;Cheng-Hong Yang\",\"doi\":\"10.1186/2043-9113-3-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Single nucleotide polymorphisms (SNPs) in genes derived from distinct pathways are associated with a breast cancer risk. Identifying possible SNP-SNP interactions in genome-wide case-control studies is an important task when investigating genetic factors that influence common complex traits; the effects of SNP-SNP interaction need to be characterized. Furthermore, observations of the complex interplay (interactions) between SNPs for high-dimensional combinations are still computationally and methodologically challenging. An improved branch and bound algorithm with feature selection (IBBFS) is introduced to identify SNP combinations with a maximal difference of allele frequencies between the case and control groups in breast cancer, i.e., the high/low risk combinations of SNPs.</p><p><strong>Results: </strong>A total of 220 real case and 334 real control breast cancer data are used to test IBBFS and identify significant SNP combinations. We used the odds ratio (OR) as a quantitative measure to estimate the associated cancer risk of multiple SNP combinations to identify the complex biological relationships underlying the progression of breast cancer, i.e., the most likely SNP combinations. Experimental results show the estimated odds ratio of the best SNP combination with genotypes is significantly smaller than 1 (between 0.165 and 0.657) for specific SNP combinations of the tested SNPs in the low risk groups. In the high risk groups, predicted SNP combinations with genotypes are significantly greater than 1 (between 2.384 and 6.167) for specific SNP combinations of the tested SNPs.</p><p><strong>Conclusions: </strong>This study proposes an effective high-speed method to analyze SNP-SNP interactions in breast cancer association studies. A number of important SNPs are found to be significant for the high/low risk group. They can thus be considered a potential predictor for breast cancer association.</p>\",\"PeriodicalId\":73663,\"journal\":{\"name\":\"Journal of clinical bioinformatics\",\"volume\":\" \",\"pages\":\"4\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-02-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/2043-9113-3-4\",\"citationCount\":\"18\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of clinical bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/2043-9113-3-4\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of clinical bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/2043-9113-3-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

摘要

背景:来自不同途径的基因中的单核苷酸多态性(snp)与乳腺癌风险相关。在研究影响常见复杂性状的遗传因素时,在全基因组病例对照研究中确定可能的SNP-SNP相互作用是一项重要任务;SNP-SNP相互作用的影响需要被表征。此外,观察高维组合的snp之间复杂的相互作用(相互作用)在计算和方法上仍然具有挑战性。提出了一种改进的分支结合特征选择算法(branch and bound algorithm with feature selection, IBBFS),用于识别乳腺癌病例组与对照组之间等位基因频率差异最大的SNP组合,即SNP的高/低风险组合。结果:共使用220例真实病例和334例真实对照乳腺癌数据进行IBBFS检测,并发现显著SNP组合。我们使用比值比(OR)作为定量测量来估计多个SNP组合的相关癌症风险,以确定乳腺癌进展背后的复杂生物学关系,即最可能的SNP组合。实验结果显示,低风险人群中所检测SNP的特定SNP组合的最佳组合与基因型的比值比显著小于1(在0.165 ~ 0.657之间)。在高危人群中,检测SNP的特定SNP组合预测与基因型的SNP组合显著大于1(2.384 ~ 6.167)。结论:本研究提出了一种有效的快速分析乳腺癌关联研究中SNP-SNP相互作用的方法。许多重要的snp被发现对高/低风险群体是显著的。因此,它们可以被认为是乳腺癌关联的潜在预测因子。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Improved branch and bound algorithm for detecting SNP-SNP interactions in breast cancer.

Improved branch and bound algorithm for detecting SNP-SNP interactions in breast cancer.

Improved branch and bound algorithm for detecting SNP-SNP interactions in breast cancer.

Background: Single nucleotide polymorphisms (SNPs) in genes derived from distinct pathways are associated with a breast cancer risk. Identifying possible SNP-SNP interactions in genome-wide case-control studies is an important task when investigating genetic factors that influence common complex traits; the effects of SNP-SNP interaction need to be characterized. Furthermore, observations of the complex interplay (interactions) between SNPs for high-dimensional combinations are still computationally and methodologically challenging. An improved branch and bound algorithm with feature selection (IBBFS) is introduced to identify SNP combinations with a maximal difference of allele frequencies between the case and control groups in breast cancer, i.e., the high/low risk combinations of SNPs.

Results: A total of 220 real case and 334 real control breast cancer data are used to test IBBFS and identify significant SNP combinations. We used the odds ratio (OR) as a quantitative measure to estimate the associated cancer risk of multiple SNP combinations to identify the complex biological relationships underlying the progression of breast cancer, i.e., the most likely SNP combinations. Experimental results show the estimated odds ratio of the best SNP combination with genotypes is significantly smaller than 1 (between 0.165 and 0.657) for specific SNP combinations of the tested SNPs in the low risk groups. In the high risk groups, predicted SNP combinations with genotypes are significantly greater than 1 (between 2.384 and 6.167) for specific SNP combinations of the tested SNPs.

Conclusions: This study proposes an effective high-speed method to analyze SNP-SNP interactions in breast cancer association studies. A number of important SNPs are found to be significant for the high/low risk group. They can thus be considered a potential predictor for breast cancer association.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信