基于最近邻分类性能的阵列CGH数据差异异常区域检测

Q3 Biochemistry, Genetics and Molecular Biology

IPSJ Transactions on Bioinformatics Pub Date : 2010-10-13 DOI:10.2197/IPSJTBIO.3.70

Yuta Ishikawa, I. Takeuchi

{"title":"基于最近邻分类性能的阵列CGH数据差异异常区域检测","authors":"Yuta Ishikawa, I. Takeuchi","doi":"10.2197/IPSJTBIO.3.70","DOIUrl":null,"url":null,"abstract":"Array CGH is a useful technology for detecting copy number aberrations in genome-wide scale. We study the problem of detecting differentially aberrant genomic regions in two or more groups of CGH arrays and estimating the statistical significance of those regions. An important property of array CGH data is that there are spatial correlations among probes, and we need to take this fact into consideration when we develop a computational algorithm for array CGH data analysis. In this paper we first discuss three difficult issues underlying this problem, and then introduce nearest-neighbor multivariate test in order to alleviate these difficulties. Our proposed approach has three advantages. First, it can incorporate the spatial correlation among probes. Second, genomic regions with different sizes can be analyzed in a common ground. And finally, the computational cost can be considerably reduced with the use of a simple trick. We demonstrate the effectiveness of our approach through an application to previously published array CGH data set on 75 malignant lymphoma patients.","PeriodicalId":38959,"journal":{"name":"IPSJ Transactions on Bioinformatics","volume":"3 1","pages":"70-81"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2197/IPSJTBIO.3.70","citationCount":"0","resultStr":"{\"title\":\"Differentially Aberrant Region Detection in Array CGH Data Based on Nearest Neighbor Classification Performance\",\"authors\":\"Yuta Ishikawa, I. Takeuchi\",\"doi\":\"10.2197/IPSJTBIO.3.70\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Array CGH is a useful technology for detecting copy number aberrations in genome-wide scale. We study the problem of detecting differentially aberrant genomic regions in two or more groups of CGH arrays and estimating the statistical significance of those regions. An important property of array CGH data is that there are spatial correlations among probes, and we need to take this fact into consideration when we develop a computational algorithm for array CGH data analysis. In this paper we first discuss three difficult issues underlying this problem, and then introduce nearest-neighbor multivariate test in order to alleviate these difficulties. Our proposed approach has three advantages. First, it can incorporate the spatial correlation among probes. Second, genomic regions with different sizes can be analyzed in a common ground. And finally, the computational cost can be considerably reduced with the use of a simple trick. We demonstrate the effectiveness of our approach through an application to previously published array CGH data set on 75 malignant lymphoma patients.\",\"PeriodicalId\":38959,\"journal\":{\"name\":\"IPSJ Transactions on Bioinformatics\",\"volume\":\"3 1\",\"pages\":\"70-81\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.2197/IPSJTBIO.3.70\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IPSJ Transactions on Bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2197/IPSJTBIO.3.70\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IPSJ Transactions on Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2197/IPSJTBIO.3.70","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}

引用次数: 0

摘要

阵列CGH是一种在全基因组范围内检测拷贝数畸变的有效技术。我们研究了在两组或多组CGH阵列中检测差异异常基因组区域并估计这些区域的统计显著性的问题。阵列CGH数据的一个重要特性是探测器之间存在空间相关性，在开发阵列CGH数据分析的计算算法时需要考虑到这一点。本文首先讨论了这一问题背后的三个难点问题，然后引入了最近邻多元检验来缓解这些困难。我们提出的方法有三个优点。首先，它可以纳入探针之间的空间相关性。其次，不同大小的基因组区域可以在一个共同的基础上进行分析。最后，使用一个简单的技巧可以大大降低计算成本。我们通过对先前发表的75例恶性淋巴瘤患者的阵列CGH数据集的应用证明了我们方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Differentially Aberrant Region Detection in Array CGH Data Based on Nearest Neighbor Classification Performance

Array CGH is a useful technology for detecting copy number aberrations in genome-wide scale. We study the problem of detecting differentially aberrant genomic regions in two or more groups of CGH arrays and estimating the statistical significance of those regions. An important property of array CGH data is that there are spatial correlations among probes, and we need to take this fact into consideration when we develop a computational algorithm for array CGH data analysis. In this paper we first discuss three difficult issues underlying this problem, and then introduce nearest-neighbor multivariate test in order to alleviate these difficulties. Our proposed approach has three advantages. First, it can incorporate the spatial correlation among probes. Second, genomic regions with different sizes can be analyzed in a common ground. And finally, the computational cost can be considerably reduced with the use of a simple trick. We demonstrate the effectiveness of our approach through an application to previously published array CGH data set on 75 malignant lymphoma patients.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IPSJ Transactions on Bioinformatics Biochemistry, Genetics and Molecular Biology-Biochemistry, Genetics and Molecular Biology (miscellaneous)

CiteScore

1.90

自引率

0.00%

发文量