{"title":"Approximation Algorithms for the Selection of Robust Tag SNPs","authors":"Kui Zhang, K. Chao, Yao-Ting Huang, Ting Chen","doi":"10.1201/9781420010749.ch77","DOIUrl":null,"url":null,"abstract":"Recent studies have shown that the chromosomal recombination only takes places at some narrow hotspots. Within the chromosomal region between these hotspots (called haplotype block), little or even no recombination occurs, and a small subset of SNPs (called tag SNPs) is sufficient to capture the haplotype pattern of the block. In reality, the tag SNPs may be genotyped as missing data, and we may fail to distinguish two distinct haplotypes due to the ambiguity caused by missing data. In this paper, we formulate this problem as finding a set of SNPs (called robust tag SNPs) which is able to tolerate missing data. To find robust tag SNPs, we propose two greedy and one LP-relaxation algorithms which give solutions of \\((m+1)\\ln\\frac{K(K-1)}{2}\\), \\(\\ln((m+1)\\frac{K(K-1)}{2})\\), and O(mln K) approximation respectively, where m is the number of SNPs allowed for missing data and K is the number of patterns in the block.","PeriodicalId":262519,"journal":{"name":"Handbook of Approximation Algorithms and Metaheuristics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Handbook of Approximation Algorithms and Metaheuristics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1201/9781420010749.ch77","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Recent studies have shown that the chromosomal recombination only takes places at some narrow hotspots. Within the chromosomal region between these hotspots (called haplotype block), little or even no recombination occurs, and a small subset of SNPs (called tag SNPs) is sufficient to capture the haplotype pattern of the block. In reality, the tag SNPs may be genotyped as missing data, and we may fail to distinguish two distinct haplotypes due to the ambiguity caused by missing data. In this paper, we formulate this problem as finding a set of SNPs (called robust tag SNPs) which is able to tolerate missing data. To find robust tag SNPs, we propose two greedy and one LP-relaxation algorithms which give solutions of \((m+1)\ln\frac{K(K-1)}{2}\), \(\ln((m+1)\frac{K(K-1)}{2})\), and O(mln K) approximation respectively, where m is the number of SNPs allowed for missing data and K is the number of patterns in the block.