{"title":"利用snp片段和相关基因型重建单倍型的概率分支结合方法","authors":"Rasoul Taghipour, Naemeh Ganoodi, E. Asgarian","doi":"10.1109/ICI.2011.17","DOIUrl":null,"url":null,"abstract":"Most positions of the human genome are typically invariant (99%) and only some positions (1%) are commonly invariant which are associated with complex genetic diseases. Haplotype reconstruction problem divide aligned single nucleotide polymorphism (SNP) fragments into two classes and infer a pair of haplotypes from them. An important computational model of this problem is minimum error correction (MEC) but it is only effective when the error rate of the fragments is low. MEC/GI as an extension to MEC employs the compatible genotype information besides the SNP fragments and so results in a more accurate inference. The haplotyping problems, due to its NP-hardness, several computational and heuristic methods have addressed the problem seeking feasible answers. In this paper, we develop a new branch-and-bound algorithm with running time O([(n-h)/k]2ˆh x nm) in which m is maximum length of SNP fragments where SNP sites are heterozygous, n is the number of fragments and h is depth of our exploration in binary tree. Since h (h?n) is small in real biological applications, our proposed algorithm is practical and efficient.","PeriodicalId":146712,"journal":{"name":"2011 First International Conference on Informatics and Computational Intelligence","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Probabilistic Branch-and-Bound Approach for Reconstructing of Haplotype Using SNP-Fragments and Related Genotype\",\"authors\":\"Rasoul Taghipour, Naemeh Ganoodi, E. Asgarian\",\"doi\":\"10.1109/ICI.2011.17\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Most positions of the human genome are typically invariant (99%) and only some positions (1%) are commonly invariant which are associated with complex genetic diseases. Haplotype reconstruction problem divide aligned single nucleotide polymorphism (SNP) fragments into two classes and infer a pair of haplotypes from them. An important computational model of this problem is minimum error correction (MEC) but it is only effective when the error rate of the fragments is low. MEC/GI as an extension to MEC employs the compatible genotype information besides the SNP fragments and so results in a more accurate inference. The haplotyping problems, due to its NP-hardness, several computational and heuristic methods have addressed the problem seeking feasible answers. In this paper, we develop a new branch-and-bound algorithm with running time O([(n-h)/k]2ˆh x nm) in which m is maximum length of SNP fragments where SNP sites are heterozygous, n is the number of fragments and h is depth of our exploration in binary tree. Since h (h?n) is small in real biological applications, our proposed algorithm is practical and efficient.\",\"PeriodicalId\":146712,\"journal\":{\"name\":\"2011 First International Conference on Informatics and Computational Intelligence\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 First International Conference on Informatics and Computational Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICI.2011.17\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 First International Conference on Informatics and Computational Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICI.2011.17","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
人类基因组的大多数位置(99%)是典型不变的,只有一些位置(1%)是与复杂遗传疾病相关的通常不变的。单倍型重建问题将排列的单核苷酸多态性(SNP)片段分为两类,并由此推断出一对单倍型。该问题的一个重要计算模型是最小误差校正(MEC),但它仅在片段错误率较低时有效。MEC/GI作为MEC的延伸,除了使用SNP片段外,还使用了兼容的基因型信息,因此推断更加准确。单倍型问题,由于其np -硬度,一些计算和启发式方法解决了寻找可行答案的问题。本文提出了一种运行时间为O([(n-h)/k]2 h × nm)的分支定界算法,其中m为杂合SNP片段的最大长度,n为片段数,h为我们在二叉树中的探索深度。由于h (h?n)在实际生物应用中很小,因此我们提出的算法是实用和高效的。
A Probabilistic Branch-and-Bound Approach for Reconstructing of Haplotype Using SNP-Fragments and Related Genotype
Most positions of the human genome are typically invariant (99%) and only some positions (1%) are commonly invariant which are associated with complex genetic diseases. Haplotype reconstruction problem divide aligned single nucleotide polymorphism (SNP) fragments into two classes and infer a pair of haplotypes from them. An important computational model of this problem is minimum error correction (MEC) but it is only effective when the error rate of the fragments is low. MEC/GI as an extension to MEC employs the compatible genotype information besides the SNP fragments and so results in a more accurate inference. The haplotyping problems, due to its NP-hardness, several computational and heuristic methods have addressed the problem seeking feasible answers. In this paper, we develop a new branch-and-bound algorithm with running time O([(n-h)/k]2ˆh x nm) in which m is maximum length of SNP fragments where SNP sites are heterozygous, n is the number of fragments and h is depth of our exploration in binary tree. Since h (h?n) is small in real biological applications, our proposed algorithm is practical and efficient.