Group testing with DNA chips: generating designs and decoding experiments.

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2003-01-01

Alexander Schliep, David C Torney, Sven Rahmann

{"title":"Group testing with DNA chips: generating designs and decoding experiments.","authors":"Alexander Schliep, David C Torney, Sven Rahmann","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>DNA microarrays are a valuable tool for massively parallel DNA-DNA hybridization experiments. Currently, most applications rely on the existence of sequence-specific oligonucleotide probes. In large families of closely related target sequences, such as different virus subtypes, the high degree of similarity often makes it impossible to find a unique probe for every target. Fortunately, this is unnecessary. We propose a microarray design methodology based on a group testing approach. While probes might bind to multiple targets simultaneously, a properly chosen probe set can still unambiguously distinguish the presence of one target set from the presence of a different target set. Our method is the first one that explicitly takes cross-hybridization and experimental errors into account while accommodating several targets. The approach consists of three steps: (1) Pre-selection of probe candidates, (2) Generation of a suitable group testing design, and (3) Decoding of hybridization results to infer presence or absence of individual targets. Our results show that this approach is very promising, even for challenging data sets and experimental error rates of up to 5%. On a data set of 28S rDNA sequences we were able to identify 660 sequences, a substantial improvement over a prior approach using unique probes which only identified 408 sequences.</p>","PeriodicalId":87204,"journal":{"name":"Proceedings. IEEE Computer Society Bioinformatics Conference","volume":"2 ","pages":"84-91"},"PeriodicalIF":0.0000,"publicationDate":"2003-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE Computer Society Bioinformatics Conference","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

DNA microarrays are a valuable tool for massively parallel DNA-DNA hybridization experiments. Currently, most applications rely on the existence of sequence-specific oligonucleotide probes. In large families of closely related target sequences, such as different virus subtypes, the high degree of similarity often makes it impossible to find a unique probe for every target. Fortunately, this is unnecessary. We propose a microarray design methodology based on a group testing approach. While probes might bind to multiple targets simultaneously, a properly chosen probe set can still unambiguously distinguish the presence of one target set from the presence of a different target set. Our method is the first one that explicitly takes cross-hybridization and experimental errors into account while accommodating several targets. The approach consists of three steps: (1) Pre-selection of probe candidates, (2) Generation of a suitable group testing design, and (3) Decoding of hybridization results to infer presence or absence of individual targets. Our results show that this approach is very promising, even for challenging data sets and experimental error rates of up to 5%. On a data set of 28S rDNA sequences we were able to identify 660 sequences, a substantial improvement over a prior approach using unique probes which only identified 408 sequences.

本刊更多论文

DNA芯片组测:生成设计和解码实验。

DNA微阵列是进行大规模并行DNA-DNA杂交实验的重要工具。目前，大多数应用依赖于序列特异性寡核苷酸探针的存在。在密切相关的靶标序列的大家族中，例如不同的病毒亚型，高度的相似性往往使得不可能为每个靶标找到独特的探针。幸运的是，这是不必要的。我们提出了一种基于群体测试方法的微阵列设计方法。虽然探针可能同时绑定到多个目标，但正确选择的探针集仍然可以明确区分一个目标集的存在与另一个目标集的存在。我们的方法是第一个明确地考虑到交叉杂交和实验误差，同时适应多个目标。该方法包括三个步骤:(1)预先选择探针候选对象，(2)生成合适的组测试设计，(3)解码杂交结果以推断单个目标的存在与否。我们的结果表明，即使对于具有挑战性的数据集和高达5%的实验错误率，这种方法也是非常有前途的。在28S rDNA序列的数据集上，我们能够识别660个序列，这比之前使用独特探针只能识别408个序列的方法有了很大的改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. IEEE Computer Society Bioinformatics Conference

自引率

0.00%

发文量