高吞吐量数据源的会话间再现性度量。

Summit on translational bioinformatics Pub Date : 2008-03-01

Milos Hauskrecht, Richard Pelikan

{"title":"高吞吐量数据源的会话间再现性度量。","authors":"Milos Hauskrecht, Richard Pelikan","doi":"","DOIUrl":null,"url":null,"abstract":"High-throughput biological assays such as micro-arrays and mass spectrometry (MS) have risen as potential clinical tools for disease detection. Multiple potential biomarkers can be rapidly and cheaply evaluated for a large number of patients. Typical research and evaluation studies in these fields have focused primarily on data that were generated from samples in a single data-generation session. However, in the clinical setting, new patients screened by the technology will arrive at different times and data will unavoidably come from multiple data-generation sessions. The understanding and assessment of multi-session effects on data generated by the technology is critical for its application to clinical practice. This paper proposes a methodology for measuring and testing the reproducibility of various aspects of high-throughput data across multiple data-generation sessions. We test and demonstrate the framework on mass-spectrometry data obtained from four different data-generation sessions for the same set of samples.","PeriodicalId":89276,"journal":{"name":"Summit on translational bioinformatics","volume":"2008 ","pages":"41-5"},"PeriodicalIF":0.0000,"publicationDate":"2008-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041518/pdf/","citationCount":"0","resultStr":"{\"title\":\"Inter-session reproducibility measures for high-throughput data sources.\",\"authors\":\"Milos Hauskrecht, Richard Pelikan\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"High-throughput biological assays such as micro-arrays and mass spectrometry (MS) have risen as potential clinical tools for disease detection. Multiple potential biomarkers can be rapidly and cheaply evaluated for a large number of patients. Typical research and evaluation studies in these fields have focused primarily on data that were generated from samples in a single data-generation session. However, in the clinical setting, new patients screened by the technology will arrive at different times and data will unavoidably come from multiple data-generation sessions. The understanding and assessment of multi-session effects on data generated by the technology is critical for its application to clinical practice. This paper proposes a methodology for measuring and testing the reproducibility of various aspects of high-throughput data across multiple data-generation sessions. We test and demonstrate the framework on mass-spectrometry data obtained from four different data-generation sessions for the same set of samples.\",\"PeriodicalId\":89276,\"journal\":{\"name\":\"Summit on translational bioinformatics\",\"volume\":\"2008 \",\"pages\":\"41-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041518/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Summit on translational bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Summit on translational bioinformatics","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

高通量生物分析，如微阵列和质谱(MS)已经成为潜在的疾病检测临床工具。多种潜在的生物标志物可以快速和廉价地为大量患者进行评估。这些领域的典型研究和评价研究主要侧重于在一次数据生成会议中从样本中产生的数据。然而，在临床环境中，该技术筛选的新患者将在不同的时间到达，数据将不可避免地来自多个数据生成会话。了解和评估由该技术产生的数据的多时段效应对于其在临床实践中的应用至关重要。本文提出了一种方法，用于测量和测试跨多个数据生成会话的高吞吐量数据的各个方面的可重复性。我们测试并演示了从同一组样品的四个不同的数据生成会话中获得的质谱数据框架。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Inter-session reproducibility measures for high-throughput data sources.

本刊更多论文

Inter-session reproducibility measures for high-throughput data sources.

High-throughput biological assays such as micro-arrays and mass spectrometry (MS) have risen as potential clinical tools for disease detection. Multiple potential biomarkers can be rapidly and cheaply evaluated for a large number of patients. Typical research and evaluation studies in these fields have focused primarily on data that were generated from samples in a single data-generation session. However, in the clinical setting, new patients screened by the technology will arrive at different times and data will unavoidably come from multiple data-generation sessions. The understanding and assessment of multi-session effects on data generated by the technology is critical for its application to clinical practice. This paper proposes a methodology for measuring and testing the reproducibility of various aspects of high-throughput data across multiple data-generation sessions. We test and demonstrate the framework on mass-spectrometry data obtained from four different data-generation sessions for the same set of samples.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Summit on translational bioinformatics

自引率

0.00%

发文量