NCI 的蛋白质组数据公共资源：基于云的蛋白质组学资料库，通过与基因组学和影像学数据的交叉引用，增强癌症综合分析能力。

IF 2 Q3 ONCOLOGY

Cancer research communications Pub Date : 2024-09-01 DOI:10.1158/2767-9764.CRC-24-0243

Ratna R Thangudu, Michael Holck, Deepak Singhal, Alexander Pilozzi, Nathan Edwards, Paul A Rudnick, Marcin J Domagalski, Padmini Chilappagari, Lei Ma, Yi Xin, Toan Le, Kristen Nyce, Rekha Chaudhary, Karen A Ketchum, Aaron Maurais, Brian Connolly, Michael Riffle, Matthew C Chambers, Brendan MacLean, Michael J MacCoss, Peter B McGarvey, Anand Basu, John Otridge, Esmeralda Casas-Silva, Sudha Venkatachari, Henry Rodriguez, Xu Zhang

{"title":"NCI 的蛋白质组数据公共资源：基于云的蛋白质组学资料库，通过与基因组学和影像学数据的交叉引用，增强癌症综合分析能力。","authors":"Ratna R Thangudu, Michael Holck, Deepak Singhal, Alexander Pilozzi, Nathan Edwards, Paul A Rudnick, Marcin J Domagalski, Padmini Chilappagari, Lei Ma, Yi Xin, Toan Le, Kristen Nyce, Rekha Chaudhary, Karen A Ketchum, Aaron Maurais, Brian Connolly, Michael Riffle, Matthew C Chambers, Brendan MacLean, Michael J MacCoss, Peter B McGarvey, Anand Basu, John Otridge, Esmeralda Casas-Silva, Sudha Venkatachari, Henry Rodriguez, Xu Zhang","doi":"10.1158/2767-9764.CRC-24-0243","DOIUrl":null,"url":null,"abstract":"Proteomics has emerged as a powerful tool for studying cancer biology, developing diagnostics, and therapies. With the continuous improvement and widespread availability of high-throughput proteomic technologies, the generation of large-scale proteomic data has become more common in cancer research, and there is a growing need for resources that support the sharing and integration of multi-omics datasets. Such datasets require extensive metadata including clinical, biospecimen, and experimental and workflow annotations that are crucial for data interpretation and reanalysis. The need to integrate, analyze, and share these data has led to the development of NCI's Proteomic Data Commons (PDC), accessible at https://pdc.cancer.gov. As a specialized repository within the NCI Cancer Research Data Commons (CRDC), PDC enables researchers to locate and analyze proteomic data from various cancer types and connect with genomic and imaging data available for the same samples in other CRDC nodes. Presently, PDC houses annotated data from more than 160 datasets across 19 cancer types, generated by several large-scale cancer research programs with cohort sizes exceeding 100 samples (tumor and associated normal when available). In this article, we review the current state of PDC in cancer research, discuss the opportunities and challenges associated with data sharing in proteomics, and propose future directions for the resource.Significance: The Proteomic Data Commons (PDC) plays a crucial role in advancing cancer research by providing a centralized repository of high-quality cancer proteomic data, enriched with extensive clinical annotations. By integrating and cross-referencing with complementary genomic and imaging data, the PDC facilitates multi-omics analyses, driving comprehensive insights, and accelerating discoveries across various cancer types.","PeriodicalId":72516,"journal":{"name":"Cancer research communications","volume":" ","pages":"2480-2488"},"PeriodicalIF":2.0000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11413857/pdf/","citationCount":"0","resultStr":"{\"title\":\"NCI's Proteomic Data Commons: A Cloud-Based Proteomics Repository Empowering Comprehensive Cancer Analysis through Cross-Referencing with Genomic and Imaging Data.\",\"authors\":\"Ratna R Thangudu, Michael Holck, Deepak Singhal, Alexander Pilozzi, Nathan Edwards, Paul A Rudnick, Marcin J Domagalski, Padmini Chilappagari, Lei Ma, Yi Xin, Toan Le, Kristen Nyce, Rekha Chaudhary, Karen A Ketchum, Aaron Maurais, Brian Connolly, Michael Riffle, Matthew C Chambers, Brendan MacLean, Michael J MacCoss, Peter B McGarvey, Anand Basu, John Otridge, Esmeralda Casas-Silva, Sudha Venkatachari, Henry Rodriguez, Xu Zhang\",\"doi\":\"10.1158/2767-9764.CRC-24-0243\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Proteomics has emerged as a powerful tool for studying cancer biology, developing diagnostics, and therapies. With the continuous improvement and widespread availability of high-throughput proteomic technologies, the generation of large-scale proteomic data has become more common in cancer research, and there is a growing need for resources that support the sharing and integration of multi-omics datasets. Such datasets require extensive metadata including clinical, biospecimen, and experimental and workflow annotations that are crucial for data interpretation and reanalysis. The need to integrate, analyze, and share these data has led to the development of NCI's Proteomic Data Commons (PDC), accessible at https://pdc.cancer.gov. As a specialized repository within the NCI Cancer Research Data Commons (CRDC), PDC enables researchers to locate and analyze proteomic data from various cancer types and connect with genomic and imaging data available for the same samples in other CRDC nodes. Presently, PDC houses annotated data from more than 160 datasets across 19 cancer types, generated by several large-scale cancer research programs with cohort sizes exceeding 100 samples (tumor and associated normal when available). In this article, we review the current state of PDC in cancer research, discuss the opportunities and challenges associated with data sharing in proteomics, and propose future directions for the resource.Significance: The Proteomic Data Commons (PDC) plays a crucial role in advancing cancer research by providing a centralized repository of high-quality cancer proteomic data, enriched with extensive clinical annotations. By integrating and cross-referencing with complementary genomic and imaging data, the PDC facilitates multi-omics analyses, driving comprehensive insights, and accelerating discoveries across various cancer types.\",\"PeriodicalId\":72516,\"journal\":{\"name\":\"Cancer research communications\",\"volume\":\" \",\"pages\":\"2480-2488\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2024-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11413857/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cancer research communications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1158/2767-9764.CRC-24-0243\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer research communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1158/2767-9764.CRC-24-0243","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

蛋白质组学已成为研究癌症生物学、开发诊断和治疗方法的有力工具。随着高通量蛋白质组学技术的不断改进和普及，大规模蛋白质组学数据的生成在癌症研究中变得越来越普遍，对支持多组学数据集共享和整合的资源的需求也越来越大。此类数据集需要大量元数据，包括临床、生物样本、实验和工作流程注释，这些对于数据解读和再分析至关重要。由于需要整合、分析和共享这些数据，美国国家癌症研究所（NCI）开发了蛋白质组数据共享中心（PDC），可通过 https://pdc.cancer.gov 访问。作为 NCI 癌症研究数据共享中心 (CRDC) 中的一个专门存储库，PDC 使研究人员能够查找和分析各种癌症类型的蛋白质组数据，并与 CRDC 其他节点中相同样本的基因组和成像数据相连接。目前，PDC 收录了来自 19 种癌症类型近 140 个数据集的注释数据，这些数据集由几个大规模癌症研究项目生成，队列规模超过 100 个样本（肿瘤及相关正常样本）。在本文中，我们回顾了 PDC 在癌症研究中的现状，讨论了与蛋白质组学数据共享相关的机遇和挑战，并提出了该资源的未来发展方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

NCI's Proteomic Data Commons: A Cloud-Based Proteomics Repository Empowering Comprehensive Cancer Analysis through Cross-Referencing with Genomic and Imaging Data.

Proteomics has emerged as a powerful tool for studying cancer biology, developing diagnostics, and therapies. With the continuous improvement and widespread availability of high-throughput proteomic technologies, the generation of large-scale proteomic data has become more common in cancer research, and there is a growing need for resources that support the sharing and integration of multi-omics datasets. Such datasets require extensive metadata including clinical, biospecimen, and experimental and workflow annotations that are crucial for data interpretation and reanalysis. The need to integrate, analyze, and share these data has led to the development of NCI's Proteomic Data Commons (PDC), accessible at https://pdc.cancer.gov. As a specialized repository within the NCI Cancer Research Data Commons (CRDC), PDC enables researchers to locate and analyze proteomic data from various cancer types and connect with genomic and imaging data available for the same samples in other CRDC nodes. Presently, PDC houses annotated data from more than 160 datasets across 19 cancer types, generated by several large-scale cancer research programs with cohort sizes exceeding 100 samples (tumor and associated normal when available). In this article, we review the current state of PDC in cancer research, discuss the opportunities and challenges associated with data sharing in proteomics, and propose future directions for the resource.

Significance: The Proteomic Data Commons (PDC) plays a crucial role in advancing cancer research by providing a centralized repository of high-quality cancer proteomic data, enriched with extensive clinical annotations. By integrating and cross-referencing with complementary genomic and imaging data, the PDC facilitates multi-omics analyses, driving comprehensive insights, and accelerating discoveries across various cancer types.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Cancer research communications

自引率

0.00%

发文量