对ICGC癌症基因组肺腺癌研究的组学数据进行综合探索性分析

S. Sikdar, Hyoyoung Choo Wosoba, Younathan Abdia, S. Dutta, R. Gill, S. Datta, S. Datta
{"title":"对ICGC癌症基因组肺腺癌研究的组学数据进行综合探索性分析","authors":"S. Sikdar, Hyoyoung Choo Wosoba, Younathan Abdia, S. Dutta, R. Gill, S. Datta, S. Datta","doi":"10.1080/21628130.2015.1040618","DOIUrl":null,"url":null,"abstract":"It is known that all agents that cause cancer (carcinogens) also cause a change in the DNA sequence. In order to identify such often subtle changes, we attempt to integrate multiple molecular profile data sets released by the International Cancer Genome Consortium (ICGC). The list of data sets includes matched gene and microRNA expression profiles, somatic copy number variation, DNA methylation, and protein expression profiles for lung adenocarcinoma patients receiving treatments. We consider both unsupervised and supervised learning techniques (clustering and penalized regression) to identify interesting molecular markers corresponding to each type of –omics profiles that can differentiate patients. Associations between important markers of 2 types have been studied. An adaptive ensemble binary regression model has been presented that uses the entirety of available –omics profiles leading to a more accurate clinical prognosis for the patients in the given sample. This integrated study provides a more comprehensive picture of lung adenocarcinoma.","PeriodicalId":90057,"journal":{"name":"Systems biomedicine (Austin, Tex.)","volume":"2 1","pages":"54 - 62"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/21628130.2015.1040618","citationCount":"4","resultStr":"{\"title\":\"An integrative exploratory analysis of –omics data from the ICGC cancer genomes lung adenocarcinoma study\",\"authors\":\"S. Sikdar, Hyoyoung Choo Wosoba, Younathan Abdia, S. Dutta, R. Gill, S. Datta, S. Datta\",\"doi\":\"10.1080/21628130.2015.1040618\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It is known that all agents that cause cancer (carcinogens) also cause a change in the DNA sequence. In order to identify such often subtle changes, we attempt to integrate multiple molecular profile data sets released by the International Cancer Genome Consortium (ICGC). The list of data sets includes matched gene and microRNA expression profiles, somatic copy number variation, DNA methylation, and protein expression profiles for lung adenocarcinoma patients receiving treatments. We consider both unsupervised and supervised learning techniques (clustering and penalized regression) to identify interesting molecular markers corresponding to each type of –omics profiles that can differentiate patients. Associations between important markers of 2 types have been studied. An adaptive ensemble binary regression model has been presented that uses the entirety of available –omics profiles leading to a more accurate clinical prognosis for the patients in the given sample. This integrated study provides a more comprehensive picture of lung adenocarcinoma.\",\"PeriodicalId\":90057,\"journal\":{\"name\":\"Systems biomedicine (Austin, Tex.)\",\"volume\":\"2 1\",\"pages\":\"54 - 62\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1080/21628130.2015.1040618\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Systems biomedicine (Austin, Tex.)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/21628130.2015.1040618\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Systems biomedicine (Austin, Tex.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/21628130.2015.1040618","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

众所周知,所有导致癌症的物质(致癌物)也会引起DNA序列的变化。为了识别这些微妙的变化,我们尝试整合国际癌症基因组联盟(ICGC)发布的多个分子谱数据集。数据集列表包括接受治疗的肺腺癌患者的匹配基因和microRNA表达谱、体细胞拷贝数变异、DNA甲基化和蛋白质表达谱。我们考虑了无监督和监督学习技术(聚类和惩罚回归)来识别与每种类型的组学图谱相对应的有趣的分子标记,这些分子标记可以区分患者。研究了两种重要标记之间的关联。提出了一种自适应集成二元回归模型,该模型使用了所有可用的组学资料,从而为给定样本中的患者提供了更准确的临床预后。这项综合研究提供了一个更全面的肺腺癌的图像。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An integrative exploratory analysis of –omics data from the ICGC cancer genomes lung adenocarcinoma study
It is known that all agents that cause cancer (carcinogens) also cause a change in the DNA sequence. In order to identify such often subtle changes, we attempt to integrate multiple molecular profile data sets released by the International Cancer Genome Consortium (ICGC). The list of data sets includes matched gene and microRNA expression profiles, somatic copy number variation, DNA methylation, and protein expression profiles for lung adenocarcinoma patients receiving treatments. We consider both unsupervised and supervised learning techniques (clustering and penalized regression) to identify interesting molecular markers corresponding to each type of –omics profiles that can differentiate patients. Associations between important markers of 2 types have been studied. An adaptive ensemble binary regression model has been presented that uses the entirety of available –omics profiles leading to a more accurate clinical prognosis for the patients in the given sample. This integrated study provides a more comprehensive picture of lung adenocarcinoma.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信