利用完全配对和弱配对多组学数据进行癌症亚型的多视角对比聚类。

IF 4.2 3区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Yabin Kuang , Minzhu Xie , Zhanhong Zhao , Dongze Deng , Ergude Bao
{"title":"利用完全配对和弱配对多组学数据进行癌症亚型的多视角对比聚类。","authors":"Yabin Kuang ,&nbsp;Minzhu Xie ,&nbsp;Zhanhong Zhao ,&nbsp;Dongze Deng ,&nbsp;Ergude Bao","doi":"10.1016/j.ymeth.2024.09.016","DOIUrl":null,"url":null,"abstract":"<div><div>The identification of cancer subtypes is crucial for advancing precision medicine, as it facilitates the development of more effective and personalized treatment and prevention strategies. With the development of high-throughput sequencing technologies, researchers now have access to a wealth of multi-omics data from cancer patients, making computational cancer subtyping increasingly feasible. One of the main challenges in integrating multi-omics data is handling missing data, since not all biomolecules are consistently measured across all samples. Current computational models based on multi-omics data for cancer subtyping often struggle with the challenge of weakly paired omics data. To address this challenge, we propose a novel unsupervised cancer subtyping model named Subtype-MVCC. This model leverages graph convolutional networks to extract and represent low-dimensional features from each omics data type, using intra-view and inter-view contrastive learning approaches. By incorporating a weighted average fusion strategy to unify the dimension of each sample, Subtype-MVCC effectively handles weakly paired multi-omics datasets. Comprehensive evaluations on established benchmark datasets demonstrate that Subtype-MVCC outperforms nine leading models in this domain. Additionally, simulations with varying levels of missing data highlight the model's robust performance in handling weakly paired omics data. The clinical relevance and survival outcomes associated with the identified subtypes further validate the interpretability and reliability of the clustering results produced by Subtype-MVCC.</div></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"232 ","pages":"Pages 1-8"},"PeriodicalIF":4.2000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-view contrastive clustering for cancer subtyping using fully and weakly paired multi-omics data\",\"authors\":\"Yabin Kuang ,&nbsp;Minzhu Xie ,&nbsp;Zhanhong Zhao ,&nbsp;Dongze Deng ,&nbsp;Ergude Bao\",\"doi\":\"10.1016/j.ymeth.2024.09.016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The identification of cancer subtypes is crucial for advancing precision medicine, as it facilitates the development of more effective and personalized treatment and prevention strategies. With the development of high-throughput sequencing technologies, researchers now have access to a wealth of multi-omics data from cancer patients, making computational cancer subtyping increasingly feasible. One of the main challenges in integrating multi-omics data is handling missing data, since not all biomolecules are consistently measured across all samples. Current computational models based on multi-omics data for cancer subtyping often struggle with the challenge of weakly paired omics data. To address this challenge, we propose a novel unsupervised cancer subtyping model named Subtype-MVCC. This model leverages graph convolutional networks to extract and represent low-dimensional features from each omics data type, using intra-view and inter-view contrastive learning approaches. By incorporating a weighted average fusion strategy to unify the dimension of each sample, Subtype-MVCC effectively handles weakly paired multi-omics datasets. Comprehensive evaluations on established benchmark datasets demonstrate that Subtype-MVCC outperforms nine leading models in this domain. Additionally, simulations with varying levels of missing data highlight the model's robust performance in handling weakly paired omics data. The clinical relevance and survival outcomes associated with the identified subtypes further validate the interpretability and reliability of the clustering results produced by Subtype-MVCC.</div></div>\",\"PeriodicalId\":390,\"journal\":{\"name\":\"Methods\",\"volume\":\"232 \",\"pages\":\"Pages 1-8\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2024-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Methods\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1046202324002147\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202324002147","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

癌症亚型的确定对于推进精准医疗至关重要,因为这有助于开发更有效、更个性化的治疗和预防策略。随着高通量测序技术的发展,研究人员现在可以从癌症患者那里获得大量的多组学数据,这使得计算癌症亚型变得越来越可行。整合多组学数据的主要挑战之一是处理缺失数据,因为并非所有生物分子都能在所有样本中得到一致的测量。目前基于多组学数据进行癌症亚型分析的计算模型往往难以应对弱配对 omics 数据的挑战。为了应对这一挑战,我们提出了一种名为 Subtype-MVCC 的新型无监督癌症亚型分析模型。该模型利用图卷积网络,采用视图内和视图间对比学习方法,从每种 omics 数据类型中提取并表示低维特征。通过采用加权平均融合策略来统一每个样本的维度,Subtype-MVCC 能有效处理弱配对的多组学数据集。在已建立的基准数据集上进行的综合评估表明,Subtype-MVCC 优于该领域的九种领先模型。此外,不同程度的数据缺失模拟也凸显了该模型在处理弱配对组学数据时的强大性能。与已识别亚型相关的临床相关性和生存结果进一步验证了 Subtype-MVCC 生成的聚类结果的可解释性和可靠性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-view contrastive clustering for cancer subtyping using fully and weakly paired multi-omics data
The identification of cancer subtypes is crucial for advancing precision medicine, as it facilitates the development of more effective and personalized treatment and prevention strategies. With the development of high-throughput sequencing technologies, researchers now have access to a wealth of multi-omics data from cancer patients, making computational cancer subtyping increasingly feasible. One of the main challenges in integrating multi-omics data is handling missing data, since not all biomolecules are consistently measured across all samples. Current computational models based on multi-omics data for cancer subtyping often struggle with the challenge of weakly paired omics data. To address this challenge, we propose a novel unsupervised cancer subtyping model named Subtype-MVCC. This model leverages graph convolutional networks to extract and represent low-dimensional features from each omics data type, using intra-view and inter-view contrastive learning approaches. By incorporating a weighted average fusion strategy to unify the dimension of each sample, Subtype-MVCC effectively handles weakly paired multi-omics datasets. Comprehensive evaluations on established benchmark datasets demonstrate that Subtype-MVCC outperforms nine leading models in this domain. Additionally, simulations with varying levels of missing data highlight the model's robust performance in handling weakly paired omics data. The clinical relevance and survival outcomes associated with the identified subtypes further validate the interpretability and reliability of the clustering results produced by Subtype-MVCC.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Methods
Methods 生物-生化研究方法
CiteScore
9.80
自引率
2.10%
发文量
222
审稿时长
11.3 weeks
期刊介绍: Methods focuses on rapidly developing techniques in the experimental biological and medical sciences. Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信