Integrative clustering methods for multi-omics data.

IF 5.4 2区数学 Q1 STATISTICS & PROBABILITY

Wiley Interdisciplinary Reviews-Computational Statistics Pub Date : 2022-05-01 DOI:10.1002/wics.1553

Xiaoyu Zhang, Zhenwei Zhou, Hanfei Xu, Ching-Ti Liu

{"title":"Integrative clustering methods for multi-omics data.","authors":"Xiaoyu Zhang, Zhenwei Zhou, Hanfei Xu, Ching-Ti Liu","doi":"10.1002/wics.1553","DOIUrl":null,"url":null,"abstract":"Integrative analysis of multi-omics data has drawn much attention from the scientific community due to the technological advancements which have generated various omics data. Leveraging these multi-omics data potentially provides a more comprehensive view of the disease mechanism or biological processes. Integrative multi-omics clustering is an unsupervised integrative method specifically used to find coherent groups of samples or features by utilizing information across multi-omics data. It aims to better stratify diseases and to suggest biological mechanisms and potential targeted therapies for the diseases. However, applying integrative multi-omics clustering is both statistically and computationally challenging due to various reasons such as high dimensionality and heterogeneity. In this review, we summarized integrative multi-omics clustering methods into three general categories: concatenated clustering, clustering of clusters, and interactive clustering based on when and how the multi-omics data are processed for clustering. We further classified the methods into different approaches under each category based on the main statistical strategy used during clustering. In addition, we have provided recommended practices tailored to four real-life scenarios to help researchers to strategize their selection in integrative multi-omics clustering methods for their future studies.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":"14 3","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1553","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wiley Interdisciplinary Reviews-Computational Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1002/wics.1553","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 4

Abstract

Integrative analysis of multi-omics data has drawn much attention from the scientific community due to the technological advancements which have generated various omics data. Leveraging these multi-omics data potentially provides a more comprehensive view of the disease mechanism or biological processes. Integrative multi-omics clustering is an unsupervised integrative method specifically used to find coherent groups of samples or features by utilizing information across multi-omics data. It aims to better stratify diseases and to suggest biological mechanisms and potential targeted therapies for the diseases. However, applying integrative multi-omics clustering is both statistically and computationally challenging due to various reasons such as high dimensionality and heterogeneity. In this review, we summarized integrative multi-omics clustering methods into three general categories: concatenated clustering, clustering of clusters, and interactive clustering based on when and how the multi-omics data are processed for clustering. We further classified the methods into different approaches under each category based on the main statistical strategy used during clustering. In addition, we have provided recommended practices tailored to four real-life scenarios to help researchers to strategize their selection in integrative multi-omics clustering methods for their future studies.

查看原文本刊更多论文

多组学数据的集成聚类方法。

由于技术的进步产生了各种组学数据，多组学数据的综合分析受到了科学界的广泛关注。利用这些多组学数据有可能为疾病机制或生物学过程提供更全面的观点。整合多组学聚类是一种无监督的整合方法，专门用于利用跨多组学数据的信息找到连贯的样本组或特征。它旨在更好地对疾病进行分层，并提出疾病的生物学机制和潜在的靶向治疗方法。然而，由于高维度和异质性等原因，应用集成多组学聚类在统计和计算上都具有挑战性。本文根据多组学数据的聚类处理时间和方式，将多组学聚类方法分为串联聚类、聚类的聚类和交互聚类三大类。基于聚类过程中使用的主要统计策略，我们进一步将每种方法分类为不同的方法。此外，我们还提供了针对四种现实场景的推荐实践，以帮助研究人员在未来的研究中制定综合多组学聚类方法的选择策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Wiley Interdisciplinary Reviews-Computational Statistics STATISTICS & PROBABILITY-

CiteScore

6.20

自引率

0.00%

发文量