{"title":"Deep multi-view clustering: A comprehensive survey of the contemporary techniques","authors":"Anal Roy Chowdhury , Avisek Gupta , Swagatam Das","doi":"10.1016/j.inffus.2025.103012","DOIUrl":null,"url":null,"abstract":"<div><div>Data can be represented by multiple sets of features, where each semantically coherent set of features is called a view. For example, an image can be represented by multiple sets of features that measure textures, shapes, edge features, etc. Collecting multiple views of data is generally easier than annotating it with the help of experts. Thus, the unsupervised exploration of data in consultation with all collected views is essential to identify naturally occurring clusters of data instances. In deep multi-view clustering, deep neural networks are used to obtain non-linear latent representations of data instances that agree with the multiple views, using which clusters of data instances are identified. A wide variety of such deep multi-view clustering approaches exist, which we systematically study and categorize into a novel taxonomy that provides structure to the existing literature and can also guide future researchers. We provide a pedagogical discussion on preliminary concepts to help understand topics relevant to the studied deep clustering methods. Various multi-view problems that are being studied are summarized, and future research scopes have been noted.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"119 ","pages":"Article 103012"},"PeriodicalIF":14.7000,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525000855","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Data can be represented by multiple sets of features, where each semantically coherent set of features is called a view. For example, an image can be represented by multiple sets of features that measure textures, shapes, edge features, etc. Collecting multiple views of data is generally easier than annotating it with the help of experts. Thus, the unsupervised exploration of data in consultation with all collected views is essential to identify naturally occurring clusters of data instances. In deep multi-view clustering, deep neural networks are used to obtain non-linear latent representations of data instances that agree with the multiple views, using which clusters of data instances are identified. A wide variety of such deep multi-view clustering approaches exist, which we systematically study and categorize into a novel taxonomy that provides structure to the existing literature and can also guide future researchers. We provide a pedagogical discussion on preliminary concepts to help understand topics relevant to the studied deep clustering methods. Various multi-view problems that are being studied are summarized, and future research scopes have been noted.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.