Dealing with high dimensional multi-view data: A comprehensive review of non-negative matrix factorization approaches in data mining and machine learning
IF 13.3 1区 计算机科学Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Nafiseh Soleymani, Mohammad Hossein Moattar, Reza Sheibani
{"title":"Dealing with high dimensional multi-view data: A comprehensive review of non-negative matrix factorization approaches in data mining and machine learning","authors":"Nafiseh Soleymani, Mohammad Hossein Moattar, Reza Sheibani","doi":"10.1016/j.cosrev.2025.100788","DOIUrl":null,"url":null,"abstract":"<div><div>Non-negative matrix factorization (NMF) has become a well-known model in data mining in recent years. NMF is an unsupervised algorithm that efficiently reduces the number of features while maintaining the crucial information needed to reconstruct the original data by projecting the data onto a lower-dimensional space. NMF's main goal is to automatically extract hidden patterns from high-dimensional vectors; it has been effectively used for prediction, clustering, and dimensionality reduction. On the other hand, a major problem in data mining and machine learning is multi-view decision-making. Multi-view learning is a significant issue in today's multi-modal decision-making environment since it makes use of many and frequently high-dimensional data representations to improve learning outcomes. This study aims to review the state-of-the-art NMF methods for multi-view data processing, encompassing principles, representation approaches, clustering models, and algorithms with various generalizations, developments, and modifications. The review includes discussions on different aspects of the algorithms and provides a comprehensive comparison of their advantages. Additionally, it addresses several open issues and remaining challenges. Ultimately, this review seeks to establish a framework for the NMF concept that may benefit future research.</div></div>","PeriodicalId":48633,"journal":{"name":"Computer Science Review","volume":"58 ","pages":"Article 100788"},"PeriodicalIF":13.3000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science Review","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574013725000644","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Non-negative matrix factorization (NMF) has become a well-known model in data mining in recent years. NMF is an unsupervised algorithm that efficiently reduces the number of features while maintaining the crucial information needed to reconstruct the original data by projecting the data onto a lower-dimensional space. NMF's main goal is to automatically extract hidden patterns from high-dimensional vectors; it has been effectively used for prediction, clustering, and dimensionality reduction. On the other hand, a major problem in data mining and machine learning is multi-view decision-making. Multi-view learning is a significant issue in today's multi-modal decision-making environment since it makes use of many and frequently high-dimensional data representations to improve learning outcomes. This study aims to review the state-of-the-art NMF methods for multi-view data processing, encompassing principles, representation approaches, clustering models, and algorithms with various generalizations, developments, and modifications. The review includes discussions on different aspects of the algorithms and provides a comprehensive comparison of their advantages. Additionally, it addresses several open issues and remaining challenges. Ultimately, this review seeks to establish a framework for the NMF concept that may benefit future research.
期刊介绍:
Computer Science Review, a publication dedicated to research surveys and expository overviews of open problems in computer science, targets a broad audience within the field seeking comprehensive insights into the latest developments. The journal welcomes articles from various fields as long as their content impacts the advancement of computer science. In particular, articles that review the application of well-known Computer Science methods to other areas are in scope only if these articles advance the fundamental understanding of those methods.