L. F. Brunialti, S. M. Peres, V. F. Silva, C. Lima
{"title":"基于非负矩阵三因子分解的重叠列共聚类算法","authors":"L. F. Brunialti, S. M. Peres, V. F. Silva, C. Lima","doi":"10.1109/BRACIS.2017.80","DOIUrl":null,"url":null,"abstract":"Co-clustering is being given increasing attention by data scientists because it reveals a priori hidden information in data, through an analysis of item clusters along with attribute clusters. The use of co-clustering methods based on non-negative matrix factorization is considered to be advantageous for contexts in which data is positive matrices. However, there are limitations in these methods when co-clusters are characterized by columns overlapping (or attributes) – a common situation in several application contexts. In this paper, we have formalized the problem of Columns Overlapping Co-clustering and introduced BinOvNMTF (Binary Overlapped Non-negative Matrix Tri-Factorization), a new algorithm to analyze attribute clusters independently for each item cluster. This analysis is particularly useful for discovering information embedded in attribute clusters. We tested the BinOvNMTF algorithm in synthetic and real (textual) datasets; BinOvNMTF achieved superior results than those obtained by correlated algorithms.","PeriodicalId":202240,"journal":{"name":"2017 Brazilian Conference on Intelligent Systems (BRACIS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"The BinOvNMTF Algorithm: Overlapping Columns Co-clustering Based on Non-negative Matrix Tri-factorization\",\"authors\":\"L. F. Brunialti, S. M. Peres, V. F. Silva, C. Lima\",\"doi\":\"10.1109/BRACIS.2017.80\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Co-clustering is being given increasing attention by data scientists because it reveals a priori hidden information in data, through an analysis of item clusters along with attribute clusters. The use of co-clustering methods based on non-negative matrix factorization is considered to be advantageous for contexts in which data is positive matrices. However, there are limitations in these methods when co-clusters are characterized by columns overlapping (or attributes) – a common situation in several application contexts. In this paper, we have formalized the problem of Columns Overlapping Co-clustering and introduced BinOvNMTF (Binary Overlapped Non-negative Matrix Tri-Factorization), a new algorithm to analyze attribute clusters independently for each item cluster. This analysis is particularly useful for discovering information embedded in attribute clusters. We tested the BinOvNMTF algorithm in synthetic and real (textual) datasets; BinOvNMTF achieved superior results than those obtained by correlated algorithms.\",\"PeriodicalId\":202240,\"journal\":{\"name\":\"2017 Brazilian Conference on Intelligent Systems (BRACIS)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Brazilian Conference on Intelligent Systems (BRACIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BRACIS.2017.80\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Brazilian Conference on Intelligent Systems (BRACIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRACIS.2017.80","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The BinOvNMTF Algorithm: Overlapping Columns Co-clustering Based on Non-negative Matrix Tri-factorization
Co-clustering is being given increasing attention by data scientists because it reveals a priori hidden information in data, through an analysis of item clusters along with attribute clusters. The use of co-clustering methods based on non-negative matrix factorization is considered to be advantageous for contexts in which data is positive matrices. However, there are limitations in these methods when co-clusters are characterized by columns overlapping (or attributes) – a common situation in several application contexts. In this paper, we have formalized the problem of Columns Overlapping Co-clustering and introduced BinOvNMTF (Binary Overlapped Non-negative Matrix Tri-Factorization), a new algorithm to analyze attribute clusters independently for each item cluster. This analysis is particularly useful for discovering information embedded in attribute clusters. We tested the BinOvNMTF algorithm in synthetic and real (textual) datasets; BinOvNMTF achieved superior results than those obtained by correlated algorithms.