基于非负矩阵三因子分解的重叠列共聚类算法

L. F. Brunialti, S. M. Peres, V. F. Silva, C. Lima
{"title":"基于非负矩阵三因子分解的重叠列共聚类算法","authors":"L. F. Brunialti, S. M. Peres, V. F. Silva, C. Lima","doi":"10.1109/BRACIS.2017.80","DOIUrl":null,"url":null,"abstract":"Co-clustering is being given increasing attention by data scientists because it reveals a priori hidden information in data, through an analysis of item clusters along with attribute clusters. The use of co-clustering methods based on non-negative matrix factorization is considered to be advantageous for contexts in which data is positive matrices. However, there are limitations in these methods when co-clusters are characterized by columns overlapping (or attributes) – a common situation in several application contexts. In this paper, we have formalized the problem of Columns Overlapping Co-clustering and introduced BinOvNMTF (Binary Overlapped Non-negative Matrix Tri-Factorization), a new algorithm to analyze attribute clusters independently for each item cluster. This analysis is particularly useful for discovering information embedded in attribute clusters. We tested the BinOvNMTF algorithm in synthetic and real (textual) datasets; BinOvNMTF achieved superior results than those obtained by correlated algorithms.","PeriodicalId":202240,"journal":{"name":"2017 Brazilian Conference on Intelligent Systems (BRACIS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"The BinOvNMTF Algorithm: Overlapping Columns Co-clustering Based on Non-negative Matrix Tri-factorization\",\"authors\":\"L. F. Brunialti, S. M. Peres, V. F. Silva, C. Lima\",\"doi\":\"10.1109/BRACIS.2017.80\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Co-clustering is being given increasing attention by data scientists because it reveals a priori hidden information in data, through an analysis of item clusters along with attribute clusters. The use of co-clustering methods based on non-negative matrix factorization is considered to be advantageous for contexts in which data is positive matrices. However, there are limitations in these methods when co-clusters are characterized by columns overlapping (or attributes) – a common situation in several application contexts. In this paper, we have formalized the problem of Columns Overlapping Co-clustering and introduced BinOvNMTF (Binary Overlapped Non-negative Matrix Tri-Factorization), a new algorithm to analyze attribute clusters independently for each item cluster. This analysis is particularly useful for discovering information embedded in attribute clusters. We tested the BinOvNMTF algorithm in synthetic and real (textual) datasets; BinOvNMTF achieved superior results than those obtained by correlated algorithms.\",\"PeriodicalId\":202240,\"journal\":{\"name\":\"2017 Brazilian Conference on Intelligent Systems (BRACIS)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Brazilian Conference on Intelligent Systems (BRACIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BRACIS.2017.80\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Brazilian Conference on Intelligent Systems (BRACIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRACIS.2017.80","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

共聚类越来越受到数据科学家的关注,因为它通过对项目聚类和属性聚类的分析,揭示了数据中先验的隐藏信息。使用基于非负矩阵分解的共聚类方法被认为对数据为正矩阵的上下文有利。但是,当共同集群的特征是列重叠(或属性重叠)时,这些方法存在局限性——这是许多应用程序上下文中的常见情况。本文形式化了列重叠共聚类问题,并引入了一种新的算法BinOvNMTF (Binary Overlapped Non-negative Matrix Tri-Factorization),对每一项聚类独立分析属性聚类。这种分析对于发现嵌入在属性集群中的信息特别有用。我们在合成和真实(文本)数据集上测试了BinOvNMTF算法;BinOvNMTF取得了优于相关算法的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
The BinOvNMTF Algorithm: Overlapping Columns Co-clustering Based on Non-negative Matrix Tri-factorization
Co-clustering is being given increasing attention by data scientists because it reveals a priori hidden information in data, through an analysis of item clusters along with attribute clusters. The use of co-clustering methods based on non-negative matrix factorization is considered to be advantageous for contexts in which data is positive matrices. However, there are limitations in these methods when co-clusters are characterized by columns overlapping (or attributes) – a common situation in several application contexts. In this paper, we have formalized the problem of Columns Overlapping Co-clustering and introduced BinOvNMTF (Binary Overlapped Non-negative Matrix Tri-Factorization), a new algorithm to analyze attribute clusters independently for each item cluster. This analysis is particularly useful for discovering information embedded in attribute clusters. We tested the BinOvNMTF algorithm in synthetic and real (textual) datasets; BinOvNMTF achieved superior results than those obtained by correlated algorithms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信