总结布尔张量和模糊张量的子张量

IF 6.8 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS
Victor Henrique Silva Ribeiro, Loïc Cerf
{"title":"总结布尔张量和模糊张量的子张量","authors":"Victor Henrique Silva Ribeiro,&nbsp;Loïc Cerf","doi":"10.1016/j.ins.2025.122489","DOIUrl":null,"url":null,"abstract":"<div><div>The disjunctive box cluster model summarizes an <em>n</em>-way Boolean tensor with some of its sub-tensors and their densities, <em>i.e.</em>, the arithmetic means of their values. Mirkin and Kramarenko proposed that easy-to-interpret regression model, for <span><math><mi>n</mi><mo>∈</mo><mo>{</mo><mn>2</mn><mo>,</mo><mn>3</mn><mo>}</mo></math></span>, and hill climbing to discover good sub-tensors, according to ordinary least squares. This article generalizes Mirkin and Kramarenko's work: <em>n</em>-way <em>fuzzy</em> tensors are summarized. They encode to what extent <em>n</em>-ary predicates are satisfied. The article also details significant performance improvements to the sequential execution, its parallelization, better starting points for hill climbing, a selection of the discovered sub-tensors, their ranking in order of contribution to the model, and the use of the elbow method to truncate the ordered list. In-depth experiments using synthetic and real-world tensors compare the proposed method, NclusterBox, to Mirkin and Kramarenko's and to the state-of-the-art algorithms for matrix factorization using the max (rather than +) operator and for Boolean tensor factorization. NclusterBox summarizes synthetic and real-world fuzzy tensors more efficiently and, most importantly, more accurately.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"719 ","pages":"Article 122489"},"PeriodicalIF":6.8000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Summarizing Boolean and fuzzy tensors with sub-tensors\",\"authors\":\"Victor Henrique Silva Ribeiro,&nbsp;Loïc Cerf\",\"doi\":\"10.1016/j.ins.2025.122489\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The disjunctive box cluster model summarizes an <em>n</em>-way Boolean tensor with some of its sub-tensors and their densities, <em>i.e.</em>, the arithmetic means of their values. Mirkin and Kramarenko proposed that easy-to-interpret regression model, for <span><math><mi>n</mi><mo>∈</mo><mo>{</mo><mn>2</mn><mo>,</mo><mn>3</mn><mo>}</mo></math></span>, and hill climbing to discover good sub-tensors, according to ordinary least squares. This article generalizes Mirkin and Kramarenko's work: <em>n</em>-way <em>fuzzy</em> tensors are summarized. They encode to what extent <em>n</em>-ary predicates are satisfied. The article also details significant performance improvements to the sequential execution, its parallelization, better starting points for hill climbing, a selection of the discovered sub-tensors, their ranking in order of contribution to the model, and the use of the elbow method to truncate the ordered list. In-depth experiments using synthetic and real-world tensors compare the proposed method, NclusterBox, to Mirkin and Kramarenko's and to the state-of-the-art algorithms for matrix factorization using the max (rather than +) operator and for Boolean tensor factorization. NclusterBox summarizes synthetic and real-world fuzzy tensors more efficiently and, most importantly, more accurately.</div></div>\",\"PeriodicalId\":51063,\"journal\":{\"name\":\"Information Sciences\",\"volume\":\"719 \",\"pages\":\"Article 122489\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Sciences\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0020025525006218\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525006218","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

析取盒簇模型总结了一个n路布尔张量及其若干子张量及其密度,即它们值的算术平均值。Mirkin和Kramarenko提出了易于解释的回归模型,对于n∈{2,3},根据普通最小二乘爬山来发现好的子张量。本文推广了Mirkin和Kramarenko的工作:总结了n向模糊张量。它们编码n元谓词满足的程度。本文还详细介绍了顺序执行的显著性能改进、并行化、更好的爬坡起点、发现的子张量的选择、它们对模型的贡献顺序排序,以及使用肘形方法截断有序列表。使用合成张量和真实张量的深入实验将提出的方法NclusterBox与Mirkin和Kramarenko的方法以及使用max(而不是+)算子和布尔张量分解的最先进的矩阵分解算法进行了比较。NclusterBox更有效地总结了合成和现实世界的模糊张量,最重要的是,更准确。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Summarizing Boolean and fuzzy tensors with sub-tensors
The disjunctive box cluster model summarizes an n-way Boolean tensor with some of its sub-tensors and their densities, i.e., the arithmetic means of their values. Mirkin and Kramarenko proposed that easy-to-interpret regression model, for n{2,3}, and hill climbing to discover good sub-tensors, according to ordinary least squares. This article generalizes Mirkin and Kramarenko's work: n-way fuzzy tensors are summarized. They encode to what extent n-ary predicates are satisfied. The article also details significant performance improvements to the sequential execution, its parallelization, better starting points for hill climbing, a selection of the discovered sub-tensors, their ranking in order of contribution to the model, and the use of the elbow method to truncate the ordered list. In-depth experiments using synthetic and real-world tensors compare the proposed method, NclusterBox, to Mirkin and Kramarenko's and to the state-of-the-art algorithms for matrix factorization using the max (rather than +) operator and for Boolean tensor factorization. NclusterBox summarizes synthetic and real-world fuzzy tensors more efficiently and, most importantly, more accurately.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Information Sciences
Information Sciences 工程技术-计算机:信息系统
CiteScore
14.00
自引率
17.30%
发文量
1322
审稿时长
10.4 months
期刊介绍: Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信