{"title":"Summarizing Boolean and fuzzy tensors with sub-tensors","authors":"Victor Henrique Silva Ribeiro, Loïc Cerf","doi":"10.1016/j.ins.2025.122489","DOIUrl":null,"url":null,"abstract":"<div><div>The disjunctive box cluster model summarizes an <em>n</em>-way Boolean tensor with some of its sub-tensors and their densities, <em>i.e.</em>, the arithmetic means of their values. Mirkin and Kramarenko proposed that easy-to-interpret regression model, for <span><math><mi>n</mi><mo>∈</mo><mo>{</mo><mn>2</mn><mo>,</mo><mn>3</mn><mo>}</mo></math></span>, and hill climbing to discover good sub-tensors, according to ordinary least squares. This article generalizes Mirkin and Kramarenko's work: <em>n</em>-way <em>fuzzy</em> tensors are summarized. They encode to what extent <em>n</em>-ary predicates are satisfied. The article also details significant performance improvements to the sequential execution, its parallelization, better starting points for hill climbing, a selection of the discovered sub-tensors, their ranking in order of contribution to the model, and the use of the elbow method to truncate the ordered list. In-depth experiments using synthetic and real-world tensors compare the proposed method, NclusterBox, to Mirkin and Kramarenko's and to the state-of-the-art algorithms for matrix factorization using the max (rather than +) operator and for Boolean tensor factorization. NclusterBox summarizes synthetic and real-world fuzzy tensors more efficiently and, most importantly, more accurately.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"719 ","pages":"Article 122489"},"PeriodicalIF":6.8000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525006218","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The disjunctive box cluster model summarizes an n-way Boolean tensor with some of its sub-tensors and their densities, i.e., the arithmetic means of their values. Mirkin and Kramarenko proposed that easy-to-interpret regression model, for , and hill climbing to discover good sub-tensors, according to ordinary least squares. This article generalizes Mirkin and Kramarenko's work: n-way fuzzy tensors are summarized. They encode to what extent n-ary predicates are satisfied. The article also details significant performance improvements to the sequential execution, its parallelization, better starting points for hill climbing, a selection of the discovered sub-tensors, their ranking in order of contribution to the model, and the use of the elbow method to truncate the ordered list. In-depth experiments using synthetic and real-world tensors compare the proposed method, NclusterBox, to Mirkin and Kramarenko's and to the state-of-the-art algorithms for matrix factorization using the max (rather than +) operator and for Boolean tensor factorization. NclusterBox summarizes synthetic and real-world fuzzy tensors more efficiently and, most importantly, more accurately.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.