{"title":"Improved Linear-Time Streaming Algorithms for Maximizing Monotone Cardinality-Constrained Set Functions","authors":"Min Cui, Donglei Du, Ling Gai, Ruiqi Yang","doi":"10.1142/s0129054123410083","DOIUrl":null,"url":null,"abstract":"Many real-world applications arising from social networks, personalized recommendations, and others, require extracting a relatively small but broadly representative portion of massive data sets. Such problems can often be formulated as maximizing a monotone set function with cardinality constraints. In this paper, we consider a streaming model where elements arrive quickly over time, and create an effective, and low-memory algorithm. First, we provide the first single-pass linear-time algorithm, which is a a deterministic algorithm, achieves an approximation ratio of [Formula: see text] for any [Formula: see text] with a query complexity of [Formula: see text] and a memory complexity of [Formula: see text], where [Formula: see text] is a positive integer and [Formula: see text] is the submodularity ratio. However, the algorithm may produce less-than-ideal results. Our next result is to describe a multi-streaming algorithm, which is the first deterministic algorithm to attain an approximation ratio of [Formula: see text] with linear query complexity.","PeriodicalId":50323,"journal":{"name":"International Journal of Foundations of Computer Science","volume":"106 1","pages":"0"},"PeriodicalIF":0.6000,"publicationDate":"2023-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Foundations of Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0129054123410083","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Many real-world applications arising from social networks, personalized recommendations, and others, require extracting a relatively small but broadly representative portion of massive data sets. Such problems can often be formulated as maximizing a monotone set function with cardinality constraints. In this paper, we consider a streaming model where elements arrive quickly over time, and create an effective, and low-memory algorithm. First, we provide the first single-pass linear-time algorithm, which is a a deterministic algorithm, achieves an approximation ratio of [Formula: see text] for any [Formula: see text] with a query complexity of [Formula: see text] and a memory complexity of [Formula: see text], where [Formula: see text] is a positive integer and [Formula: see text] is the submodularity ratio. However, the algorithm may produce less-than-ideal results. Our next result is to describe a multi-streaming algorithm, which is the first deterministic algorithm to attain an approximation ratio of [Formula: see text] with linear query complexity.
来自社交网络、个性化推荐等的许多实际应用程序都需要从大量数据集中提取相对较小但具有广泛代表性的部分。这类问题通常可以表述为具有基数约束的单调集函数的最大化。在本文中,我们考虑了一个元素随时间快速到达的流模型,并创建了一个有效的低内存算法。首先,我们提供了第一个单遍线性时间算法,该算法是一种确定性算法,对任意一个查询复杂度为[Formula: see text]、内存复杂度为[Formula: see text]的[Formula: see text]实现了近似比为[Formula: see text],其中[Formula: see text]为正整数,[Formula: see text]为子模块化比。然而,该算法可能产生不太理想的结果。我们的下一个结果是描述一个多流算法,这是第一个获得线性查询复杂度近似比率的确定性算法[公式:见文本]。
期刊介绍:
The International Journal of Foundations of Computer Science is a bimonthly journal that publishes articles which contribute new theoretical results in all areas of the foundations of computer science. The theoretical and mathematical aspects covered include:
- Algebraic theory of computing and formal systems
- Algorithm and system implementation issues
- Approximation, probabilistic, and randomized algorithms
- Automata and formal languages
- Automated deduction
- Combinatorics and graph theory
- Complexity theory
- Computational biology and bioinformatics
- Cryptography
- Database theory
- Data structures
- Design and analysis of algorithms
- DNA computing
- Foundations of computer security
- Foundations of high-performance computing