Universal Algorithms for Clustering Problems

IF 0.9 3区 计算机科学 Q3 COMPUTER SCIENCE, THEORY & METHODS
Arun Ganesh, Bruce M. Maggs, Debmalya Panigrahi
{"title":"Universal Algorithms for Clustering Problems","authors":"Arun Ganesh, Bruce M. Maggs, Debmalya Panigrahi","doi":"https://dl.acm.org/doi/10.1145/3572840","DOIUrl":null,"url":null,"abstract":"<p>This article presents <i>universal</i> algorithms for clustering problems, including the widely studied <i>k</i>-median, <i>k</i>-means, and <i>k</i>-center objectives. The input is a metric space containing all <i>potential</i> client locations. The algorithm must select <i>k</i> cluster centers such that they are a good solution for <i>any</i> subset of clients that actually realize. Specifically, we aim for low <i>regret</i>, defined as the maximum over all subsets of the difference between the cost of the algorithm’s solution and that of an optimal solution. A universal algorithm’s solution <span>Sol</span> for a clustering problem is said to be an α , β-approximation if for all subsets of clients <i>C<sup>′</sup></i>, it satisfies <span>sol</span> (<i>C</i><sup>′</sup>) ≤ α ċ <span>opt</span> (<i>C</i>′) + β ċ <span>mr</span>, where <span>opt</span> (<i>C</i>′ is the cost of the optimal solution for clients (<i>C</i>′) and <span>mr</span> is the minimum regret achievable by any solution.</p><p>Our main results are universal algorithms for the standard clustering objectives of <i>k</i>-median, <i>k</i>-means, and <i>k</i>-center that achieve (<i>O</i>(1), <i>O</i>(1))-approximations. These results are obtained via a novel framework for universal algorithms using linear programming (LP) relaxations. These results generalize to other ℓ<i><sub>p</sub></i>-objectives and the setting where some subset of the clients are <i>fixed</i>. We also give hardness results showing that (α, β)-approximation is NP-hard if α or β is at most a certain constant, even for the widely studied special case of Euclidean metric spaces. This shows that in some sense, (<i>O</i>(1), <i>O</i>(1))-approximation is the strongest type of guarantee obtainable for universal clustering.</p>","PeriodicalId":50922,"journal":{"name":"ACM Transactions on Algorithms","volume":"8 7","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2023-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Algorithms","FirstCategoryId":"94","ListUrlMain":"https://doi.org/https://dl.acm.org/doi/10.1145/3572840","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

This article presents universal algorithms for clustering problems, including the widely studied k-median, k-means, and k-center objectives. The input is a metric space containing all potential client locations. The algorithm must select k cluster centers such that they are a good solution for any subset of clients that actually realize. Specifically, we aim for low regret, defined as the maximum over all subsets of the difference between the cost of the algorithm’s solution and that of an optimal solution. A universal algorithm’s solution Sol for a clustering problem is said to be an α , β-approximation if for all subsets of clients C, it satisfies sol (C) ≤ α ċ opt (C′) + β ċ mr, where opt (C′ is the cost of the optimal solution for clients (C′) and mr is the minimum regret achievable by any solution.

Our main results are universal algorithms for the standard clustering objectives of k-median, k-means, and k-center that achieve (O(1), O(1))-approximations. These results are obtained via a novel framework for universal algorithms using linear programming (LP) relaxations. These results generalize to other ℓp-objectives and the setting where some subset of the clients are fixed. We also give hardness results showing that (α, β)-approximation is NP-hard if α or β is at most a certain constant, even for the widely studied special case of Euclidean metric spaces. This shows that in some sense, (O(1), O(1))-approximation is the strongest type of guarantee obtainable for universal clustering.

聚类问题的通用算法
本文介绍了聚类问题的通用算法,包括广泛研究的k-median, k-means和k-center目标。输入是包含所有潜在客户位置的度量空间。算法必须选择k个聚类中心,使得它们对于实际实现的任何客户端子集都是一个很好的解决方案。具体来说,我们的目标是低遗憾,定义为算法解决方案的成本与最优解决方案的成本之差的所有子集的最大值。聚类问题的通用算法解Sol是一个α, β-近似,如果对于客户端C '的所有子集,它满足Sol (C ')≤α * opt (C ') + β * mr,其中opt (C ')是客户端(C ')的最优解的代价,mr是任何解可实现的最小遗憾。我们的主要成果是k-median, k-means和k-center的标准聚类目标的通用算法,这些算法实现了(O(1), O(1))-近似。这些结果是通过使用线性规划(LP)松弛的一种新的通用算法框架得到的。这些结果推广到其他的目标和设置,其中客户的某些子集是固定的。我们还给出了硬度结果,表明(α, β)-近似是np困难的,如果α或β最多是某个常数,即使对于广泛研究的欧几里得度量空间的特殊情况也是如此。这表明在某种意义上,(O(1), O(1))-近似是普遍聚类所能得到的最强保证类型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACM Transactions on Algorithms
ACM Transactions on Algorithms COMPUTER SCIENCE, THEORY & METHODS-MATHEMATICS, APPLIED
CiteScore
3.30
自引率
0.00%
发文量
50
审稿时长
6-12 weeks
期刊介绍: ACM Transactions on Algorithms welcomes submissions of original research of the highest quality dealing with algorithms that are inherently discrete and finite, and having mathematical content in a natural way, either in the objective or in the analysis. Most welcome are new algorithms and data structures, new and improved analyses, and complexity results. Specific areas of computation covered by the journal include combinatorial searches and objects; counting; discrete optimization and approximation; randomization and quantum computation; parallel and distributed computation; algorithms for graphs, geometry, arithmetic, number theory, strings; on-line analysis; cryptography; coding; data compression; learning algorithms; methods of algorithmic analysis; discrete algorithms for application areas such as biology, economics, game theory, communication, computer systems and architecture, hardware design, scientific computing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信