公平范围聚类的近似算法

Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning Pub Date : 2023-06-11 DOI:10.48550/arXiv.2306.06778

S. S. Hotegni, S. Mahabadi, A. Vakilian

{"title":"公平范围聚类的近似算法","authors":"S. S. Hotegni, S. Mahabadi, A. Vakilian","doi":"10.48550/arXiv.2306.06778","DOIUrl":null,"url":null,"abstract":"This paper studies the fair range clustering problem in which the data points are from different demographic groups and the goal is to pick $k$ centers with the minimum clustering cost such that each group is at least minimally represented in the centers set and no group dominates the centers set. More precisely, given a set of $n$ points in a metric space $(P,d)$ where each point belongs to one of the $\\ell$ different demographics (i.e., $P = P_1 \\uplus P_2 \\uplus \\cdots \\uplus P_\\ell$) and a set of $\\ell$ intervals $[\\alpha_1, \\beta_1], \\cdots, [\\alpha_\\ell, \\beta_\\ell]$ on desired number of centers from each group, the goal is to pick a set of $k$ centers $C$ with minimum $\\ell_p$-clustering cost (i.e., $(\\sum_{v\\in P} d(v,C)^p)^{1/p}$) such that for each group $i\\in \\ell$, $|C\\cap P_i| \\in [\\alpha_i, \\beta_i]$. In particular, the fair range $\\ell_p$-clustering captures fair range $k$-center, $k$-median and $k$-means as its special cases. In this work, we provide efficient constant factor approximation algorithms for fair range $\\ell_p$-clustering for all values of $p\\in [1,\\infty)$.","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"82 1","pages":"13270-13284"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Approximation Algorithms for Fair Range Clustering\",\"authors\":\"S. S. Hotegni, S. Mahabadi, A. Vakilian\",\"doi\":\"10.48550/arXiv.2306.06778\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper studies the fair range clustering problem in which the data points are from different demographic groups and the goal is to pick $k$ centers with the minimum clustering cost such that each group is at least minimally represented in the centers set and no group dominates the centers set. More precisely, given a set of $n$ points in a metric space $(P,d)$ where each point belongs to one of the $\\\\ell$ different demographics (i.e., $P = P_1 \\\\uplus P_2 \\\\uplus \\\\cdots \\\\uplus P_\\\\ell$) and a set of $\\\\ell$ intervals $[\\\\alpha_1, \\\\beta_1], \\\\cdots, [\\\\alpha_\\\\ell, \\\\beta_\\\\ell]$ on desired number of centers from each group, the goal is to pick a set of $k$ centers $C$ with minimum $\\\\ell_p$-clustering cost (i.e., $(\\\\sum_{v\\\\in P} d(v,C)^p)^{1/p}$) such that for each group $i\\\\in \\\\ell$, $|C\\\\cap P_i| \\\\in [\\\\alpha_i, \\\\beta_i]$. In particular, the fair range $\\\\ell_p$-clustering captures fair range $k$-center, $k$-median and $k$-means as its special cases. In this work, we provide efficient constant factor approximation algorithms for fair range $\\\\ell_p$-clustering for all values of $p\\\\in [1,\\\\infty)$.\",\"PeriodicalId\":74529,\"journal\":{\"name\":\"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning\",\"volume\":\"82 1\",\"pages\":\"13270-13284\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2306.06778\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2306.06778","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

本文研究了来自不同人口统计群体的数据点的公平范围聚类问题，其目标是选择具有最小聚类成本的$k$中心，使每个群体在中心集中至少具有最小的代表性，并且没有群体在中心集中占主导地位。更准确地说，给定度量空间$(P,d)$中的一组$n$点，其中每个点属于$\ell$不同的人口统计数据(即$P = P_1 \uplus P_2 \uplus \cdots \uplus P_\ell$)和一组$\ell$区间(即$[\alpha_1, \beta_1], \cdots, [\alpha_\ell, \beta_\ell]$)，目标是选择一组$k$中心$C$具有最小的$\ell_p$聚类成本(即$(\sum_{v\in P} d(v,C)^p)^{1/p}$)，使得对于每个组$i\in \ell$，$|C\cap P_i| \in [\alpha_i, \beta_i]$。特别是，公平范围$\ell_p$ -聚类捕获公平范围$k$ -中心，$k$ -中位数和$k$ -均值作为其特殊情况。在这项工作中，我们为所有$p\in [1,\infty)$值的公平范围$\ell_p$ -聚类提供了有效的常因子近似算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Approximation Algorithms for Fair Range Clustering

This paper studies the fair range clustering problem in which the data points are from different demographic groups and the goal is to pick $k$ centers with the minimum clustering cost such that each group is at least minimally represented in the centers set and no group dominates the centers set. More precisely, given a set of $n$ points in a metric space $(P,d)$ where each point belongs to one of the $\ell$ different demographics (i.e., $P = P_1 \uplus P_2 \uplus \cdots \uplus P_\ell$) and a set of $\ell$ intervals $[\alpha_1, \beta_1], \cdots, [\alpha_\ell, \beta_\ell]$ on desired number of centers from each group, the goal is to pick a set of $k$ centers $C$ with minimum $\ell_p$-clustering cost (i.e., $(\sum_{v\in P} d(v,C)^p)^{1/p}$) such that for each group $i\in \ell$, $|C\cap P_i| \in [\alpha_i, \beta_i]$. In particular, the fair range $\ell_p$-clustering captures fair range $k$-center, $k$-median and $k$-means as its special cases. In this work, we provide efficient constant factor approximation algorithms for fair range $\ell_p$-clustering for all values of $p\in [1,\infty)$.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning

自引率

0.00%

发文量