{"title":"带有惩罚的k-means问题的改进原对偶逼近算法","authors":"Chunying Ren, Dachuan Xu, D. Du, Min Li","doi":"10.1017/S0960129521000104","DOIUrl":null,"url":null,"abstract":"Abstract In the k-means problem with penalties, we are given a data set $${\\cal D} \\subseteq \\mathbb{R}^\\ell $$ of n points where each point $$j \\in {\\cal D}$$ is associated with a penalty cost pj and an integer k. The goal is to choose a set $${\\rm{C}}S \\subseteq {{\\cal R}^\\ell }$$ with |CS| ≤ k and a penalized subset $${{\\cal D}_p} \\subseteq {\\cal D}$$ to minimize the sum of the total squared distance from the points in D / Dp to CS and the total penalty cost of points in Dp, namely $$\\sum\\nolimits_{j \\in {\\cal D}\\backslash {{\\cal D}_p}} {d^2}(j,{\\rm{C}}S) + \\sum\\nolimits_{j \\in {{\\cal D}_p}} {p_j}$$. We employ the primal-dual technique to give a pseudo-polynomial time algorithm with an approximation ratio of (6.357+ε) for the k-means problem with penalties, improving the previous best approximation ratio 19.849+∊ for this problem given by Feng et al. in Proceedings of FAW (2019).","PeriodicalId":49855,"journal":{"name":"Mathematical Structures in Computer Science","volume":"32 1","pages":"151 - 163"},"PeriodicalIF":0.4000,"publicationDate":"2021-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An improved primal-dual approximation algorithm for the k-means problem with penalties\",\"authors\":\"Chunying Ren, Dachuan Xu, D. Du, Min Li\",\"doi\":\"10.1017/S0960129521000104\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract In the k-means problem with penalties, we are given a data set $${\\\\cal D} \\\\subseteq \\\\mathbb{R}^\\\\ell $$ of n points where each point $$j \\\\in {\\\\cal D}$$ is associated with a penalty cost pj and an integer k. The goal is to choose a set $${\\\\rm{C}}S \\\\subseteq {{\\\\cal R}^\\\\ell }$$ with |CS| ≤ k and a penalized subset $${{\\\\cal D}_p} \\\\subseteq {\\\\cal D}$$ to minimize the sum of the total squared distance from the points in D / Dp to CS and the total penalty cost of points in Dp, namely $$\\\\sum\\\\nolimits_{j \\\\in {\\\\cal D}\\\\backslash {{\\\\cal D}_p}} {d^2}(j,{\\\\rm{C}}S) + \\\\sum\\\\nolimits_{j \\\\in {{\\\\cal D}_p}} {p_j}$$. We employ the primal-dual technique to give a pseudo-polynomial time algorithm with an approximation ratio of (6.357+ε) for the k-means problem with penalties, improving the previous best approximation ratio 19.849+∊ for this problem given by Feng et al. in Proceedings of FAW (2019).\",\"PeriodicalId\":49855,\"journal\":{\"name\":\"Mathematical Structures in Computer Science\",\"volume\":\"32 1\",\"pages\":\"151 - 163\"},\"PeriodicalIF\":0.4000,\"publicationDate\":\"2021-08-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical Structures in Computer Science\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1017/S0960129521000104\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Structures in Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1017/S0960129521000104","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
An improved primal-dual approximation algorithm for the k-means problem with penalties
Abstract In the k-means problem with penalties, we are given a data set $${\cal D} \subseteq \mathbb{R}^\ell $$ of n points where each point $$j \in {\cal D}$$ is associated with a penalty cost pj and an integer k. The goal is to choose a set $${\rm{C}}S \subseteq {{\cal R}^\ell }$$ with |CS| ≤ k and a penalized subset $${{\cal D}_p} \subseteq {\cal D}$$ to minimize the sum of the total squared distance from the points in D / Dp to CS and the total penalty cost of points in Dp, namely $$\sum\nolimits_{j \in {\cal D}\backslash {{\cal D}_p}} {d^2}(j,{\rm{C}}S) + \sum\nolimits_{j \in {{\cal D}_p}} {p_j}$$. We employ the primal-dual technique to give a pseudo-polynomial time algorithm with an approximation ratio of (6.357+ε) for the k-means problem with penalties, improving the previous best approximation ratio 19.849+∊ for this problem given by Feng et al. in Proceedings of FAW (2019).
期刊介绍:
Mathematical Structures in Computer Science is a journal of theoretical computer science which focuses on the application of ideas from the structural side of mathematics and mathematical logic to computer science. The journal aims to bridge the gap between theoretical contributions and software design, publishing original papers of a high standard and broad surveys with original perspectives in all areas of computing, provided that ideas or results from logic, algebra, geometry, category theory or other areas of logic and mathematics form a basis for the work. The journal welcomes applications to computing based on the use of specific mathematical structures (e.g. topological and order-theoretic structures) as well as on proof-theoretic notions or results.