{"title":"稳定聚类实例的局部结构","authors":"Vincent Cohen-Addad, Chris Schwiegelshohn","doi":"10.1109/FOCS.2017.14","DOIUrl":null,"url":null,"abstract":"We study the classic k-median and k-means clustering objectives in the beyond-worst-case scenario. We consider three well-studied notions of structured data that aim at characterizing real-world inputs:• Distribution Stability (introduced by Awasthi, Blum, and Sheffet, FOCS 2010)• Spectral Separability (introduced by Kumar and Kannan, FOCS 2010)• Perturbation Resilience (introduced by Bilu and Linial, ICS 2010)We prove structural results showing that inputs satisfying at least one of the conditions are inherently local. Namely, for any such input, any local optimum is close both in term of structure and in term of objective value to the global optima.As a corollary we obtain that the widely-used Local Search algorithm has strong performance guarantees for both the tasks of recovering the underlying optimal clustering and obtaining a clustering of small cost. This is a significant step toward understanding the success of local search heuristics in clustering applications.","PeriodicalId":311592,"journal":{"name":"2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)","volume":"261 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"44","resultStr":"{\"title\":\"On the Local Structure of Stable Clustering Instances\",\"authors\":\"Vincent Cohen-Addad, Chris Schwiegelshohn\",\"doi\":\"10.1109/FOCS.2017.14\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We study the classic k-median and k-means clustering objectives in the beyond-worst-case scenario. We consider three well-studied notions of structured data that aim at characterizing real-world inputs:• Distribution Stability (introduced by Awasthi, Blum, and Sheffet, FOCS 2010)• Spectral Separability (introduced by Kumar and Kannan, FOCS 2010)• Perturbation Resilience (introduced by Bilu and Linial, ICS 2010)We prove structural results showing that inputs satisfying at least one of the conditions are inherently local. Namely, for any such input, any local optimum is close both in term of structure and in term of objective value to the global optima.As a corollary we obtain that the widely-used Local Search algorithm has strong performance guarantees for both the tasks of recovering the underlying optimal clustering and obtaining a clustering of small cost. This is a significant step toward understanding the success of local search heuristics in clustering applications.\",\"PeriodicalId\":311592,\"journal\":{\"name\":\"2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)\",\"volume\":\"261 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-01-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"44\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FOCS.2017.14\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FOCS.2017.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 44
摘要
我们研究了超越最坏情况下的经典k中值和k均值聚类目标。我们考虑了三个经过充分研究的结构化数据概念,旨在表征现实世界的输入:•分配稳定性(Awasthi, Blum, and Sheffet, FOCS 2010引入)•光谱可分性(由Kumar和Kannan引入,FOCS 2010)•扰动弹性(由Bilu和Linial引入,ICS 2010)我们证明了结构结果,表明满足至少一个条件的输入本质上是局部的。也就是说,对于任何这样的输入,任何局部最优在结构和目标值上都接近全局最优。结果表明,广泛使用的局部搜索算法对于恢复底层最优聚类和获得小代价聚类都有很强的性能保证。这是理解本地搜索启发式在集群应用程序中的成功的重要一步。
On the Local Structure of Stable Clustering Instances
We study the classic k-median and k-means clustering objectives in the beyond-worst-case scenario. We consider three well-studied notions of structured data that aim at characterizing real-world inputs:• Distribution Stability (introduced by Awasthi, Blum, and Sheffet, FOCS 2010)• Spectral Separability (introduced by Kumar and Kannan, FOCS 2010)• Perturbation Resilience (introduced by Bilu and Linial, ICS 2010)We prove structural results showing that inputs satisfying at least one of the conditions are inherently local. Namely, for any such input, any local optimum is close both in term of structure and in term of objective value to the global optima.As a corollary we obtain that the widely-used Local Search algorithm has strong performance guarantees for both the tasks of recovering the underlying optimal clustering and obtaining a clustering of small cost. This is a significant step toward understanding the success of local search heuristics in clustering applications.