通过非凸松弛和反集中实现子行列式最大化

2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS) Pub Date : 2017-07-10 DOI:10.1109/FOCS.2017.98

J. Ebrahimi, D. Straszak, Nisheeth K. Vishnoi

{"title":"通过非凸松弛和反集中实现子行列式最大化","authors":"J. Ebrahimi, D. Straszak, Nisheeth K. Vishnoi","doi":"10.1109/FOCS.2017.98","DOIUrl":null,"url":null,"abstract":"Several fundamental problems that arise in optimization and computer science can be cast as follows: Given vectors v_1,...,v_m in R^d and a constraint family B of subsets of [m], find a set S in B that maximizes the squared volume of the simplex spanned by the vectors in S. A motivating example is the ubiquitous data-summarization problem in machine learning and information retrieval where one is given a collection of feature vectors that represent data such as documents or images. The volume of a collection of vectors is used as a measure of their diversity, and partition or matroid constraints over [m] are imposed in order to ensure resource or fairness constraints. Even with a simple cardinality constraint, the problem becomes NP-hard and has received much attention starting with a result by Khachiyan who gave an r^{O(r)} approximation algorithm for this problem. Recently, Nikolov and Singh presented a convex program and showed how it can be used to estimate the value of the most diverse set when there are multiple cardinality constraints (i.e., when B corresponds to a partition matroid). Their proof of the integrality gap of the convex program relied on an inequality by Gurvits, and was recently extended to regular matroids. The question of whether these estimation algorithms can be converted into the more useful approximation algorithms – that also output a set – remained open.The main contribution of this paper is to give the first approximation algorithms for both partition and regular matroids. We present novel formulations for the subdeterminant maximization problem for these matroids; this reduces them to the problem of finding a point that maximizes the absolute value of a nonconvex function over a Cartesian product of probability simplices. The technical core of our results is a new anti-concentration inequality for dependent random variables that arise from these functions which allows us to relate the optimal value of these nonconvex functions to their value at a random point. Unlike prior work on the constrained subdeterminant maximization problem, our proofs do not rely on real-stability or convexity and could be of independent interest both in algorithms and complexity where anti-concentration phenomena has recently been deployed.","PeriodicalId":311592,"journal":{"name":"2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Subdeterminant Maximization via Nonconvex Relaxations and Anti-Concentration\",\"authors\":\"J. Ebrahimi, D. Straszak, Nisheeth K. Vishnoi\",\"doi\":\"10.1109/FOCS.2017.98\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Several fundamental problems that arise in optimization and computer science can be cast as follows: Given vectors v_1,...,v_m in R^d and a constraint family B of subsets of [m], find a set S in B that maximizes the squared volume of the simplex spanned by the vectors in S. A motivating example is the ubiquitous data-summarization problem in machine learning and information retrieval where one is given a collection of feature vectors that represent data such as documents or images. The volume of a collection of vectors is used as a measure of their diversity, and partition or matroid constraints over [m] are imposed in order to ensure resource or fairness constraints. Even with a simple cardinality constraint, the problem becomes NP-hard and has received much attention starting with a result by Khachiyan who gave an r^{O(r)} approximation algorithm for this problem. Recently, Nikolov and Singh presented a convex program and showed how it can be used to estimate the value of the most diverse set when there are multiple cardinality constraints (i.e., when B corresponds to a partition matroid). Their proof of the integrality gap of the convex program relied on an inequality by Gurvits, and was recently extended to regular matroids. The question of whether these estimation algorithms can be converted into the more useful approximation algorithms – that also output a set – remained open.The main contribution of this paper is to give the first approximation algorithms for both partition and regular matroids. We present novel formulations for the subdeterminant maximization problem for these matroids; this reduces them to the problem of finding a point that maximizes the absolute value of a nonconvex function over a Cartesian product of probability simplices. The technical core of our results is a new anti-concentration inequality for dependent random variables that arise from these functions which allows us to relate the optimal value of these nonconvex functions to their value at a random point. Unlike prior work on the constrained subdeterminant maximization problem, our proofs do not rely on real-stability or convexity and could be of independent interest both in algorithms and complexity where anti-concentration phenomena has recently been deployed.\",\"PeriodicalId\":311592,\"journal\":{\"name\":\"2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FOCS.2017.98\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FOCS.2017.98","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

摘要

最优化和计算机科学中出现的几个基本问题可以描述如下:给定向量v_1，…， R^d中的v_m和[m]子集的约束族B，在B中找到一个集S，该集S使S中的向量所张成的单纯形的平方体积最大化。一个激励的例子是机器学习和信息检索中普遍存在的数据总结问题，其中给出一个特征向量的集合，这些特征向量表示文档或图像等数据。向量集合的体积被用作其多样性的度量，并且在[m]上施加分区或矩阵约束以确保资源或公平性约束。即使有一个简单的基数约束，这个问题也变得np困难，并且从kachiyan给出一个r^{O(r)}近似算法的结果开始，这个问题就受到了广泛的关注。最近，Nikolov和Singh提出了一个凸规划，并展示了当存在多个基数约束时(即当B对应于划分矩阵时)如何使用它来估计最多样化集的值。他们对凸规划的完整性缺口的证明依赖于Gurvits的一个不等式，并且最近被推广到正则拟阵。这些估计算法能否转化为更有用的近似算法的问题–也输出一组–保持开放。本文的主要贡献是给出了划分拟阵和正则拟阵的第一逼近算法。我们提出了这些拟阵的子行列式最大化问题的新公式;这就把它们简化为找到一个点，使非凸函数的绝对值在简单概率的笛卡尔积上最大化。我们的结果的技术核心是由这些函数产生的依赖随机变量的一个新的反集中不等式，它允许我们将这些非凸函数的最优值与它们在随机点的值联系起来。与先前在约束子行列式最大化问题上的工作不同，我们的证明不依赖于实稳定性或凸性，并且在算法和复杂性方面可能具有独立的兴趣，其中最近部署了反集中现象。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Subdeterminant Maximization via Nonconvex Relaxations and Anti-Concentration

Several fundamental problems that arise in optimization and computer science can be cast as follows: Given vectors v_1,...,v_m in R^d and a constraint family B of subsets of [m], find a set S in B that maximizes the squared volume of the simplex spanned by the vectors in S. A motivating example is the ubiquitous data-summarization problem in machine learning and information retrieval where one is given a collection of feature vectors that represent data such as documents or images. The volume of a collection of vectors is used as a measure of their diversity, and partition or matroid constraints over [m] are imposed in order to ensure resource or fairness constraints. Even with a simple cardinality constraint, the problem becomes NP-hard and has received much attention starting with a result by Khachiyan who gave an r^{O(r)} approximation algorithm for this problem. Recently, Nikolov and Singh presented a convex program and showed how it can be used to estimate the value of the most diverse set when there are multiple cardinality constraints (i.e., when B corresponds to a partition matroid). Their proof of the integrality gap of the convex program relied on an inequality by Gurvits, and was recently extended to regular matroids. The question of whether these estimation algorithms can be converted into the more useful approximation algorithms – that also output a set – remained open.The main contribution of this paper is to give the first approximation algorithms for both partition and regular matroids. We present novel formulations for the subdeterminant maximization problem for these matroids; this reduces them to the problem of finding a point that maximizes the absolute value of a nonconvex function over a Cartesian product of probability simplices. The technical core of our results is a new anti-concentration inequality for dependent random variables that arise from these functions which allows us to relate the optimal value of these nonconvex functions to their value at a random point. Unlike prior work on the constrained subdeterminant maximization problem, our proofs do not rely on real-stability or convexity and could be of independent interest both in algorithms and complexity where anti-concentration phenomena has recently been deployed.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)

自引率

0.00%

发文量