Hardness of Learning Boolean Functions from Label Proportions

V. Guruswami, Rishi Saket
{"title":"Hardness of Learning Boolean Functions from Label Proportions","authors":"V. Guruswami, Rishi Saket","doi":"10.4230/LIPIcs.FSTTCS.2023.37","DOIUrl":null,"url":null,"abstract":"In recent years the framework of learning from label proportions (LLP) has been gaining importance in machine learning. In this setting, the training examples are aggregated into subsets or bags and only the average label per bag is available for learning an example-level predictor. This generalizes traditional PAC learning which is the special case of unit-sized bags. The computational learning aspects of LLP were studied in recent works (Saket, NeurIPS'21; Saket, NeurIPS'22) which showed algorithms and hardness for learning halfspaces in the LLP setting. In this work we focus on the intractability of LLP learning Boolean functions. Our first result shows that given a collection of bags of size at most $2$ which are consistent with an OR function, it is NP-hard to find a CNF of constantly many clauses which satisfies any constant-fraction of the bags. This is in contrast with the work of (Saket, NeurIPS'21) which gave a $(2/5)$-approximation for learning ORs using a halfspace. Thus, our result provides a separation between constant clause CNFs and halfspaces as hypotheses for LLP learning ORs. Next, we prove the hardness of satisfying more than $1/2 + o(1)$ fraction of such bags using a $t$-DNF (i.e. DNF where each term has $\\leq t$ literals) for any constant $t$. In usual PAC learning such a hardness was known (Khot-Saket, FOCS'08) only for learning noisy ORs. We also study the learnability of parities and show that it is NP-hard to satisfy more than $(q/2^{q-1} + o(1))$-fraction of $q$-sized bags which are consistent with a parity using a parity, while a random parity based algorithm achieves a $(1/2^{q-2})$-approximation.","PeriodicalId":175000,"journal":{"name":"Foundations of Software Technology and Theoretical Computer Science","volume":"10 5","pages":"37:1-37:15"},"PeriodicalIF":0.0000,"publicationDate":"2024-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Foundations of Software Technology and Theoretical Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/LIPIcs.FSTTCS.2023.37","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

In recent years, the framework of learning from label proportions (LLP) has been gaining importance in machine learning. In this setting, the training examples are aggregated into subsets or bags, and only the average label per bag is available for learning an example-level predictor. This generalizes traditional PAC learning, which is the special case of unit-sized bags. The computational learning aspects of LLP were studied in recent works (Saket, NeurIPS'21; Saket, NeurIPS'22), which gave algorithms and hardness results for learning halfspaces in the LLP setting. In this work we focus on the intractability of LLP learning of Boolean functions. Our first result shows that given a collection of bags of size at most $2$ which are consistent with an OR function, it is NP-hard to find a CNF with a constant number of clauses that satisfies any constant fraction of the bags. This is in contrast with the work of Saket (NeurIPS'21), which gave a $(2/5)$-approximation for learning ORs using a halfspace. Thus, our result provides a separation between constant-clause CNFs and halfspaces as hypotheses for LLP learning of ORs. Next, we prove that it is hard to satisfy more than a $(1/2 + o(1))$-fraction of such bags using a $t$-DNF (i.e., a DNF in which each term has at most $t$ literals) for any constant $t$. In standard PAC learning, such hardness was known (Khot-Saket, FOCS'08) only for learning noisy ORs. We also study the learnability of parities and show that it is NP-hard to satisfy more than a $(q/2^{q-1} + o(1))$-fraction of $q$-sized bags consistent with a parity using a parity as the hypothesis, whereas an algorithm based on a random parity achieves a $(1/2^{q-2})$-approximation.
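To make the LLP setting concrete, below is a minimal Python sketch of what it means for a hypothesis to satisfy a bag, using bags of size at most $2$ consistent with an OR function, as in the first result. The formalization here (a bag as a list of examples plus its average label, with a hypothesis satisfying the bag when its mean prediction matches that average exactly) and all names (`Bag`, `satisfies`, `or_fn`) are illustrative assumptions, not definitions taken from the paper.

```python
# A minimal sketch of the LLP setting, under the assumptions stated above.
from dataclasses import dataclass
from typing import Callable, List, Tuple

Example = Tuple[int, ...]           # a Boolean vector in {0,1}^n
Hypothesis = Callable[[Example], int]

@dataclass
class Bag:
    examples: List[Example]         # the aggregated training examples
    avg_label: float                # average (hidden example-level) label

def satisfies(h: Hypothesis, bag: Bag) -> bool:
    """A hypothesis satisfies a bag if its mean prediction over the bag
    equals the bag's average label (illustrative formalization)."""
    mean_pred = sum(h(x) for x in bag.examples) / len(bag.examples)
    return abs(mean_pred - bag.avg_label) < 1e-9

# Bags of size at most 2 that are consistent with the OR of the two bits.
or_fn: Hypothesis = lambda x: int(any(x))
bags = [
    Bag([(0, 0)], 0.0),             # OR = 0
    Bag([(1, 0), (0, 0)], 0.5),     # OR values 1 and 0 average to 1/2
    Bag([(1, 1), (0, 1)], 1.0),     # both ORs are 1
]
assert all(satisfies(or_fn, b) for b in bags)
```

Note that with unit-sized bags the condition degenerates to ordinary label consistency, matching the remark that LLP generalizes PAC learning.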