Practicality of generalization guarantees for unsupervised domain adaptation with neural networks

Adam Breitholtz, Fredrik D. Johansson
DOI: 10.48550/arXiv.2303.08720
Journal: Transactions on Machine Learning Research (Trans. Mach. Learn. Res.)
Publication date: 2023-03-15
Citations: 1

Abstract

Understanding generalization is crucial to confidently engineer and deploy machine learning models, especially when deployment implies a shift in the data domain. For such domain adaptation problems, we seek generalization bounds which are tractably computable and tight. If these desiderata can be reached, the bounds can serve as guarantees for adequate performance in deployment. However, in applications where deep neural networks are the models of choice, deriving results which fulfill these desiderata remains an unresolved challenge; most existing bounds are either vacuous or have non-estimable terms, even in favorable conditions. In this work, we evaluate existing bounds from the literature with potential to satisfy our desiderata on domain adaptation image classification tasks, where deep neural networks are preferred. We find that all bounds are vacuous and that sample generalization terms account for much of the observed looseness, especially when these terms interact with measures of domain shift. To overcome this and arrive at the tightest possible results, we combine each bound with recent data-dependent PAC-Bayes analysis, greatly improving the guarantees. We find that, when domain overlap can be assumed, a simple importance weighting extension of previous work provides the tightest estimable bound. Finally, we study which terms dominate the bounds and identify possible directions for further improvement.
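The importance-weighting idea mentioned in the abstract can be sketched as follows. This is an illustrative reconstruction under standard covariate-shift assumptions, not the authors' implementation: the helper name `importance_weighted_risk` and the toy numbers are hypothetical, and the weights w(x) = p_target(x) / p_source(x) are assumed given (in practice they must be estimated, and they are well-defined only when the domains overlap).

```python
import numpy as np

def importance_weighted_risk(losses, weights):
    """Importance-weighted empirical risk on source-domain samples.

    Each per-sample loss is reweighted by w(x) = p_target(x) / p_source(x),
    so the weighted mean is an unbiased estimate of the target-domain risk
    whenever the source distribution covers the target (domain overlap).
    """
    losses = np.asarray(losses, dtype=float)
    weights = np.asarray(weights, dtype=float)
    return float(np.mean(weights * losses))

# Toy example: with uniform weights (no shift), the estimator reduces
# to the ordinary empirical risk on the source sample.
losses = [0.2, 0.4, 0.6, 0.8]
print(importance_weighted_risk(losses, [1.0, 1.0, 1.0, 1.0]))  # 0.5
```

Under domain shift, samples that are more likely under the target than the source receive weights above 1, which is what lets the source-only estimate (and the bounds built on it) speak to target-domain performance.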