Joint weight optimization for partial domain adaptation via kernel statistical distance estimation

IF 6.0 | CAS Tier 1 (Computer Science) | JCR Q1 | Computer Science, Artificial Intelligence
{"title":"Joint weight optimization for partial domain adaptation via kernel statistical distance estimation","authors":"","doi":"10.1016/j.neunet.2024.106739","DOIUrl":null,"url":null,"abstract":"<div><p>The goal of Partial Domain Adaptation (PDA) is to transfer a neural network from a source domain (joint source distribution) to a distinct target domain (joint target distribution), where the source label space subsumes the target label space. To address the PDA problem, existing works have proposed to learn the marginal source weights to match the weighted marginal source distribution to the marginal target distribution. However, this is sub-optimal, since the neural network’s target performance is concerned with the joint distribution disparity, not the marginal distribution disparity. In this paper, we propose a Joint Weight Optimization (JWO) approach that optimizes the joint source weights to match the weighted joint source distribution to the joint target distribution in the neural network’s feature space. To measure the joint distribution disparity, we exploit two statistical distances: the distribution-difference-based <span><math><msup><mrow><mi>L</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span>-distance and the distribution-ratio-based <span><math><msup><mrow><mi>χ</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span>-divergence. Since these two distances are unknown in practice, we propose a Kernel Statistical Distance Estimation (KSDE) method to estimate them from the weighted source data and the target data. Our KSDE method explicitly expresses the two estimated statistical distances as functions of the joint source weights. Therefore, we can optimize the joint weights to minimize the estimated distance functions and reduce the joint distribution disparity. Finally, we achieve the PDA goal by training the neural network on the weighted source data. Experiments on several popular datasets are conducted to demonstrate the effectiveness of our approach. Intro video and Pytorch code are available at <span><span>https://github.com/sentaochen/Joint-Weight-Optimation</span><svg><path></path></svg></span>. Interested readers can also visit <span><span>https://github.com/sentaochen</span><svg><path></path></svg></span> for more source codes of the related domain adaptation, multi-source domain adaptation, and domain generalization approaches.</p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608024006634","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Citations: 0

Abstract

The goal of Partial Domain Adaptation (PDA) is to transfer a neural network from a source domain (joint source distribution) to a distinct target domain (joint target distribution), where the source label space subsumes the target label space. To address the PDA problem, existing works have proposed to learn marginal source weights that match the weighted marginal source distribution to the marginal target distribution. However, this is sub-optimal, since the neural network's target performance depends on the joint distribution disparity, not the marginal distribution disparity. In this paper, we propose a Joint Weight Optimization (JWO) approach that optimizes the joint source weights to match the weighted joint source distribution to the joint target distribution in the neural network's feature space. To measure the joint distribution disparity, we exploit two statistical distances: the distribution-difference-based L2-distance and the distribution-ratio-based χ2-divergence. Since these two distances are unknown in practice, we propose a Kernel Statistical Distance Estimation (KSDE) method to estimate them from the weighted source data and the target data. Our KSDE method explicitly expresses the two estimated statistical distances as functions of the joint source weights. Therefore, we can optimize the joint weights to minimize the estimated distance functions and reduce the joint distribution disparity. Finally, we achieve the PDA goal by training the neural network on the weighted source data. Experiments on several popular datasets demonstrate the effectiveness of our approach. An intro video and PyTorch code are available at https://github.com/sentaochen/Joint-Weight-Optimation. Interested readers can also visit https://github.com/sentaochen for the source code of related domain adaptation, multi-source domain adaptation, and domain generalization approaches.
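To make the weight-optimization idea concrete, the sketch below is a minimal, self-contained illustration rather than the authors' PyTorch implementation. It assumes a Gaussian-kernel plug-in estimate of a squared L2-type distance between the weighted source distribution and the target distribution, written as a quadratic function of the source weights and minimized by projected gradient descent on the probability simplex; the function names (gaussian_kernel, project_to_simplex, estimate_weights) and the toy data are illustrative.

```python
import numpy as np

def gaussian_kernel(X, Y, sigma=1.0):
    # Pairwise Gaussian kernel matrix between rows of X and rows of Y.
    sq = np.sum(X**2, 1)[:, None] + np.sum(Y**2, 1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-sq / (2.0 * sigma**2))

def project_to_simplex(v):
    # Euclidean projection of v onto the probability simplex
    # (sorting-based algorithm of Wang & Carreira-Perpinan, 2013).
    u = np.sort(v)[::-1]
    css = np.cumsum(u)
    rho = np.nonzero(u * np.arange(1, len(v) + 1) > (css - 1.0))[0][-1]
    theta = (css[rho] - 1.0) / (rho + 1.0)
    return np.maximum(v - theta, 0.0)

def estimate_weights(Xs, Xt, sigma=1.0, lr=0.1, iters=500):
    """Optimize source-sample weights w on the simplex to minimize a kernel
    plug-in estimate of the squared L2-type distance between the weighted
    source distribution and the target distribution:
        J(w) = w^T Kss w - (2/m) w^T Kst 1 + const.
    """
    Kss = gaussian_kernel(Xs, Xs, sigma)
    Kst = gaussian_kernel(Xs, Xt, sigma)
    w = np.full(len(Xs), 1.0 / len(Xs))
    for _ in range(iters):
        grad = 2.0 * Kss @ w - 2.0 * Kst.mean(axis=1)
        w = project_to_simplex(w - lr * grad)
    return w

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy partial-DA setting: the source has two clusters but the target
    # covers only one, so the extra source cluster should be down-weighted.
    Xs = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(5, 1, (50, 2))])
    Xt = rng.normal(0, 1, (60, 2))
    w = estimate_weights(Xs, Xt, sigma=1.0)
    print("mean weight, target-relevant source cluster:", w[:50].mean())
    print("mean weight, outlier source cluster:", w[50:].mean())
```

In the paper's full approach, the analogous estimates are formed on joint (feature, label) representations in the network's feature space, and the χ2-divergence provides a ratio-based alternative to the difference-based distance; the sketch only conveys how a statistical distance expressed as a function of the source weights can be minimized directly before training on the weighted source data.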

Source Journal

Neural Networks (Engineering & Technology - Computer Science: Artificial Intelligence)
CiteScore: 13.90
Self-citation rate: 7.70%
Annual articles: 425
Review time: 67 days
Journal introduction: Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. The journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, it aims to encourage the development of biologically-inspired artificial intelligence.