通过贝叶斯理论实现结构损伤检测中特征和标签移动下的模型泛化最大化

IF 7.9 1区工程技术 Q1 ENGINEERING, MECHANICAL

Mechanical Systems and Signal Processing Pub Date : 2024-10-20 DOI:10.1016/j.ymssp.2024.112052

Xiaoyou Wang , Jinyang Jiao , Xiaoqing Zhou , Yong Xia

{"title":"通过贝叶斯理论实现结构损伤检测中特征和标签移动下的模型泛化最大化","authors":"Xiaoyou Wang , Jinyang Jiao , Xiaoqing Zhou , Yong Xia","doi":"10.1016/j.ymssp.2024.112052","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning models face performance degradation when migrating to datasets with different distributions from training data. This challenge limits their applications because recollecting labeled data to retrain models is expensive and time-consuming. Domain generalization (DG) aims to learn a predictive model that can extract invariant features across source domains and then generalize to related but unseen target domains. However, most existing feature-invariant DG methods rely on unrealistic assumptions (e.g., stable feature distribution or no label shift) and simplify the problem to learn either invariant feature marginal or conditional distributions. In practice, both feature and label shifts may exist, rendering these assumptions invalid. This study develops a novel DG method to simultaneously consider conditional, marginal, and label shifts. With Bayes’ theorem, DG is interpreted as a posterior distribution alignment problem, which is derived by the likelihood function, evidence, and prior. The likelihood function and evidence correspond to feature conditional and marginal distributions, respectively, which are estimated by exploiting variational Bayesian inference. The conditional distributions across domains are aligned by minimizing the Kullback–Leibler divergence, and the marginal distributions across domains are aligned through moment minimization. The label prior shift is estimated by the label smoothing mechanism and class-wise prototype learning. Consequently, DG is achieved by aligning the label posterior distribution according to the Bayesian equation. Numerical and experimental examples demonstrate that the developed method outperforms state-of-the-art DG methods in damage detection of both mechanical and civil structures. The method can be extended to DG tasks in other fields.</div></div>","PeriodicalId":51124,"journal":{"name":"Mechanical Systems and Signal Processing","volume":"224 ","pages":"Article 112052"},"PeriodicalIF":7.9000,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Maximizing model generalization under feature and label shifts for structural damage detection via Bayesian theory\",\"authors\":\"Xiaoyou Wang , Jinyang Jiao , Xiaoqing Zhou , Yong Xia\",\"doi\":\"10.1016/j.ymssp.2024.112052\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Machine learning models face performance degradation when migrating to datasets with different distributions from training data. This challenge limits their applications because recollecting labeled data to retrain models is expensive and time-consuming. Domain generalization (DG) aims to learn a predictive model that can extract invariant features across source domains and then generalize to related but unseen target domains. However, most existing feature-invariant DG methods rely on unrealistic assumptions (e.g., stable feature distribution or no label shift) and simplify the problem to learn either invariant feature marginal or conditional distributions. In practice, both feature and label shifts may exist, rendering these assumptions invalid. This study develops a novel DG method to simultaneously consider conditional, marginal, and label shifts. With Bayes’ theorem, DG is interpreted as a posterior distribution alignment problem, which is derived by the likelihood function, evidence, and prior. The likelihood function and evidence correspond to feature conditional and marginal distributions, respectively, which are estimated by exploiting variational Bayesian inference. The conditional distributions across domains are aligned by minimizing the Kullback–Leibler divergence, and the marginal distributions across domains are aligned through moment minimization. The label prior shift is estimated by the label smoothing mechanism and class-wise prototype learning. Consequently, DG is achieved by aligning the label posterior distribution according to the Bayesian equation. Numerical and experimental examples demonstrate that the developed method outperforms state-of-the-art DG methods in damage detection of both mechanical and civil structures. The method can be extended to DG tasks in other fields.</div></div>\",\"PeriodicalId\":51124,\"journal\":{\"name\":\"Mechanical Systems and Signal Processing\",\"volume\":\"224 \",\"pages\":\"Article 112052\"},\"PeriodicalIF\":7.9000,\"publicationDate\":\"2024-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mechanical Systems and Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0888327024009506\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MECHANICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mechanical Systems and Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0888327024009506","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MECHANICAL","Score":null,"Total":0}

引用次数: 0

摘要

当迁移到与训练数据分布不同的数据集时，机器学习模型会面临性能下降的问题。这一挑战限制了它们的应用，因为重新收集标注数据以重新训练模型既昂贵又耗时。领域泛化（DG）旨在学习一种预测模型，该模型可以提取源领域的不变特征，然后泛化到相关但未见过的目标领域。然而，大多数现有的特征不变 DG 方法都依赖于不切实际的假设（如稳定的特征分布或无标签偏移），并将问题简化为学习不变的特征边际分布或条件分布。实际上，特征和标签都可能存在偏移，从而使这些假设失效。本研究开发了一种新颖的 DG 方法，可同时考虑条件、边际和标签偏移。利用贝叶斯定理，DG 被解释为一个后验分布对齐问题，它由似然函数、证据和先验得出。似然函数和证据分别对应于特征的条件分布和边际分布，它们是通过利用变异贝叶斯推理估计出来的。跨域的条件分布通过最小化库尔巴克-莱伯勒发散进行对齐，跨域的边际分布则通过矩最小化进行对齐。标签先验偏移是通过标签平滑机制和分类原型学习来估计的。因此，DG 是根据贝叶斯方程对齐标签后验分布而实现的。数值和实验实例表明，所开发的方法在机械和民用结构的损伤检测方面优于最先进的 DG 方法。该方法可扩展到其他领域的 DG 任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Maximizing model generalization under feature and label shifts for structural damage detection via Bayesian theory

Machine learning models face performance degradation when migrating to datasets with different distributions from training data. This challenge limits their applications because recollecting labeled data to retrain models is expensive and time-consuming. Domain generalization (DG) aims to learn a predictive model that can extract invariant features across source domains and then generalize to related but unseen target domains. However, most existing feature-invariant DG methods rely on unrealistic assumptions (e.g., stable feature distribution or no label shift) and simplify the problem to learn either invariant feature marginal or conditional distributions. In practice, both feature and label shifts may exist, rendering these assumptions invalid. This study develops a novel DG method to simultaneously consider conditional, marginal, and label shifts. With Bayes’ theorem, DG is interpreted as a posterior distribution alignment problem, which is derived by the likelihood function, evidence, and prior. The likelihood function and evidence correspond to feature conditional and marginal distributions, respectively, which are estimated by exploiting variational Bayesian inference. The conditional distributions across domains are aligned by minimizing the Kullback–Leibler divergence, and the marginal distributions across domains are aligned through moment minimization. The label prior shift is estimated by the label smoothing mechanism and class-wise prototype learning. Consequently, DG is achieved by aligning the label posterior distribution according to the Bayesian equation. Numerical and experimental examples demonstrate that the developed method outperforms state-of-the-art DG methods in damage detection of both mechanical and civil structures. The method can be extended to DG tasks in other fields.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Mechanical Systems and Signal Processing 工程技术-工程：机械

CiteScore

14.80

自引率

13.10%

发文量

1183

审稿时长

5.4 months

期刊介绍： Journal Name: Mechanical Systems and Signal Processing (MSSP) Interdisciplinary Focus: Mechanical, Aerospace, and Civil Engineering Purpose:Reporting scientific advancements of the highest quality Arising from new techniques in sensing, instrumentation, signal processing, modelling, and control of dynamic systems