Xue-Yu Zhang , Qun-Xiong Zhu , Wei Ke , Yan-Lin He , Ming-Qing Zhang , Yuan Xu
{"title":"Regression loss-assisted conditional style generative adversarial network for virtual sample generation with small data in soft sensing","authors":"Xue-Yu Zhang , Qun-Xiong Zhu , Wei Ke , Yan-Lin He , Ming-Qing Zhang , Yuan Xu","doi":"10.1016/j.engappai.2025.110306","DOIUrl":null,"url":null,"abstract":"<div><div>Existing methods that extend virtual sample pools to address small sample problem caused by sample atypicality and uneven distribution often overlook data sparsity and inverse sample generation challenges, which limits the accuracy of subsequent modeling. To address above problem, we propose a novel regression-assisted conditional style generative adversarial network (RAC-StyleGAN). The proposed method leverages the strengths of StyleGAN in latent space mapping to enhance data diversity and granularity, while incorporating regression-assisted conditions to improve modeling performance. Specifically, RAC-StyleGAN utilizes kernel density estimation and radial basis function interpolation to ensure that the generated output variables are uniformly distributed. Based on the principle of inverse transformation, the interpolated output variables are then used as conditional inputs for the StyleGAN model, generating virtual input variables that faithfully reflect the marginal distribution of the original data. Furthermore, to preserve the complex nonlinear relationships between input and output variables, RAC-StyleGAN integrates a regression loss strategy based on empirical risk minimization into the StyleGAN framework. By fine-tuning the generation process, the soft-sensing model effectively captures the nonlinear mapping between inputs and outputs. Experimental validations on synthetic nonlinear functions, University of California Irvine machine learning (UCI) datasets, and a real-world purified terephthalic acid (PTA) solvent system demonstrate that RAC-StyleGAN effectively generates high-quality virtual samples, significantly enhancing the modeling performance.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"147 ","pages":"Article 110306"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625003069","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Existing methods that extend virtual sample pools to address small sample problem caused by sample atypicality and uneven distribution often overlook data sparsity and inverse sample generation challenges, which limits the accuracy of subsequent modeling. To address above problem, we propose a novel regression-assisted conditional style generative adversarial network (RAC-StyleGAN). The proposed method leverages the strengths of StyleGAN in latent space mapping to enhance data diversity and granularity, while incorporating regression-assisted conditions to improve modeling performance. Specifically, RAC-StyleGAN utilizes kernel density estimation and radial basis function interpolation to ensure that the generated output variables are uniformly distributed. Based on the principle of inverse transformation, the interpolated output variables are then used as conditional inputs for the StyleGAN model, generating virtual input variables that faithfully reflect the marginal distribution of the original data. Furthermore, to preserve the complex nonlinear relationships between input and output variables, RAC-StyleGAN integrates a regression loss strategy based on empirical risk minimization into the StyleGAN framework. By fine-tuning the generation process, the soft-sensing model effectively captures the nonlinear mapping between inputs and outputs. Experimental validations on synthetic nonlinear functions, University of California Irvine machine learning (UCI) datasets, and a real-world purified terephthalic acid (PTA) solvent system demonstrate that RAC-StyleGAN effectively generates high-quality virtual samples, significantly enhancing the modeling performance.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.