基于变分自编码器的特征空间插值数据增强改进在线无损水分估计

2023 IEEE 21st International Conference on Industrial Informatics (INDIN) Pub Date : 2023-07-18 DOI:10.1109/INDIN51400.2023.10218063

C. R. Wewer, A. Iosifidis

{"title":"基于变分自编码器的特征空间插值数据增强改进在线无损水分估计","authors":"C. R. Wewer, A. Iosifidis","doi":"10.1109/INDIN51400.2023.10218063","DOIUrl":null,"url":null,"abstract":"Data augmentation techniques have proven to be highly effective for many types of problems. However, the development of data augmentation for continuous input-output mappings in regression problems has not received much attention. Insufficient training data remains a significant challenge in machine learning, especially for industrial applications, as the cost of experimentation on the production line can be prohibitively expensive. This acts as a barrier to adoption of machine learning methods in industrial applications. In this study, we propose a data augmentation method called feature space interpolation for continuous input-output regression problems based on discontinuous data sets with clear gaps in the data. The proposed method is applied to a dataset of industrial drying of bulky filter media products. It is shown, that augmenting the original dataset by generated synthetic data points in the gap of the dataset by interpolation in the latent space of a well-trained variational autoencoder (VAE) can improve the performance of state-of-the-art of bulky filter media product moisture content estimation models, as measured by the mean absolute error and mean squared error by 4.82% and 6.32% respectively, and outperforms baseline generative data augmentation methods such as latent space sampling from VAEs.","PeriodicalId":174443,"journal":{"name":"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving Online non-destructive Moisture Content Estimation using Data Augmentation by Feature Space Interpolation with Variational Autoencoders\",\"authors\":\"C. R. Wewer, A. Iosifidis\",\"doi\":\"10.1109/INDIN51400.2023.10218063\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data augmentation techniques have proven to be highly effective for many types of problems. However, the development of data augmentation for continuous input-output mappings in regression problems has not received much attention. Insufficient training data remains a significant challenge in machine learning, especially for industrial applications, as the cost of experimentation on the production line can be prohibitively expensive. This acts as a barrier to adoption of machine learning methods in industrial applications. In this study, we propose a data augmentation method called feature space interpolation for continuous input-output regression problems based on discontinuous data sets with clear gaps in the data. The proposed method is applied to a dataset of industrial drying of bulky filter media products. It is shown, that augmenting the original dataset by generated synthetic data points in the gap of the dataset by interpolation in the latent space of a well-trained variational autoencoder (VAE) can improve the performance of state-of-the-art of bulky filter media product moisture content estimation models, as measured by the mean absolute error and mean squared error by 4.82% and 6.32% respectively, and outperforms baseline generative data augmentation methods such as latent space sampling from VAEs.\",\"PeriodicalId\":174443,\"journal\":{\"name\":\"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDIN51400.2023.10218063\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIN51400.2023.10218063","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

数据增强技术已被证明对许多类型的问题都非常有效。然而，对于回归问题中连续输入-输出映射的数据扩充方法的发展并没有受到太多的关注。训练数据不足仍然是机器学习的一个重大挑战，特别是在工业应用中，因为在生产线上进行实验的成本可能非常昂贵。这是在工业应用中采用机器学习方法的障碍。在这项研究中，我们提出了一种称为特征空间插值的数据增强方法，用于基于数据中有明显间隙的不连续数据集的连续输入输出回归问题。提出的方法应用于工业干燥大体积过滤介质产品的数据集。研究表明，通过在训练良好的变分自编码器(VAE)的潜在空间内插值，在数据集的间隙中生成合成数据点来增强原始数据集，可以使最先进的大块过滤介质产品水分含量估计模型的性能提高4.82%，平均绝对误差和均方误差分别提高6.32%。并且优于基线生成数据增强方法，例如来自vae的潜在空间采样。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Improving Online non-destructive Moisture Content Estimation using Data Augmentation by Feature Space Interpolation with Variational Autoencoders

Data augmentation techniques have proven to be highly effective for many types of problems. However, the development of data augmentation for continuous input-output mappings in regression problems has not received much attention. Insufficient training data remains a significant challenge in machine learning, especially for industrial applications, as the cost of experimentation on the production line can be prohibitively expensive. This acts as a barrier to adoption of machine learning methods in industrial applications. In this study, we propose a data augmentation method called feature space interpolation for continuous input-output regression problems based on discontinuous data sets with clear gaps in the data. The proposed method is applied to a dataset of industrial drying of bulky filter media products. It is shown, that augmenting the original dataset by generated synthetic data points in the gap of the dataset by interpolation in the latent space of a well-trained variational autoencoder (VAE) can improve the performance of state-of-the-art of bulky filter media product moisture content estimation models, as measured by the mean absolute error and mean squared error by 4.82% and 6.32% respectively, and outperforms baseline generative data augmentation methods such as latent space sampling from VAEs.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 IEEE 21st International Conference on Industrial Informatics (INDIN)

自引率

0.00%

发文量