{"title":"基于变分自编码器的特征空间插值数据增强改进在线无损水分估计","authors":"C. R. Wewer, A. Iosifidis","doi":"10.1109/INDIN51400.2023.10218063","DOIUrl":null,"url":null,"abstract":"Data augmentation techniques have proven to be highly effective for many types of problems. However, the development of data augmentation for continuous input-output mappings in regression problems has not received much attention. Insufficient training data remains a significant challenge in machine learning, especially for industrial applications, as the cost of experimentation on the production line can be prohibitively expensive. This acts as a barrier to adoption of machine learning methods in industrial applications. In this study, we propose a data augmentation method called feature space interpolation for continuous input-output regression problems based on discontinuous data sets with clear gaps in the data. The proposed method is applied to a dataset of industrial drying of bulky filter media products. It is shown, that augmenting the original dataset by generated synthetic data points in the gap of the dataset by interpolation in the latent space of a well-trained variational autoencoder (VAE) can improve the performance of state-of-the-art of bulky filter media product moisture content estimation models, as measured by the mean absolute error and mean squared error by 4.82% and 6.32% respectively, and outperforms baseline generative data augmentation methods such as latent space sampling from VAEs.","PeriodicalId":174443,"journal":{"name":"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving Online non-destructive Moisture Content Estimation using Data Augmentation by Feature Space Interpolation with Variational Autoencoders\",\"authors\":\"C. R. Wewer, A. Iosifidis\",\"doi\":\"10.1109/INDIN51400.2023.10218063\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data augmentation techniques have proven to be highly effective for many types of problems. However, the development of data augmentation for continuous input-output mappings in regression problems has not received much attention. Insufficient training data remains a significant challenge in machine learning, especially for industrial applications, as the cost of experimentation on the production line can be prohibitively expensive. This acts as a barrier to adoption of machine learning methods in industrial applications. In this study, we propose a data augmentation method called feature space interpolation for continuous input-output regression problems based on discontinuous data sets with clear gaps in the data. The proposed method is applied to a dataset of industrial drying of bulky filter media products. It is shown, that augmenting the original dataset by generated synthetic data points in the gap of the dataset by interpolation in the latent space of a well-trained variational autoencoder (VAE) can improve the performance of state-of-the-art of bulky filter media product moisture content estimation models, as measured by the mean absolute error and mean squared error by 4.82% and 6.32% respectively, and outperforms baseline generative data augmentation methods such as latent space sampling from VAEs.\",\"PeriodicalId\":174443,\"journal\":{\"name\":\"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDIN51400.2023.10218063\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIN51400.2023.10218063","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving Online non-destructive Moisture Content Estimation using Data Augmentation by Feature Space Interpolation with Variational Autoencoders
Data augmentation techniques have proven to be highly effective for many types of problems. However, the development of data augmentation for continuous input-output mappings in regression problems has not received much attention. Insufficient training data remains a significant challenge in machine learning, especially for industrial applications, as the cost of experimentation on the production line can be prohibitively expensive. This acts as a barrier to adoption of machine learning methods in industrial applications. In this study, we propose a data augmentation method called feature space interpolation for continuous input-output regression problems based on discontinuous data sets with clear gaps in the data. The proposed method is applied to a dataset of industrial drying of bulky filter media products. It is shown, that augmenting the original dataset by generated synthetic data points in the gap of the dataset by interpolation in the latent space of a well-trained variational autoencoder (VAE) can improve the performance of state-of-the-art of bulky filter media product moisture content estimation models, as measured by the mean absolute error and mean squared error by 4.82% and 6.32% respectively, and outperforms baseline generative data augmentation methods such as latent space sampling from VAEs.