工业环境中样品稀缺性的按需多类成像

Proceedings of the 2023 6th International Conference on Machine Vision and Applications Pub Date : 2023-03-10 DOI:10.1145/3589572.3589573

Joan Orti, F. Moreno-Noguer, V. Puig

{"title":"工业环境中样品稀缺性的按需多类成像","authors":"Joan Orti, F. Moreno-Noguer, V. Puig","doi":"10.1145/3589572.3589573","DOIUrl":null,"url":null,"abstract":"While technology pushes towards controlling more and more complex industrial processes, data related issues are still a non-trivial problem to address. In this sense, class imbalances and scarcity of data occupy a lot of time and resources when designing a solution. In the surface defect detection problem, due to the random nature of the process, both situations are very common as well as a general decompensation between the image size and the defect size. In this work, we address a segmentation and classification problem with very few available images from every class, proposing a two-step process. First, by generating fake images using the guided-crop image augmentation method, we train for every single class a Pix2pix model in order to perform a mask-to-image translation. Once the model is trained, we also designed a automatic mask generator, to mimic the shapes of the dataset and thus create real-like images for every class using the pretrained networks. Eventually, using a context aggregation network, we use these fake images as our training set, changing every certain epochs the amount of images of every class on-demand, depending on the evolution of the individual loss term of every class. As a result, we accomplished stable and robust segmentation and classification metrics, regardless of the amount of data available for training, using the NEU Micro surface defect database.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On-Demand Multiclass Imaging for Sample Scarcity in Industrial Environments\",\"authors\":\"Joan Orti, F. Moreno-Noguer, V. Puig\",\"doi\":\"10.1145/3589572.3589573\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"While technology pushes towards controlling more and more complex industrial processes, data related issues are still a non-trivial problem to address. In this sense, class imbalances and scarcity of data occupy a lot of time and resources when designing a solution. In the surface defect detection problem, due to the random nature of the process, both situations are very common as well as a general decompensation between the image size and the defect size. In this work, we address a segmentation and classification problem with very few available images from every class, proposing a two-step process. First, by generating fake images using the guided-crop image augmentation method, we train for every single class a Pix2pix model in order to perform a mask-to-image translation. Once the model is trained, we also designed a automatic mask generator, to mimic the shapes of the dataset and thus create real-like images for every class using the pretrained networks. Eventually, using a context aggregation network, we use these fake images as our training set, changing every certain epochs the amount of images of every class on-demand, depending on the evolution of the individual loss term of every class. As a result, we accomplished stable and robust segmentation and classification metrics, regardless of the amount of data available for training, using the NEU Micro surface defect database.\",\"PeriodicalId\":296325,\"journal\":{\"name\":\"Proceedings of the 2023 6th International Conference on Machine Vision and Applications\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 6th International Conference on Machine Vision and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3589572.3589573\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3589572.3589573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

虽然技术推动着控制越来越复杂的工业过程，但与数据相关的问题仍然是一个不容忽视的问题。从这个意义上说，在设计解决方案时，类的不平衡和数据的稀缺性占用了大量的时间和资源。在表面缺陷检测问题中，由于过程的随机性，这两种情况都很常见，并且图像尺寸与缺陷尺寸之间存在普遍的失补偿。在这项工作中，我们解决了每个类别中很少可用图像的分割和分类问题，提出了一个两步过程。首先，通过使用引导裁剪图像增强方法生成假图像，我们为每个类训练一个Pix2pix模型，以便执行蒙版到图像的转换。一旦模型被训练，我们还设计了一个自动掩码生成器，来模仿数据集的形状，从而使用预训练的网络为每个类创建类似真实的图像。最后，使用上下文聚合网络，我们使用这些假图像作为我们的训练集，根据每个类的个体损失项的演变，在每个特定的时代按需改变每个类的图像数量。结果，我们完成了稳定和健壮的分割和分类度量，不管训练可用的数据量有多少，使用NEU Micro表面缺陷数据库。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

On-Demand Multiclass Imaging for Sample Scarcity in Industrial Environments

While technology pushes towards controlling more and more complex industrial processes, data related issues are still a non-trivial problem to address. In this sense, class imbalances and scarcity of data occupy a lot of time and resources when designing a solution. In the surface defect detection problem, due to the random nature of the process, both situations are very common as well as a general decompensation between the image size and the defect size. In this work, we address a segmentation and classification problem with very few available images from every class, proposing a two-step process. First, by generating fake images using the guided-crop image augmentation method, we train for every single class a Pix2pix model in order to perform a mask-to-image translation. Once the model is trained, we also designed a automatic mask generator, to mimic the shapes of the dataset and thus create real-like images for every class using the pretrained networks. Eventually, using a context aggregation network, we use these fake images as our training set, changing every certain epochs the amount of images of every class on-demand, depending on the evolution of the individual loss term of every class. As a result, we accomplished stable and robust segmentation and classification metrics, regardless of the amount of data available for training, using the NEU Micro surface defect database.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2023 6th International Conference on Machine Vision and Applications

自引率

0.00%

发文量