用于潮汐流涡轮机转子图像数据集数据增强的基于域变量先验的多类型传输网络

IF 1.1 4区计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Pattern Recognition and Artificial Intelligence Pub Date : 2024-03-15 DOI:10.1142/s021800142454003x

Guohan Jiang, Tianzhen Wang, Dingding Yang, Jingyi You

{"title":"用于潮汐流涡轮机转子图像数据集数据增强的基于域变量先验的多类型传输网络","authors":"Guohan Jiang, Tianzhen Wang, Dingding Yang, Jingyi You","doi":"10.1142/s021800142454003x","DOIUrl":null,"url":null,"abstract":"<p>The style of the underwater images varies according to the region of the sea. However, Tidal Stream Turbine (TST) rotor images captured in the laboratory environment cannot reflect the real underwater environment in image style, resulting in poor generalization of image signal-based fault detection algorithms. Due to the fixed capture position of the camera, the TST rotor image dataset has a high semantic similarity between images, resulting in content loss in conventional image-to-image translation networks. Meanwhile, the one-to-one translation feature in other works cannot meet our requirements. In this work, a Domain Variable Prior-based Multi-style Transfer Network (DVP-MSTN) is proposed to achieve TST rotor image style augmentation. First, the backbone network is trained using a public paired dataset to acquire prior knowledge of domain variable (Knowledge Acquiring, KA). Next, a Multi-domain Transfer Unit (MDT unit) is introduced to enable the conversion of style representations in low-dimensional space. Finally, the prior knowledge is shared to train the MDT unit by fixing the parameters of the backbone network optimized from the KA process (Knowledge Sharing, KS). In addition, an algorithm based on the dark channel of the image is proposed to improve the transfer of low-contrast features. Specifically, a discriminator is used to discriminate the image dark channel to guide the MDT unit to generate low-contrast style representation conditionally. Meanwhile, color loss is employed to preserve the color feature of the image. By controlling the weights of the style code, this method enables control over the image style transfer process, thereby expanding the variety of image styles in the dataset for the purpose of data augmentation.</p>","PeriodicalId":54949,"journal":{"name":"International Journal of Pattern Recognition and Artificial Intelligence","volume":"213 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Domain Variable Prior Based Multi-Style Transfer Network for Data Augmentation of Tidal Stream Turbine Rotor Image Dataset\",\"authors\":\"Guohan Jiang, Tianzhen Wang, Dingding Yang, Jingyi You\",\"doi\":\"10.1142/s021800142454003x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The style of the underwater images varies according to the region of the sea. However, Tidal Stream Turbine (TST) rotor images captured in the laboratory environment cannot reflect the real underwater environment in image style, resulting in poor generalization of image signal-based fault detection algorithms. Due to the fixed capture position of the camera, the TST rotor image dataset has a high semantic similarity between images, resulting in content loss in conventional image-to-image translation networks. Meanwhile, the one-to-one translation feature in other works cannot meet our requirements. In this work, a Domain Variable Prior-based Multi-style Transfer Network (DVP-MSTN) is proposed to achieve TST rotor image style augmentation. First, the backbone network is trained using a public paired dataset to acquire prior knowledge of domain variable (Knowledge Acquiring, KA). Next, a Multi-domain Transfer Unit (MDT unit) is introduced to enable the conversion of style representations in low-dimensional space. Finally, the prior knowledge is shared to train the MDT unit by fixing the parameters of the backbone network optimized from the KA process (Knowledge Sharing, KS). In addition, an algorithm based on the dark channel of the image is proposed to improve the transfer of low-contrast features. Specifically, a discriminator is used to discriminate the image dark channel to guide the MDT unit to generate low-contrast style representation conditionally. Meanwhile, color loss is employed to preserve the color feature of the image. By controlling the weights of the style code, this method enables control over the image style transfer process, thereby expanding the variety of image styles in the dataset for the purpose of data augmentation.</p>\",\"PeriodicalId\":54949,\"journal\":{\"name\":\"International Journal of Pattern Recognition and Artificial Intelligence\",\"volume\":\"213 1\",\"pages\":\"\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2024-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Pattern Recognition and Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1142/s021800142454003x\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Pattern Recognition and Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1142/s021800142454003x","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

不同海域的水下图像风格各异。然而，在实验室环境中拍摄的潮汐流涡轮机（TST）转子图像无法反映真实水下环境的图像风格，导致基于图像信号的故障检测算法的普适性较差。由于摄像机的拍摄位置固定，TST 转子图像数据集的图像之间具有较高的语义相似性，导致传统图像到图像转换网络的内容丢失。同时，其他工作中的一对一翻译功能无法满足我们的要求。本研究提出了一种基于域变量先验的多风格转换网络（DVP-MSTN）来实现 TST 转子图像风格增强。首先，利用公共配对数据集对骨干网络进行训练，以获取领域变量的先验知识（知识获取，KA）。然后，引入多域传输单元（MDT 单元），实现低维空间中风格表征的转换。最后，通过固定 KA 过程中优化的骨干网络参数，共享先验知识以训练 MDT 单元（知识共享，KS）。此外，还提出了一种基于图像暗通道的算法，以改善低对比度特征的传输。具体来说，使用一个判别器来判别图像的暗色通道，从而引导 MDT 单元有条件地生成低对比度样式表示。同时，采用颜色损失来保留图像的颜色特征。通过控制风格代码的权重，这种方法可以控制图像风格的转换过程，从而扩大数据集中的图像风格种类，达到数据扩增的目的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Domain Variable Prior Based Multi-Style Transfer Network for Data Augmentation of Tidal Stream Turbine Rotor Image Dataset

The style of the underwater images varies according to the region of the sea. However, Tidal Stream Turbine (TST) rotor images captured in the laboratory environment cannot reflect the real underwater environment in image style, resulting in poor generalization of image signal-based fault detection algorithms. Due to the fixed capture position of the camera, the TST rotor image dataset has a high semantic similarity between images, resulting in content loss in conventional image-to-image translation networks. Meanwhile, the one-to-one translation feature in other works cannot meet our requirements. In this work, a Domain Variable Prior-based Multi-style Transfer Network (DVP-MSTN) is proposed to achieve TST rotor image style augmentation. First, the backbone network is trained using a public paired dataset to acquire prior knowledge of domain variable (Knowledge Acquiring, KA). Next, a Multi-domain Transfer Unit (MDT unit) is introduced to enable the conversion of style representations in low-dimensional space. Finally, the prior knowledge is shared to train the MDT unit by fixing the parameters of the backbone network optimized from the KA process (Knowledge Sharing, KS). In addition, an algorithm based on the dark channel of the image is proposed to improve the transfer of low-contrast features. Specifically, a discriminator is used to discriminate the image dark channel to guide the MDT unit to generate low-contrast style representation conditionally. Meanwhile, color loss is employed to preserve the color feature of the image. By controlling the weights of the style code, this method enables control over the image style transfer process, thereby expanding the variety of image styles in the dataset for the purpose of data augmentation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Pattern Recognition and Artificial Intelligence 工程技术-计算机：人工智能

CiteScore

2.90

自引率

13.30%

发文量

201

审稿时长

15.8 months

期刊介绍： The International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI) welcomes both theory-oriented and innovative applications articles on new developments and is of interest to both researchers in academia and industry. The current scope of this journal includes: • Pattern Recognition • Machine Learning • Deep Learning • Document Analysis • Image Processing • Signal Processing • Computer Vision • Biometrics • Biomedical Image Analysis • Artificial Intelligence In addition to regular papers describing original research work, survey articles on timely and important research topics are highly welcome. Special issues with focused topics within the scope of this journal are also published.