{"title":"The effectiveness of GAN-based synthetic-to-real domain adaptation methods in training a wood ear mushroom detection model","authors":"Hajime Taguchi, Ryo Matsumura, Hironori Kitakaze","doi":"10.1007/s10015-024-00998-9","DOIUrl":null,"url":null,"abstract":"<div><p>In this study, we verify whether computer-generated imagery (CGI) for real image domain adaptation using a generative adversarial network (GAN) based method can improve the accuracy of wood ear mushroom detection. Automated harvesting is important to address labor shortages in agriculture. However, a significant amount of training data is needed to detect crops with features such as wood ear mushroom. In our previous work, a slight improvement in accuracy was achieved with a small number of real-world images by incorporating CGI into the training data. Nevertheless, the reality gap between the CGI and real-world images is expected to degrade detection accuracy. GAN-based methods are effective in bridging the gap, and this study validates this approach. As a result of the experiment, the accuracy improved by approximately 6% in the F2-score between the CGI and GAN-generated datasets. However, there was no significant difference in detection accuracy when training with a combination of real-world and generated image data. In conclusion, this experiment suggests that data augmentation with generated images is not effective for this specific detection target. Nevertheless, this approach should be explored further by generating data on a larger scale with greater computational resources.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 2","pages":"310 - 316"},"PeriodicalIF":0.8000,"publicationDate":"2025-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Life and Robotics","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s10015-024-00998-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0
Abstract
In this study, we verify whether computer-generated imagery (CGI) for real image domain adaptation using a generative adversarial network (GAN) based method can improve the accuracy of wood ear mushroom detection. Automated harvesting is important to address labor shortages in agriculture. However, a significant amount of training data is needed to detect crops with features such as wood ear mushroom. In our previous work, a slight improvement in accuracy was achieved with a small number of real-world images by incorporating CGI into the training data. Nevertheless, the reality gap between the CGI and real-world images is expected to degrade detection accuracy. GAN-based methods are effective in bridging the gap, and this study validates this approach. As a result of the experiment, the accuracy improved by approximately 6% in the F2-score between the CGI and GAN-generated datasets. However, there was no significant difference in detection accuracy when training with a combination of real-world and generated image data. In conclusion, this experiment suggests that data augmentation with generated images is not effective for this specific detection target. Nevertheless, this approach should be explored further by generating data on a larger scale with greater computational resources.