{"title":"使用StarGAN和CBIN的条件驱动环境转换","authors":"Rina Komatsu, Keisuke Yamazaki","doi":"10.1145/3508259.3508267","DOIUrl":null,"url":null,"abstract":"Automatic driving without human's control requires a lot of training to be able to adopt any situation. About situation, the environment at driving has the variety such as dark at night, wet road because of rain and the stack of snow. Preparing the drive image dataset with a lot of environment situations is the difficult task in collecting. This study tried solving the preparing problem by constructing deep learning model with limited data and translating single image to multi environment situations. For accomplishing this subject, we employed N domains translation model called StarGAN. In this paper, we investigated the StarGAN with enough conditional translation performance through comparing visualized results and FID score. We trained StarGANs including original to translate single image to 6 kinds of drive environment domain: daytime & clear, daytime & rainy, daytime & snowy, night & clear, night & rainy and night & snowy. Through experiments, we found the StarGAN employing “CBIN: Central Biasing Instance Normalization” and “AdaLIN: Adaptive Layer-Instance Normalization” at Generator, and “the adversarial loss of CAM Logit” at Discriminator could mark the lower FID score than original StarGAN.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Conditional Drive Environment Translation using StarGAN with CBIN\",\"authors\":\"Rina Komatsu, Keisuke Yamazaki\",\"doi\":\"10.1145/3508259.3508267\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic driving without human's control requires a lot of training to be able to adopt any situation. About situation, the environment at driving has the variety such as dark at night, wet road because of rain and the stack of snow. Preparing the drive image dataset with a lot of environment situations is the difficult task in collecting. This study tried solving the preparing problem by constructing deep learning model with limited data and translating single image to multi environment situations. For accomplishing this subject, we employed N domains translation model called StarGAN. In this paper, we investigated the StarGAN with enough conditional translation performance through comparing visualized results and FID score. We trained StarGANs including original to translate single image to 6 kinds of drive environment domain: daytime & clear, daytime & rainy, daytime & snowy, night & clear, night & rainy and night & snowy. Through experiments, we found the StarGAN employing “CBIN: Central Biasing Instance Normalization” and “AdaLIN: Adaptive Layer-Instance Normalization” at Generator, and “the adversarial loss of CAM Logit” at Discriminator could mark the lower FID score than original StarGAN.\",\"PeriodicalId\":259099,\"journal\":{\"name\":\"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3508259.3508267\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3508259.3508267","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Conditional Drive Environment Translation using StarGAN with CBIN
Automatic driving without human's control requires a lot of training to be able to adopt any situation. About situation, the environment at driving has the variety such as dark at night, wet road because of rain and the stack of snow. Preparing the drive image dataset with a lot of environment situations is the difficult task in collecting. This study tried solving the preparing problem by constructing deep learning model with limited data and translating single image to multi environment situations. For accomplishing this subject, we employed N domains translation model called StarGAN. In this paper, we investigated the StarGAN with enough conditional translation performance through comparing visualized results and FID score. We trained StarGANs including original to translate single image to 6 kinds of drive environment domain: daytime & clear, daytime & rainy, daytime & snowy, night & clear, night & rainy and night & snowy. Through experiments, we found the StarGAN employing “CBIN: Central Biasing Instance Normalization” and “AdaLIN: Adaptive Layer-Instance Normalization” at Generator, and “the adversarial loss of CAM Logit” at Discriminator could mark the lower FID score than original StarGAN.