{"title":"Research on Cartoon Face Generation Based on CycleGAN Assisted with Facial Landmarks","authors":"Keyi Ma, Xiaohong Wang","doi":"10.1109/ICCSI55536.2022.9970645","DOIUrl":null,"url":null,"abstract":"Turn real faces into cartoon faces is a topic of style transfer, and style transfer is a hot topic in the application of generative adversarial networks in image. CycleGAN is one of generative adversarial networks. It has obvious universal applicability, and has a good transformation effect on various types of style transfer. But to the facial style transfer, it only focuses on the transformation of the whole face, and it is not ideal for the transformation of the details of the facial features. How can this situation be improved? In this paper, we use facial landmarks to assist the transformation of facial features. In the beginning, we use stacked hourglass networks to detection and capture landmarks of real faces. And then, use them to assist cartoon faces generation. In view of the fact that the hourglass network has its own advantages in feature extraction, we use it to replace the generator structure of the original CycleGAN for transformation. And in order to avoid the Checkerboard Artifacts and ensure the quality of image generation, we use bilinear interpolation in the upsampling part of the generator to replace the deconvolution of the original generator and the nearest interpolation of the hourglass network. Experiments show that these practices have good results in optimizing conversion performance and improving image quality.","PeriodicalId":421514,"journal":{"name":"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSI55536.2022.9970645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Turn real faces into cartoon faces is a topic of style transfer, and style transfer is a hot topic in the application of generative adversarial networks in image. CycleGAN is one of generative adversarial networks. It has obvious universal applicability, and has a good transformation effect on various types of style transfer. But to the facial style transfer, it only focuses on the transformation of the whole face, and it is not ideal for the transformation of the details of the facial features. How can this situation be improved? In this paper, we use facial landmarks to assist the transformation of facial features. In the beginning, we use stacked hourglass networks to detection and capture landmarks of real faces. And then, use them to assist cartoon faces generation. In view of the fact that the hourglass network has its own advantages in feature extraction, we use it to replace the generator structure of the original CycleGAN for transformation. And in order to avoid the Checkerboard Artifacts and ensure the quality of image generation, we use bilinear interpolation in the upsampling part of the generator to replace the deconvolution of the original generator and the nearest interpolation of the hourglass network. Experiments show that these practices have good results in optimizing conversion performance and improving image quality.