Application of Deep Convolutional Generative Adversarial Networks to Generate Pose Invariant Facial Image Synthesis Data

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Pub Date : 2023-09-30 DOI:10.29207/resti.v7i5.5112

None Jagad Nabil Tuah Imanda, Fitra Bachtiar, None Achmad Ridok

{"title":"Application of Deep Convolutional Generative Adversarial Networks to Generate Pose Invariant Facial Image Synthesis Data","authors":"None Jagad Nabil Tuah Imanda, Fitra Bachtiar, None Achmad Ridok","doi":"10.29207/resti.v7i5.5112","DOIUrl":null,"url":null,"abstract":"The field of technology is currently developing rapidly, one of the developments is artificial intelligence. Artificial intelligence can still find it difficult to solve problems that are easy for humans to do but difficult for computers to describe, such as facial recognition. There are still several problems related to the existing facial recognition model, namely, the facial recognition model is still unable to recognize facial shapes that are not in a perfect state due to several factors such as face position, lighting, expression, and obstacles covering the face. Among these several factors, the most influencing factor is the position of the face. Therefore, in this study, deep convolutional generative adversarial networks (DCGANs) will be applied to generate fake image data with varying face positions. This research will be carried out starting from collecting data, processing data, designing and training models, hyperparameter tuning, and lastly analyzing test results. Based on the results of hyperparameter tuning that were performed sequentially, the best hyperparameter combination produced is 200 epoch, 0.002 Generator learning rate, 0.5 Generator momentum/beta1, Adam as Generator optimizer, 0.0002 Discriminator learning rate, 0.5 Discriminator momentum/beta1, and Adam as Discriminator optimizer. The combination of hyperparameters gives a result with an FID score of 74.05. Based on testing with human observers, generated fake images have relatively good results, but there are still few bad fake image results","PeriodicalId":435683,"journal":{"name":"Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29207/resti.v7i5.5112","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The field of technology is currently developing rapidly, one of the developments is artificial intelligence. Artificial intelligence can still find it difficult to solve problems that are easy for humans to do but difficult for computers to describe, such as facial recognition. There are still several problems related to the existing facial recognition model, namely, the facial recognition model is still unable to recognize facial shapes that are not in a perfect state due to several factors such as face position, lighting, expression, and obstacles covering the face. Among these several factors, the most influencing factor is the position of the face. Therefore, in this study, deep convolutional generative adversarial networks (DCGANs) will be applied to generate fake image data with varying face positions. This research will be carried out starting from collecting data, processing data, designing and training models, hyperparameter tuning, and lastly analyzing test results. Based on the results of hyperparameter tuning that were performed sequentially, the best hyperparameter combination produced is 200 epoch, 0.002 Generator learning rate, 0.5 Generator momentum/beta1, Adam as Generator optimizer, 0.0002 Discriminator learning rate, 0.5 Discriminator momentum/beta1, and Adam as Discriminator optimizer. The combination of hyperparameters gives a result with an FID score of 74.05. Based on testing with human observers, generated fake images have relatively good results, but there are still few bad fake image results

查看原文本刊更多论文

深度卷积生成对抗网络在姿态不变人脸图像合成数据中的应用

技术领域目前发展迅速，其中一个发展是人工智能。人工智能仍然很难解决那些对人类来说很容易但对计算机来说很难描述的问题，比如面部识别。现有的人脸识别模型仍然存在几个问题，即由于人脸位置、光照、表情、遮挡人脸的障碍物等多种因素，人脸识别模型仍然无法识别非完美状态的人脸形状。在这几个因素中，影响最大的因素是脸部的位置。因此，在本研究中，将应用深度卷积生成对抗网络(dcgan)来生成具有不同面部位置的假图像数据。本研究将从收集数据、处理数据、设计和训练模型、超参数调优、最后分析测试结果开始。根据依次进行的超参数调优结果，产生的最佳超参数组合为200 epoch、0.002 Generator学习率、0.5 Generator动量/beta1、Adam作为Generator优化器、0.0002 Discriminator学习率、0.5 Discriminator动量/beta1、Adam作为Discriminator优化器。超参数的组合给出了一个FID评分为74.05的结果。基于人类观察者的测试，生成的假图像效果相对较好，但仍有少数假图像效果较差

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)

自引率

0.00%

发文量