Application of Deep Convolutional Generative Adversarial Networks to Generate Pose Invariant Facial Image Synthesis Data

None Jagad Nabil Tuah Imanda, Fitra Bachtiar, None Achmad Ridok
{"title":"Application of Deep Convolutional Generative Adversarial Networks to Generate Pose Invariant Facial Image Synthesis Data","authors":"None Jagad Nabil Tuah Imanda, Fitra Bachtiar, None Achmad Ridok","doi":"10.29207/resti.v7i5.5112","DOIUrl":null,"url":null,"abstract":"The field of technology is currently developing rapidly, one of the developments is artificial intelligence. Artificial intelligence can still find it difficult to solve problems that are easy for humans to do but difficult for computers to describe, such as facial recognition. There are still several problems related to the existing facial recognition model, namely, the facial recognition model is still unable to recognize facial shapes that are not in a perfect state due to several factors such as face position, lighting, expression, and obstacles covering the face. Among these several factors, the most influencing factor is the position of the face. Therefore, in this study, deep convolutional generative adversarial networks (DCGANs) will be applied to generate fake image data with varying face positions. This research will be carried out starting from collecting data, processing data, designing and training models, hyperparameter tuning, and lastly analyzing test results. Based on the results of hyperparameter tuning that were performed sequentially, the best hyperparameter combination produced is 200 epoch, 0.002 Generator learning rate, 0.5 Generator momentum/beta1, Adam as Generator optimizer, 0.0002 Discriminator learning rate, 0.5 Discriminator momentum/beta1, and Adam as Discriminator optimizer. The combination of hyperparameters gives a result with an FID score of 74.05. Based on testing with human observers, generated fake images have relatively good results, but there are still few bad fake image results","PeriodicalId":435683,"journal":{"name":"Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29207/resti.v7i5.5112","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The field of technology is currently developing rapidly, one of the developments is artificial intelligence. Artificial intelligence can still find it difficult to solve problems that are easy for humans to do but difficult for computers to describe, such as facial recognition. There are still several problems related to the existing facial recognition model, namely, the facial recognition model is still unable to recognize facial shapes that are not in a perfect state due to several factors such as face position, lighting, expression, and obstacles covering the face. Among these several factors, the most influencing factor is the position of the face. Therefore, in this study, deep convolutional generative adversarial networks (DCGANs) will be applied to generate fake image data with varying face positions. This research will be carried out starting from collecting data, processing data, designing and training models, hyperparameter tuning, and lastly analyzing test results. Based on the results of hyperparameter tuning that were performed sequentially, the best hyperparameter combination produced is 200 epoch, 0.002 Generator learning rate, 0.5 Generator momentum/beta1, Adam as Generator optimizer, 0.0002 Discriminator learning rate, 0.5 Discriminator momentum/beta1, and Adam as Discriminator optimizer. The combination of hyperparameters gives a result with an FID score of 74.05. Based on testing with human observers, generated fake images have relatively good results, but there are still few bad fake image results
深度卷积生成对抗网络在姿态不变人脸图像合成数据中的应用
技术领域目前发展迅速,其中一个发展是人工智能。人工智能仍然很难解决那些对人类来说很容易但对计算机来说很难描述的问题,比如面部识别。现有的人脸识别模型仍然存在几个问题,即由于人脸位置、光照、表情、遮挡人脸的障碍物等多种因素,人脸识别模型仍然无法识别非完美状态的人脸形状。在这几个因素中,影响最大的因素是脸部的位置。因此,在本研究中,将应用深度卷积生成对抗网络(dcgan)来生成具有不同面部位置的假图像数据。本研究将从收集数据、处理数据、设计和训练模型、超参数调优、最后分析测试结果开始。根据依次进行的超参数调优结果,产生的最佳超参数组合为200 epoch、0.002 Generator学习率、0.5 Generator动量/beta1、Adam作为Generator优化器、0.0002 Discriminator学习率、0.5 Discriminator动量/beta1、Adam作为Discriminator优化器。超参数的组合给出了一个FID评分为74.05的结果。基于人类观察者的测试,生成的假图像效果相对较好,但仍有少数假图像效果较差
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信