{"title":"使用生成对抗网络和深度强化学习从真实物体图像生成草图","authors":"Shintya Rezky Rahmayanti, C. Fatichah, N. Suciati","doi":"10.1109/ICTS52701.2021.9608634","DOIUrl":null,"url":null,"abstract":"Technology in Robotics and machine learning have been applied in numerous fields including the arts. Paul The Robot is able to draw sketches from human faces using the conventional convolution filter method. Generative Adversarial Network (GAN) has been successful in generating synthetic images. Researches in sketch generation have been conducted either by using Recurrent Neural Network (RNN) or by using Deep Reinforcement Learning, with step-by-step stroke drawing. This research proposes a system to generate sketches from real object images using GAN dan Deep Reinforcement Learning. The training framework used is based on Doodle-SDQ (Doodle with Stroke Demonstration and Deep Q-Network) that combines supervised learning and reinforcement learning. Real object images are converted into contour images by GAN to be the reference images by the reinforcement learning agent to generate the sketch. The experiment is done by modifying pooling layers during the supervised learning stage and rare exploration scenarios during the reinforcement learning stage. The result of this research is a model that can reach an average total reward of 2558.98 with an average pixel error of 0.0489 using 200 as the maximum step in an average time of 3.29 seconds for the sketch generation.","PeriodicalId":6738,"journal":{"name":"2021 13th International Conference on Information & Communication Technology and System (ICTS)","volume":"114 1","pages":"134-139"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sketch Generation From Real Object Images Using Generative Adversarial Network and Deep Reinforcement Learning\",\"authors\":\"Shintya Rezky Rahmayanti, C. Fatichah, N. Suciati\",\"doi\":\"10.1109/ICTS52701.2021.9608634\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Technology in Robotics and machine learning have been applied in numerous fields including the arts. Paul The Robot is able to draw sketches from human faces using the conventional convolution filter method. Generative Adversarial Network (GAN) has been successful in generating synthetic images. Researches in sketch generation have been conducted either by using Recurrent Neural Network (RNN) or by using Deep Reinforcement Learning, with step-by-step stroke drawing. This research proposes a system to generate sketches from real object images using GAN dan Deep Reinforcement Learning. The training framework used is based on Doodle-SDQ (Doodle with Stroke Demonstration and Deep Q-Network) that combines supervised learning and reinforcement learning. Real object images are converted into contour images by GAN to be the reference images by the reinforcement learning agent to generate the sketch. The experiment is done by modifying pooling layers during the supervised learning stage and rare exploration scenarios during the reinforcement learning stage. The result of this research is a model that can reach an average total reward of 2558.98 with an average pixel error of 0.0489 using 200 as the maximum step in an average time of 3.29 seconds for the sketch generation.\",\"PeriodicalId\":6738,\"journal\":{\"name\":\"2021 13th International Conference on Information & Communication Technology and System (ICTS)\",\"volume\":\"114 1\",\"pages\":\"134-139\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 13th International Conference on Information & Communication Technology and System (ICTS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTS52701.2021.9608634\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Information & Communication Technology and System (ICTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTS52701.2021.9608634","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
机器人技术和机器学习技术已经应用于包括艺术在内的许多领域。机器人保罗能够使用传统的卷积滤波方法从人脸上绘制草图。生成对抗网络(GAN)在生成合成图像方面取得了成功。草图生成的研究有两种,一种是使用递归神经网络(RNN),另一种是使用深度强化学习,逐步绘制笔画。本研究提出了一种利用GAN和深度强化学习从真实物体图像中生成草图的系统。使用的训练框架基于Doodle- sdq (Doodle with Stroke Demonstration and Deep Q-Network),结合了监督学习和强化学习。通过GAN将真实物体图像转换为轮廓图像,作为强化学习代理生成草图的参考图像。实验通过修改监督学习阶段的池化层和强化学习阶段的罕见探索场景来完成。本研究的结果是,在草图生成的平均时间为3.29秒的情况下,以200为最大步长,平均总奖励达到2558.98,平均像素误差为0.0489。
Sketch Generation From Real Object Images Using Generative Adversarial Network and Deep Reinforcement Learning
Technology in Robotics and machine learning have been applied in numerous fields including the arts. Paul The Robot is able to draw sketches from human faces using the conventional convolution filter method. Generative Adversarial Network (GAN) has been successful in generating synthetic images. Researches in sketch generation have been conducted either by using Recurrent Neural Network (RNN) or by using Deep Reinforcement Learning, with step-by-step stroke drawing. This research proposes a system to generate sketches from real object images using GAN dan Deep Reinforcement Learning. The training framework used is based on Doodle-SDQ (Doodle with Stroke Demonstration and Deep Q-Network) that combines supervised learning and reinforcement learning. Real object images are converted into contour images by GAN to be the reference images by the reinforcement learning agent to generate the sketch. The experiment is done by modifying pooling layers during the supervised learning stage and rare exploration scenarios during the reinforcement learning stage. The result of this research is a model that can reach an average total reward of 2558.98 with an average pixel error of 0.0489 using 200 as the maximum step in an average time of 3.29 seconds for the sketch generation.