{"title":"Study on Feature Extraction Method from 2D Character Illustration based on Human’s Cognitive Characteristics for Automatic Voice Estimation","authors":"Noboru Omichi, Sho Ooi, Mutsuo Sano","doi":"10.1145/3508259.3508273","DOIUrl":null,"url":null,"abstract":"Humans can imagine an approximate voice from a human face, and some studies estimate and generate a voice from a human face. As research applying this, there is research to create a sound from a 2D illustration character. This study considers how a person imagines a voice from a face and examines a method for generating speech from a 2D illustration character. So far, we have verified which voice actor/actress resembles that character’s voice from an illustration of an unknown character by learning by associating a character with a voice actor. As a result, 2 out of 5 characters were judged correctly. We also conducted a questionnaire on where people look in the character’s illustrations to imagine their voices and found that their eyes and hair are powerful features. In consideration of the above results, this study attempts to acquire eye information from 2D illustrated characters. The system improved the extraction system by rotating the image by detecting landmarks to extract the character’s eyes. As a result, when the detection accuracy was verified, the result was 73.7","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3508259.3508273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
Humans can imagine an approximate voice from a human face, and several studies have attempted to estimate and generate a voice from a face image. Building on this idea, some research aims to generate a voice from a 2D illustrated character. This study considers how a person imagines a voice from a face and examines a method for generating speech from a 2D illustrated character. So far, we have trained a model that associates characters with voice actors and verified, given an illustration of an unknown character, which voice actor's or actress's voice most resembles that character's. As a result, 2 out of 5 characters were judged correctly. We also conducted a questionnaire on where people look in a character's illustration when imagining its voice, and found that the eyes and hair are the most informative features. In light of these results, this study attempts to acquire eye information from 2D illustrated characters. To extract the character's eyes, we improved the extraction system by detecting facial landmarks and rotating the image accordingly. When the detection accuracy was verified, the result was 73.7%.
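The abstract describes aligning the illustration by detecting landmarks and rotating the image before extracting the eye regions. The paper gives no code for this step; the following is a minimal sketch of that idea in Python with OpenCV, assuming eye-center landmarks have already been obtained from some anime-face landmark detector. The function name, crop size, and cropping strategy are illustrative assumptions, not the authors' implementation.

```python
import cv2
import numpy as np

def align_and_crop_eyes(image, left_eye, right_eye, crop_size=(64, 32)):
    """Rotate the illustration so the eyes lie on a horizontal line,
    then crop a fixed-size patch around each eye center.

    image     : BGR image as a NumPy array of shape (H, W, 3)
    left_eye  : (x, y) center of the character's left eye (from a landmark detector)
    right_eye : (x, y) center of the character's right eye
    crop_size : (width, height) of each cropped eye patch -- an assumed value
    """
    (lx, ly), (rx, ry) = left_eye, right_eye

    # Angle of the line joining the two eye centers (degrees).
    angle = np.degrees(np.arctan2(ry - ly, rx - lx))

    # Rotate about the midpoint between the eyes so both eyes stay in frame.
    center = ((lx + rx) / 2.0, (ly + ry) / 2.0)
    rot = cv2.getRotationMatrix2D(center, angle, 1.0)
    h, w = image.shape[:2]
    aligned = cv2.warpAffine(image, rot, (w, h), flags=cv2.INTER_LINEAR)

    def crop_at(pt):
        # Map the original eye center into the rotated image, then crop around it.
        x, y = rot @ np.array([pt[0], pt[1], 1.0])
        cw, ch = crop_size
        x0, y0 = int(x - cw / 2), int(y - ch / 2)
        return aligned[max(y0, 0):y0 + ch, max(x0, 0):x0 + cw]

    return crop_at(left_eye), crop_at(right_eye)
```

After alignment, the two returned patches could be fed to whatever feature extractor or classifier is used downstream; rotating first means the eye crops have a consistent orientation regardless of how the character's head is tilted in the illustration.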