H. Kato, Takatsugu Hirayama, Keisuke Doman, I. Ide, Yasutomo Kawanishi, Daisuke Deguchi, H. Murase
{"title":"基于声音象征主义的拟声词步态生成","authors":"H. Kato, Takatsugu Hirayama, Keisuke Doman, I. Ide, Yasutomo Kawanishi, Daisuke Deguchi, H. Murase","doi":"10.1527/TJSAI.36-5_D-KC7","DOIUrl":null,"url":null,"abstract":"The Japanese language is known to have a rich vocabulary of mimetic words, which have the property of sound symbolism; Phonemes that compose the mimetic words are strongly related to the impression of various phenomena. Especially, human gait is one of the most commonly represented phenomena by mimetic words expressing its visually dynamic state. Sound symbolism is useful for modeling the relation between gaits and mimetic words intuitively, but there has been no study on their intuitive generation. Most previous gait generation methods set specific class labels such as “elderly” but have not considered the intuitiveness of the generation model. Thus, in this paper, we propose a framework to generate gaits from a mimetic word based on sound symbolism. This framework enables us to generate gaits from one or more mimetic words. It leads to the construction of a generation model represented in a continuous feature space, which is similar to human intuition. Concretely, we train an encoder-decoder model conditioned by a “phonetic vector”, a quantitive representation of mimetic words, with an adaptive instance normalization module inspired by style transfer. The phonetic vector is a dense description of the intuitive impression of a corresponding gait and is calculated from many mimetic words in the HOYO dataset, which includes gait motion data and corresponding mimetic word annotations. Through experiments, we confirmed the effectiveness of the proposed framework.","PeriodicalId":23256,"journal":{"name":"Transactions of The Japanese Society for Artificial Intelligence","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Gaits Generation from a Mimetic Word based on Sound Symbolism\",\"authors\":\"H. Kato, Takatsugu Hirayama, Keisuke Doman, I. Ide, Yasutomo Kawanishi, Daisuke Deguchi, H. Murase\",\"doi\":\"10.1527/TJSAI.36-5_D-KC7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Japanese language is known to have a rich vocabulary of mimetic words, which have the property of sound symbolism; Phonemes that compose the mimetic words are strongly related to the impression of various phenomena. Especially, human gait is one of the most commonly represented phenomena by mimetic words expressing its visually dynamic state. Sound symbolism is useful for modeling the relation between gaits and mimetic words intuitively, but there has been no study on their intuitive generation. Most previous gait generation methods set specific class labels such as “elderly” but have not considered the intuitiveness of the generation model. Thus, in this paper, we propose a framework to generate gaits from a mimetic word based on sound symbolism. This framework enables us to generate gaits from one or more mimetic words. It leads to the construction of a generation model represented in a continuous feature space, which is similar to human intuition. Concretely, we train an encoder-decoder model conditioned by a “phonetic vector”, a quantitive representation of mimetic words, with an adaptive instance normalization module inspired by style transfer. The phonetic vector is a dense description of the intuitive impression of a corresponding gait and is calculated from many mimetic words in the HOYO dataset, which includes gait motion data and corresponding mimetic word annotations. Through experiments, we confirmed the effectiveness of the proposed framework.\",\"PeriodicalId\":23256,\"journal\":{\"name\":\"Transactions of The Japanese Society for Artificial Intelligence\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transactions of The Japanese Society for Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1527/TJSAI.36-5_D-KC7\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transactions of The Japanese Society for Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1527/TJSAI.36-5_D-KC7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Gaits Generation from a Mimetic Word based on Sound Symbolism
The Japanese language is known to have a rich vocabulary of mimetic words, which have the property of sound symbolism; Phonemes that compose the mimetic words are strongly related to the impression of various phenomena. Especially, human gait is one of the most commonly represented phenomena by mimetic words expressing its visually dynamic state. Sound symbolism is useful for modeling the relation between gaits and mimetic words intuitively, but there has been no study on their intuitive generation. Most previous gait generation methods set specific class labels such as “elderly” but have not considered the intuitiveness of the generation model. Thus, in this paper, we propose a framework to generate gaits from a mimetic word based on sound symbolism. This framework enables us to generate gaits from one or more mimetic words. It leads to the construction of a generation model represented in a continuous feature space, which is similar to human intuition. Concretely, we train an encoder-decoder model conditioned by a “phonetic vector”, a quantitive representation of mimetic words, with an adaptive instance normalization module inspired by style transfer. The phonetic vector is a dense description of the intuitive impression of a corresponding gait and is calculated from many mimetic words in the HOYO dataset, which includes gait motion data and corresponding mimetic word annotations. Through experiments, we confirmed the effectiveness of the proposed framework.