{"title":"改进了多回合对话框中响应生成的一致性","authors":"Takamune Onishi, Sakuei Onishi, Hiromitsu Shiina","doi":"10.1109/IIAIAAI55812.2022.00088","DOIUrl":null,"url":null,"abstract":"The consistency of response generation in multi-turn dialog must be improved. The existing VHDER model, that extends the RNN model to dialog generation, does not distinguish turns. Thus, in this study, we improve it by adding implementing a User-RNN that corresponds to a speaker who conveys information every odd and even turn. In addition, the global vibrational transformer model, which applies the CVAE, an advanced autoencoder, enables diverse response generation. However, latent variables sampled from the entire context are diluted by speaker characteristics, which degrades the consistency of generated responses. In this study, we employed speaker-specific latent variables to generate speaker-consistent responses.","PeriodicalId":156230,"journal":{"name":"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved Response Generation Consistency in Multiturn Dialog\",\"authors\":\"Takamune Onishi, Sakuei Onishi, Hiromitsu Shiina\",\"doi\":\"10.1109/IIAIAAI55812.2022.00088\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The consistency of response generation in multi-turn dialog must be improved. The existing VHDER model, that extends the RNN model to dialog generation, does not distinguish turns. Thus, in this study, we improve it by adding implementing a User-RNN that corresponds to a speaker who conveys information every odd and even turn. In addition, the global vibrational transformer model, which applies the CVAE, an advanced autoencoder, enables diverse response generation. However, latent variables sampled from the entire context are diluted by speaker characteristics, which degrades the consistency of generated responses. In this study, we employed speaker-specific latent variables to generate speaker-consistent responses.\",\"PeriodicalId\":156230,\"journal\":{\"name\":\"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IIAIAAI55812.2022.00088\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAIAAI55812.2022.00088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved Response Generation Consistency in Multiturn Dialog
The consistency of response generation in multi-turn dialog must be improved. The existing VHDER model, that extends the RNN model to dialog generation, does not distinguish turns. Thus, in this study, we improve it by adding implementing a User-RNN that corresponds to a speaker who conveys information every odd and even turn. In addition, the global vibrational transformer model, which applies the CVAE, an advanced autoencoder, enables diverse response generation. However, latent variables sampled from the entire context are diluted by speaker characteristics, which degrades the consistency of generated responses. In this study, we employed speaker-specific latent variables to generate speaker-consistent responses.