{"title":"Improved Response Generation Consistency in Multiturn Dialog","authors":"Takamune Onishi, Sakuei Onishi, Hiromitsu Shiina","doi":"10.1109/IIAIAAI55812.2022.00088","DOIUrl":null,"url":null,"abstract":"The consistency of response generation in multi-turn dialog must be improved. The existing VHDER model, that extends the RNN model to dialog generation, does not distinguish turns. Thus, in this study, we improve it by adding implementing a User-RNN that corresponds to a speaker who conveys information every odd and even turn. In addition, the global vibrational transformer model, which applies the CVAE, an advanced autoencoder, enables diverse response generation. However, latent variables sampled from the entire context are diluted by speaker characteristics, which degrades the consistency of generated responses. In this study, we employed speaker-specific latent variables to generate speaker-consistent responses.","PeriodicalId":156230,"journal":{"name":"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAIAAI55812.2022.00088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The consistency of response generation in multi-turn dialog must be improved. The existing VHDER model, that extends the RNN model to dialog generation, does not distinguish turns. Thus, in this study, we improve it by adding implementing a User-RNN that corresponds to a speaker who conveys information every odd and even turn. In addition, the global vibrational transformer model, which applies the CVAE, an advanced autoencoder, enables diverse response generation. However, latent variables sampled from the entire context are diluted by speaker characteristics, which degrades the consistency of generated responses. In this study, we employed speaker-specific latent variables to generate speaker-consistent responses.