{"title":"Multi-turn dialogue generation considering speech information of each speaker by adding recurrent neural networks","authors":"Takamune Onishi, Hiromitsu Shiina","doi":"10.1109/iiai-aai53430.2021.00093","DOIUrl":null,"url":null,"abstract":"A dialogue generation method using neural networks (NNs) has been proposed. The HRED model is a model for multi-turn dialogues by creating a hierarchical structure by layering several encoder-decoder models. Furthermore, the VHRED model generates a variety of responses by adding latent variables. However, since these models do not consider the user who has spoken, they generate inconsistent responses in the same dialogue, which is a problem. In this study, instead of using the user's embedding vector, we add a user recurrent NN (User-RNN) to retain the speech information of each speaker and generate consistent responses.","PeriodicalId":414070,"journal":{"name":"2021 10th International Congress on Advanced Applied Informatics (IIAI-AAI)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 10th International Congress on Advanced Applied Informatics (IIAI-AAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iiai-aai53430.2021.00093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A dialogue generation method using neural networks (NNs) has been proposed. The HRED model is a model for multi-turn dialogues by creating a hierarchical structure by layering several encoder-decoder models. Furthermore, the VHRED model generates a variety of responses by adding latent variables. However, since these models do not consider the user who has spoken, they generate inconsistent responses in the same dialogue, which is a problem. In this study, instead of using the user's embedding vector, we add a user recurrent NN (User-RNN) to retain the speech information of each speaker and generate consistent responses.