{"title":"Improved Consistency in Price Negotiation Dialogue System Using Parameterized Action Space with Generative Adversarial Imitation Learning","authors":"Makoto Sato, T. Takagi","doi":"10.1109/ICICT58900.2023.00039","DOIUrl":null,"url":null,"abstract":"A price negotiation dialogue system should not only gain profit but also pay attention to the negotiation process, such as the consistency of price proposals. In particular, when using a parameterized action space to make price proposals as a continuous action, the proposals can be more flexible but potentially inconsistent. In this study, we propose introducing Generative Adversarial Imitation Learning (GAIL) to price negotiation dialogues with a parameterized action space. To the best of our knowledge, this is the first case study to introduce GAIL in parameterized action space. In addition, we work on extending the dialogue act to maintain consistency, and on combining parameteried action reinforcement learning(RL) and GAIL by using Multi-Critic. The proposed method is applied to the CRAIGSLISTBARGAIN negotiation task, which is a practical negotiation task and is trained in a multi-agent format as a seller and buyer without using a simulator, and is evaluated by interacting with agents that combine supervised learning and rule-based methods. The results show that GAIL can reduce price inconsistencies better than RL with the designed reward function for consistency maintenance or the combination of RL and behavior cloning. Furthermore, we confirmed that the combined method of RL with the designed reward function and GAIL can reduce price inconsistencies the most and also enhance profit-seeking abilities.","PeriodicalId":425057,"journal":{"name":"2023 6th International Conference on Information and Computer Technologies (ICICT)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 6th International Conference on Information and Computer Technologies (ICICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICT58900.2023.00039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A price negotiation dialogue system should not only gain profit but also pay attention to the negotiation process, such as the consistency of price proposals. In particular, when using a parameterized action space to make price proposals as a continuous action, the proposals can be more flexible but potentially inconsistent. In this study, we propose introducing Generative Adversarial Imitation Learning (GAIL) to price negotiation dialogues with a parameterized action space. To the best of our knowledge, this is the first case study to introduce GAIL in parameterized action space. In addition, we work on extending the dialogue act to maintain consistency, and on combining parameteried action reinforcement learning(RL) and GAIL by using Multi-Critic. The proposed method is applied to the CRAIGSLISTBARGAIN negotiation task, which is a practical negotiation task and is trained in a multi-agent format as a seller and buyer without using a simulator, and is evaluated by interacting with agents that combine supervised learning and rule-based methods. The results show that GAIL can reduce price inconsistencies better than RL with the designed reward function for consistency maintenance or the combination of RL and behavior cloning. Furthermore, we confirmed that the combined method of RL with the designed reward function and GAIL can reduce price inconsistencies the most and also enhance profit-seeking abilities.