Applying SAQ-Learning Algorithm for Trading Agents in Bilateral Bargaining

S. Jamali, K. Faez
{"title":"Applying SAQ-Learning Algorithm for Trading Agents in Bilateral Bargaining","authors":"S. Jamali, K. Faez","doi":"10.1109/UKSim.2012.39","DOIUrl":null,"url":null,"abstract":"In this research we use a learning method called SAQ-Learning to use for agents in a single-issue bargaining process. SAQ-Learning algorithm is an improved version of Q-Learning algorithm that benefits from the Metropolis criterion of Simulated Annealing (SA) algorithm to overcome the challenge of finding a balance between exploration and exploitation. Q-Learning is one the most important types of Reinforcement Learning (RL) because of the fact that it does not need the transition model of the environment. Artificial Intelligence (AI) approaches have attracted interest in solving bargaining problem. This is because Game Theory (GT) needs some unrealistic assumptions to solve bargaining problem. Presence of perfectly rational agents is an example of these assumptions. Therefore by designing SAQ-Learning agents to bargain with each other over price, we gained higher performance in case of settlement rate, average payoff, and the time an agent needs to find his optimal policy. This learning method can be a suitable learning algorithm for automated online bargaining agents in e-commerce.","PeriodicalId":405479,"journal":{"name":"2012 UKSim 14th International Conference on Computer Modelling and Simulation","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 UKSim 14th International Conference on Computer Modelling and Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UKSim.2012.39","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In this research we use a learning method called SAQ-Learning to use for agents in a single-issue bargaining process. SAQ-Learning algorithm is an improved version of Q-Learning algorithm that benefits from the Metropolis criterion of Simulated Annealing (SA) algorithm to overcome the challenge of finding a balance between exploration and exploitation. Q-Learning is one the most important types of Reinforcement Learning (RL) because of the fact that it does not need the transition model of the environment. Artificial Intelligence (AI) approaches have attracted interest in solving bargaining problem. This is because Game Theory (GT) needs some unrealistic assumptions to solve bargaining problem. Presence of perfectly rational agents is an example of these assumptions. Therefore by designing SAQ-Learning agents to bargain with each other over price, we gained higher performance in case of settlement rate, average payoff, and the time an agent needs to find his optimal policy. This learning method can be a suitable learning algorithm for automated online bargaining agents in e-commerce.
交易主体在双边议价中的应用
在本研究中,我们使用了一种称为SAQ-Learning的学习方法,用于代理在单一问题的讨价还价过程中。SAQ-Learning算法是Q-Learning算法的改进版本,它利用了模拟退火算法的Metropolis准则来克服在探索和利用之间寻找平衡的挑战。Q-Learning是强化学习(RL)最重要的类型之一,因为它不需要环境的过渡模型。人工智能(AI)方法已引起人们对解决议价问题的兴趣。这是因为博弈论(GT)需要一些不切实际的假设来解决议价问题。完全理性主体的存在就是这些假设的一个例子。因此,通过设计SAQ-Learning agent进行价格讨价还价,我们在结算率、平均收益和agent找到最优策略所需时间的情况下获得了更高的性能。这种学习方法可以作为电子商务中自动在线议价代理的一种合适的学习算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信