Comparing Epsilon Greedy and Thompson Sampling model for Multi-Armed Bandit algorithm on Marketing Dataset
Authors: Izzatul Umami, Lailia Rahmawati
Journal of Applied Data Sciences, published 2021-04-23
DOI: 10.47738/jads.v2i2.28 (https://doi.org/10.47738/jads.v2i2.28)
Citations: 7
Abstract
A/B testing is a routine step in many marketing processes at e-commerce companies. Through well-designed A/B experiments, advertisers can learn when and how marketing efforts can be maximized and which promotions to drive. Although many algorithms for this problem are theoretically well developed, empirical confirmation is typically limited. In practical terms, standard A/B experimentation yields less revenue than more advanced machine learning methods. This paper presents a thorough empirical study of the most popular multi-armed bandit algorithms. Three important observations can be made from our results. First, simple heuristics such as Epsilon Greedy and Thompson Sampling outperform theoretically sound algorithms in most settings by a significant margin. This report reviews the state of A/B testing and describes and compares some typical learning algorithms (multi-armed bandits) used to optimize it. We found that Epsilon Greedy is a clear winner for optimizing payouts in this setting.
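To make the two strategies named in the abstract concrete, the following is a minimal sketch of each on simulated Bernoulli arms (for example, "converted" versus "did not convert"). It is not the authors' implementation and does not use their marketing dataset. The function names, the default `epsilon=0.1`, the uniform Beta(1, 1) prior, and the conversion rates passed in are all assumptions made for illustration.

```python
import random


def epsilon_greedy(true_rates, n_rounds, epsilon=0.1, seed=0):
    """Epsilon Greedy on Bernoulli arms (illustrative sketch).

    true_rates: hypothetical per-arm success probabilities, unknown to the agent.
    Returns (total_reward, pulls_per_arm).
    """
    rng = random.Random(seed)
    n_arms = len(true_rates)
    pulls = [0] * n_arms
    wins = [0] * n_arms
    total = 0
    for _ in range(n_rounds):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the best observed success rate.
            # Arms never pulled are scored 0.0; ties go to the lowest index.
            means = [wins[i] / pulls[i] if pulls[i] else 0.0 for i in range(n_arms)]
            arm = max(range(n_arms), key=means.__getitem__)
        reward = 1 if rng.random() < true_rates[arm] else 0
        pulls[arm] += 1
        wins[arm] += reward
        total += reward
    return total, pulls


def thompson_sampling(true_rates, n_rounds, seed=0):
    """Thompson Sampling with a Beta(1, 1) prior on each arm (illustrative sketch).

    Returns (total_reward, pulls_per_arm).
    """
    rng = random.Random(seed)
    n_arms = len(true_rates)
    wins = [0] * n_arms
    losses = [0] * n_arms
    pulls = [0] * n_arms
    total = 0
    for _ in range(n_rounds):
        # Draw one sample from each arm's Beta posterior and play the largest.
        samples = [rng.betavariate(wins[i] + 1, losses[i] + 1) for i in range(n_arms)]
        arm = max(range(n_arms), key=samples.__getitem__)
        reward = 1 if rng.random() < true_rates[arm] else 0
        pulls[arm] += 1
        if reward:
            wins[arm] += 1
        else:
            losses[arm] += 1
        total += reward
    return total, pulls


if __name__ == "__main__":
    rates = [0.04, 0.05, 0.08]  # hypothetical conversion rates, not from the paper
    print("epsilon greedy:", epsilon_greedy(rates, 10_000))
    print("thompson sampling:", thompson_sampling(rates, 10_000))
```

Epsilon Greedy explores at a fixed rate regardless of how much evidence it has, whereas Thompson Sampling explores less as its posteriors sharpen. Which one earns more in a given run depends on the arm rates, the horizon, and the choice of `epsilon`, so this sketch should not be read as reproducing the paper's result.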