Dynamic Estimation of Rater Reliability in Subjective Tasks Using Multi-armed Bandits

2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing Pub Date : 2012-09-03 DOI:10.1109/SocialCom-PASSAT.2012.50

Alexey Tarasov, Sarah Jane Delany, Brian Mac Namee

{"title":"Dynamic Estimation of Rater Reliability in Subjective Tasks Using Multi-armed Bandits","authors":"Alexey Tarasov, Sarah Jane Delany, Brian Mac Namee","doi":"10.1109/SocialCom-PASSAT.2012.50","DOIUrl":null,"url":null,"abstract":"Many application areas that use supervised machine learning make use of multiple raters to collect target ratings for training data. Usage of multiple raters, however, inevitably introduces the risk that a proportion of them will be unreliable. The presence of unreliable raters can prolong the rating process, make it more expensive and lead to inaccurate ratings. The dominant, \"static\" approach of solving this problem in state-of-the-art research is to estimate the rater reliability and to calculate the target ratings when all ratings have been gathered. However, doing it dynamically while raters rate training data can make the acquisition of ratings faster and cheaper compared to static techniques. We propose to cast the problem of the dynamic estimation of rater reliability as a multi-armed bandit problem. Experiments show that the usage of multi-armed bandits for this problem is worthwhile, providing that each rater can rate any asset when asked. The purpose of this paper is to outline the directions of future research in this area.","PeriodicalId":129526,"journal":{"name":"2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SocialCom-PASSAT.2012.50","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Many application areas that use supervised machine learning make use of multiple raters to collect target ratings for training data. Usage of multiple raters, however, inevitably introduces the risk that a proportion of them will be unreliable. The presence of unreliable raters can prolong the rating process, make it more expensive and lead to inaccurate ratings. The dominant, "static" approach of solving this problem in state-of-the-art research is to estimate the rater reliability and to calculate the target ratings when all ratings have been gathered. However, doing it dynamically while raters rate training data can make the acquisition of ratings faster and cheaper compared to static techniques. We propose to cast the problem of the dynamic estimation of rater reliability as a multi-armed bandit problem. Experiments show that the usage of multi-armed bandits for this problem is worthwhile, providing that each rater can rate any asset when asked. The purpose of this paper is to outline the directions of future research in this area.

查看原文本刊更多论文

基于多臂强盗的主观任务评价信度动态估计

许多使用监督机器学习的应用领域都使用多个评分者来收集训练数据的目标评分。然而，使用多个评级机构不可避免地会带来其中一部分评级机构不可靠的风险。不可靠评级机构的存在会延长评级过程，使其成本更高，并导致评级不准确。在最先进的研究中，解决这一问题的主要“静态”方法是估计评分者的可靠性，并在收集了所有评分后计算目标评分。然而，与静态技术相比，在评级员对训练数据进行评级时动态地进行评级可以使评级的获取更快、更便宜。我们提出将评价可靠性的动态估计问题看作是一个多臂强盗问题。实验表明，在这个问题上使用多武装强盗是值得的，因为每个评价者都可以在被要求时对任何资产进行评级。本文的目的是概述该领域未来的研究方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing

自引率

0.00%

发文量