Crowdsourcing System for Numerical Tasks based on Latent Topic Aware Worker Reliability

IEEE INFOCOM 2021 - IEEE Conference on Computer Communications Pub Date : 2021-05-10 DOI:10.1109/INFOCOM42981.2021.9488748

Zhuan Shi, Shanyang Jiang, Lan Zhang, Yang Du, Xiangyang Li

{"title":"Crowdsourcing System for Numerical Tasks based on Latent Topic Aware Worker Reliability","authors":"Zhuan Shi, Shanyang Jiang, Lan Zhang, Yang Du, Xiangyang Li","doi":"10.1109/INFOCOM42981.2021.9488748","DOIUrl":null,"url":null,"abstract":"Crowdsourcing is a widely adopted way for various labor-intensive tasks. One of the core problems in crowdsourcing systems is how to assign tasks to most suitable workers for better results, which heavily relies on the accurate profiling of each worker’s reliability for different topics of tasks. Many previous work have studied worker reliability for either explicit topics represented by task descriptions or latent topics for categorical tasks. In this work, we aim to accurately estimate more fine-grained worker reliability for latent topics in numerical tasks, so as to further improve the result quality. We propose a bayesian probabilistic model named Gaussian Latent Topic Model(GLTM) to mine the latent topics of numerical tasks based on workers’ behaviors and to estimate workers’ topic-level reliability. By utilizing the GLTM, we propose a truth inference algorithm named TI-GLTM to accurately infer the tasks’ truth and topics simultaneously and dynamically update workers’ topic-level reliability. We also design an online task assignment mechanism called MRA-GLTM, which assigns appropriate tasks to workers with the Maximum Reduced Ambiguity principle. The experiment results show our algorithms can achieve significantly lower MAE and MSE than that of the state-of-the-art approaches.","PeriodicalId":293079,"journal":{"name":"IEEE INFOCOM 2021 - IEEE Conference on Computer Communications","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE INFOCOM 2021 - IEEE Conference on Computer Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOCOM42981.2021.9488748","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

Crowdsourcing is a widely adopted way for various labor-intensive tasks. One of the core problems in crowdsourcing systems is how to assign tasks to most suitable workers for better results, which heavily relies on the accurate profiling of each worker’s reliability for different topics of tasks. Many previous work have studied worker reliability for either explicit topics represented by task descriptions or latent topics for categorical tasks. In this work, we aim to accurately estimate more fine-grained worker reliability for latent topics in numerical tasks, so as to further improve the result quality. We propose a bayesian probabilistic model named Gaussian Latent Topic Model(GLTM) to mine the latent topics of numerical tasks based on workers’ behaviors and to estimate workers’ topic-level reliability. By utilizing the GLTM, we propose a truth inference algorithm named TI-GLTM to accurately infer the tasks’ truth and topics simultaneously and dynamically update workers’ topic-level reliability. We also design an online task assignment mechanism called MRA-GLTM, which assigns appropriate tasks to workers with the Maximum Reduced Ambiguity principle. The experiment results show our algorithms can achieve significantly lower MAE and MSE than that of the state-of-the-art approaches.

查看原文本刊更多论文

基于潜在主题感知工作者可靠性的数字任务众包系统

众包是各种劳动密集型任务广泛采用的方式。众包系统的核心问题之一是如何将任务分配给最合适的工人以获得更好的结果，这在很大程度上依赖于每个工人对不同任务主题的可靠性的准确分析。许多先前的工作研究了工作人员对任务描述所代表的显性主题或分类任务的潜在主题的信度。在这项工作中，我们的目标是准确地估计数字任务中潜在主题的更细粒度的工作者可靠性，从而进一步提高结果质量。本文提出了一种基于工作人员行为的贝叶斯概率模型高斯潜在话题模型(Gaussian Latent Topic model, GLTM)来挖掘数字任务的潜在话题，并估计工作人员的话题级信度。利用GLTM，我们提出了一种真实度推断算法TI-GLTM，能够同时准确地推断任务的真实度和主题，并动态更新工作人员的主题级可靠性。我们还设计了一种名为MRA-GLTM的在线任务分配机制，该机制根据最大减少歧义原则为工作人员分配适当的任务。实验结果表明，我们的算法可以获得比现有方法更低的MAE和MSE。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE INFOCOM 2021 - IEEE Conference on Computer Communications

自引率

0.00%

发文量