微任务众包的梯度下降矩阵分解

IF 8 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-05-02 DOI:10.1016/j.engappai.2025.111003

Alireza Moayedikia

{"title":"微任务众包的梯度下降矩阵分解","authors":"Alireza Moayedikia","doi":"10.1016/j.engappai.2025.111003","DOIUrl":null,"url":null,"abstract":"<div><div>Conventional crowdsourcing algorithms assign all tasks to all workers and collect their final answers. However, a major issue with these baseline approaches is their inability to identify and eliminate answers from low-quality or non-genuine workers. To tackle this challenge, novel microtasking algorithms have been proposed where workers' expertise is estimated to allocate tasks accordingly. These algorithms rely on a worker-task matrix, but a common problem is the potential sparsity of this matrix due to not all workers answering allocated tasks and a high ratio of tasks to workers. To address this issue, this paper introduces a novel optimization-based matrix factorization approach using Gradient Descent known as GRADi. GRADi aims to predict missing answers from different workers while factorizing the worker-task matrix. During this process, GRADi incorporates worker similarity information to enhance factorization accuracy. The effectiveness of GRADi is evaluated using datasets from Amazon Mechanical Turk and similar platforms, assessing accuracy and Root Mean Square Error. Comparative analyses against other matrix factorization algorithms, including both optimization and non-optimization techniques, as well as existing microtasking algorithms, demonstrate that GRADi consistently outperforms these methods. It shows promising results in improving microtasking outcomes by better predicting worker responses and handling the inherent challenges of sparse worker-task matrices.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"154 ","pages":"Article 111003"},"PeriodicalIF":8.0000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A gradient descent matrix factorization for microtask crowdsourcing\",\"authors\":\"Alireza Moayedikia\",\"doi\":\"10.1016/j.engappai.2025.111003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Conventional crowdsourcing algorithms assign all tasks to all workers and collect their final answers. However, a major issue with these baseline approaches is their inability to identify and eliminate answers from low-quality or non-genuine workers. To tackle this challenge, novel microtasking algorithms have been proposed where workers' expertise is estimated to allocate tasks accordingly. These algorithms rely on a worker-task matrix, but a common problem is the potential sparsity of this matrix due to not all workers answering allocated tasks and a high ratio of tasks to workers. To address this issue, this paper introduces a novel optimization-based matrix factorization approach using Gradient Descent known as GRADi. GRADi aims to predict missing answers from different workers while factorizing the worker-task matrix. During this process, GRADi incorporates worker similarity information to enhance factorization accuracy. The effectiveness of GRADi is evaluated using datasets from Amazon Mechanical Turk and similar platforms, assessing accuracy and Root Mean Square Error. Comparative analyses against other matrix factorization algorithms, including both optimization and non-optimization techniques, as well as existing microtasking algorithms, demonstrate that GRADi consistently outperforms these methods. It shows promising results in improving microtasking outcomes by better predicting worker responses and handling the inherent challenges of sparse worker-task matrices.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"154 \",\"pages\":\"Article 111003\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-05-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625010036\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625010036","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

传统的众包算法将所有任务分配给所有工人，并收集他们的最终答案。然而，这些基线方法的一个主要问题是它们无法识别和消除低质量或不真实的员工的答案。为了应对这一挑战，提出了新的微任务算法，其中估计工人的专业知识来相应地分配任务。这些算法依赖于一个工人-任务矩阵，但一个常见的问题是这个矩阵的潜在稀疏性，因为不是所有的工人都回答分配的任务，而且任务与工人的比例很高。为了解决这个问题，本文介绍了一种新的基于优化的矩阵分解方法，使用梯度下降称为GRADi。GRADi旨在通过分解工人-任务矩阵来预测不同工人的缺失答案。在此过程中，GRADi结合了工人相似度信息来提高分解精度。使用Amazon Mechanical Turk和类似平台的数据集评估GRADi的有效性，评估准确性和均方根误差。与其他矩阵分解算法（包括优化和非优化技术，以及现有的微任务算法）的比较分析表明，GRADi始终优于这些方法。它通过更好地预测工人的反应和处理稀疏工人-任务矩阵的固有挑战，在改善微任务结果方面显示了有希望的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A gradient descent matrix factorization for microtask crowdsourcing

Conventional crowdsourcing algorithms assign all tasks to all workers and collect their final answers. However, a major issue with these baseline approaches is their inability to identify and eliminate answers from low-quality or non-genuine workers. To tackle this challenge, novel microtasking algorithms have been proposed where workers' expertise is estimated to allocate tasks accordingly. These algorithms rely on a worker-task matrix, but a common problem is the potential sparsity of this matrix due to not all workers answering allocated tasks and a high ratio of tasks to workers. To address this issue, this paper introduces a novel optimization-based matrix factorization approach using Gradient Descent known as GRADi. GRADi aims to predict missing answers from different workers while factorizing the worker-task matrix. During this process, GRADi incorporates worker similarity information to enhance factorization accuracy. The effectiveness of GRADi is evaluated using datasets from Amazon Mechanical Turk and similar platforms, assessing accuracy and Root Mean Square Error. Comparative analyses against other matrix factorization algorithms, including both optimization and non-optimization techniques, as well as existing microtasking algorithms, demonstrate that GRADi consistently outperforms these methods. It shows promising results in improving microtasking outcomes by better predicting worker responses and handling the inherent challenges of sparse worker-task matrices.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.