微任务众包的梯度下降矩阵分解

IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Alireza Moayedikia
{"title":"微任务众包的梯度下降矩阵分解","authors":"Alireza Moayedikia","doi":"10.1016/j.engappai.2025.111003","DOIUrl":null,"url":null,"abstract":"<div><div>Conventional crowdsourcing algorithms assign all tasks to all workers and collect their final answers. However, a major issue with these baseline approaches is their inability to identify and eliminate answers from low-quality or non-genuine workers. To tackle this challenge, novel microtasking algorithms have been proposed where workers' expertise is estimated to allocate tasks accordingly. These algorithms rely on a worker-task matrix, but a common problem is the potential sparsity of this matrix due to not all workers answering allocated tasks and a high ratio of tasks to workers. To address this issue, this paper introduces a novel optimization-based matrix factorization approach using Gradient Descent known as GRADi. GRADi aims to predict missing answers from different workers while factorizing the worker-task matrix. During this process, GRADi incorporates worker similarity information to enhance factorization accuracy. The effectiveness of GRADi is evaluated using datasets from Amazon Mechanical Turk and similar platforms, assessing accuracy and Root Mean Square Error. Comparative analyses against other matrix factorization algorithms, including both optimization and non-optimization techniques, as well as existing microtasking algorithms, demonstrate that GRADi consistently outperforms these methods. It shows promising results in improving microtasking outcomes by better predicting worker responses and handling the inherent challenges of sparse worker-task matrices.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"154 ","pages":"Article 111003"},"PeriodicalIF":8.0000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A gradient descent matrix factorization for microtask crowdsourcing\",\"authors\":\"Alireza Moayedikia\",\"doi\":\"10.1016/j.engappai.2025.111003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Conventional crowdsourcing algorithms assign all tasks to all workers and collect their final answers. However, a major issue with these baseline approaches is their inability to identify and eliminate answers from low-quality or non-genuine workers. To tackle this challenge, novel microtasking algorithms have been proposed where workers' expertise is estimated to allocate tasks accordingly. These algorithms rely on a worker-task matrix, but a common problem is the potential sparsity of this matrix due to not all workers answering allocated tasks and a high ratio of tasks to workers. To address this issue, this paper introduces a novel optimization-based matrix factorization approach using Gradient Descent known as GRADi. GRADi aims to predict missing answers from different workers while factorizing the worker-task matrix. During this process, GRADi incorporates worker similarity information to enhance factorization accuracy. The effectiveness of GRADi is evaluated using datasets from Amazon Mechanical Turk and similar platforms, assessing accuracy and Root Mean Square Error. Comparative analyses against other matrix factorization algorithms, including both optimization and non-optimization techniques, as well as existing microtasking algorithms, demonstrate that GRADi consistently outperforms these methods. It shows promising results in improving microtasking outcomes by better predicting worker responses and handling the inherent challenges of sparse worker-task matrices.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"154 \",\"pages\":\"Article 111003\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-05-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625010036\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625010036","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

传统的众包算法将所有任务分配给所有工人,并收集他们的最终答案。然而,这些基线方法的一个主要问题是它们无法识别和消除低质量或不真实的员工的答案。为了应对这一挑战,提出了新的微任务算法,其中估计工人的专业知识来相应地分配任务。这些算法依赖于一个工人-任务矩阵,但一个常见的问题是这个矩阵的潜在稀疏性,因为不是所有的工人都回答分配的任务,而且任务与工人的比例很高。为了解决这个问题,本文介绍了一种新的基于优化的矩阵分解方法,使用梯度下降称为GRADi。GRADi旨在通过分解工人-任务矩阵来预测不同工人的缺失答案。在此过程中,GRADi结合了工人相似度信息来提高分解精度。使用Amazon Mechanical Turk和类似平台的数据集评估GRADi的有效性,评估准确性和均方根误差。与其他矩阵分解算法(包括优化和非优化技术,以及现有的微任务算法)的比较分析表明,GRADi始终优于这些方法。它通过更好地预测工人的反应和处理稀疏工人-任务矩阵的固有挑战,在改善微任务结果方面显示了有希望的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A gradient descent matrix factorization for microtask crowdsourcing
Conventional crowdsourcing algorithms assign all tasks to all workers and collect their final answers. However, a major issue with these baseline approaches is their inability to identify and eliminate answers from low-quality or non-genuine workers. To tackle this challenge, novel microtasking algorithms have been proposed where workers' expertise is estimated to allocate tasks accordingly. These algorithms rely on a worker-task matrix, but a common problem is the potential sparsity of this matrix due to not all workers answering allocated tasks and a high ratio of tasks to workers. To address this issue, this paper introduces a novel optimization-based matrix factorization approach using Gradient Descent known as GRADi. GRADi aims to predict missing answers from different workers while factorizing the worker-task matrix. During this process, GRADi incorporates worker similarity information to enhance factorization accuracy. The effectiveness of GRADi is evaluated using datasets from Amazon Mechanical Turk and similar platforms, assessing accuracy and Root Mean Square Error. Comparative analyses against other matrix factorization algorithms, including both optimization and non-optimization techniques, as well as existing microtasking algorithms, demonstrate that GRADi consistently outperforms these methods. It shows promising results in improving microtasking outcomes by better predicting worker responses and handling the inherent challenges of sparse worker-task matrices.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信