{"title":"带有偏见的众包标签聚类的标签聚合","authors":"Ming Wu, Qianmu Li, Jing Zhang, J. Hou","doi":"10.1145/3529836.3529861","DOIUrl":null,"url":null,"abstract":"With the rapid development of crowdsourcing learning, amount of label aggregation methods are proposed to infer the true labels of instances from multiple noisy labels provided by inexpert crowd workers. Most of the label aggregation methods take the reliabilities of workers and the difficulties of instances into account and construct the probabilistic models, then infer the aggregated label and estimate the parameters simultaneously. However, to the best of our knowledge, label aggregation for biased crowdsourced labeling scenarios has not been sufficiently studied. Biased labeling is a critical factor that affects the performance of label aggregation and is hard to detect and model. To this end, this paper proposes a novel Label Aggregation with Clustering method for Biased Labeling (LACBL), to improve the quality of crowd labels by mitigating the labeling bias. LACBL detects the labeling bias of the dataset using clustering methods and then decreases the ratio of the biased class labels according to the bias. Finally, a label aggregation method is applied to the renewed label set. Experimental results on four real-world datasets show that LACBL outperforms other state-of-the-art label aggregation algorithms.","PeriodicalId":285191,"journal":{"name":"2022 14th International Conference on Machine Learning and Computing (ICMLC)","volume":"262 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Label Aggregation with Clustering for Biased Crowdsourced Labeling\",\"authors\":\"Ming Wu, Qianmu Li, Jing Zhang, J. Hou\",\"doi\":\"10.1145/3529836.3529861\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of crowdsourcing learning, amount of label aggregation methods are proposed to infer the true labels of instances from multiple noisy labels provided by inexpert crowd workers. Most of the label aggregation methods take the reliabilities of workers and the difficulties of instances into account and construct the probabilistic models, then infer the aggregated label and estimate the parameters simultaneously. However, to the best of our knowledge, label aggregation for biased crowdsourced labeling scenarios has not been sufficiently studied. Biased labeling is a critical factor that affects the performance of label aggregation and is hard to detect and model. To this end, this paper proposes a novel Label Aggregation with Clustering method for Biased Labeling (LACBL), to improve the quality of crowd labels by mitigating the labeling bias. LACBL detects the labeling bias of the dataset using clustering methods and then decreases the ratio of the biased class labels according to the bias. Finally, a label aggregation method is applied to the renewed label set. Experimental results on four real-world datasets show that LACBL outperforms other state-of-the-art label aggregation algorithms.\",\"PeriodicalId\":285191,\"journal\":{\"name\":\"2022 14th International Conference on Machine Learning and Computing (ICMLC)\",\"volume\":\"262 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 14th International Conference on Machine Learning and Computing (ICMLC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3529836.3529861\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Machine Learning and Computing (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3529836.3529861","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Label Aggregation with Clustering for Biased Crowdsourced Labeling
With the rapid development of crowdsourcing learning, amount of label aggregation methods are proposed to infer the true labels of instances from multiple noisy labels provided by inexpert crowd workers. Most of the label aggregation methods take the reliabilities of workers and the difficulties of instances into account and construct the probabilistic models, then infer the aggregated label and estimate the parameters simultaneously. However, to the best of our knowledge, label aggregation for biased crowdsourced labeling scenarios has not been sufficiently studied. Biased labeling is a critical factor that affects the performance of label aggregation and is hard to detect and model. To this end, this paper proposes a novel Label Aggregation with Clustering method for Biased Labeling (LACBL), to improve the quality of crowd labels by mitigating the labeling bias. LACBL detects the labeling bias of the dataset using clustering methods and then decreases the ratio of the biased class labels according to the bias. Finally, a label aggregation method is applied to the renewed label set. Experimental results on four real-world datasets show that LACBL outperforms other state-of-the-art label aggregation algorithms.