{"title":"A Novel Homophily-aware Correction Approach for Crowdsourced Labels Using Information Entropy","authors":"Kang Yan, Jian Lu, Qingren Wang, Wei Li","doi":"10.1109/ICKG52313.2021.00013","DOIUrl":null,"url":null,"abstract":"Crowdsourcing provides a cost effective and conve-nient way for label collection. However, it fails to guarantee the quality of crowdsourced labels. Inspired by homophily in social networks denoting the tendency of individuals with similar char-acteristics to be friends with each other, in this paper we propose a novel Homophily-aware Correction Approach for crowdsourced labels using Information Entropy (namely HaCAIE), to further achieve quality improvement of crowdsourced labels. Specifically, Our HaCAIE can be decomposed into three phases: $i$) seeking full semantic relations among entities, where HaCAIE models multiple explicit and implicit semantic relations among labelers, tasks and categories, based on homogeneous information network and related techniques; ii) calculating homophily, where HaCAIE utilizes adjacent relation matrices of labelers and tasks to calculate homophily among labelers; and iii) correcting labels, where for each task, HaCAIE employs information entropy and constructs a corresponding star homophily network to perform label correction. Our experimental results on six real-world datasets not only show that our HaCAIE performs well, but also demonstrate that HaCAIE can collaborate well with different inference algorithms in the field of crowdsourcing.","PeriodicalId":174126,"journal":{"name":"2021 IEEE International Conference on Big Knowledge (ICBK)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Big Knowledge (ICBK)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICKG52313.2021.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Crowdsourcing provides a cost effective and conve-nient way for label collection. However, it fails to guarantee the quality of crowdsourced labels. Inspired by homophily in social networks denoting the tendency of individuals with similar char-acteristics to be friends with each other, in this paper we propose a novel Homophily-aware Correction Approach for crowdsourced labels using Information Entropy (namely HaCAIE), to further achieve quality improvement of crowdsourced labels. Specifically, Our HaCAIE can be decomposed into three phases: $i$) seeking full semantic relations among entities, where HaCAIE models multiple explicit and implicit semantic relations among labelers, tasks and categories, based on homogeneous information network and related techniques; ii) calculating homophily, where HaCAIE utilizes adjacent relation matrices of labelers and tasks to calculate homophily among labelers; and iii) correcting labels, where for each task, HaCAIE employs information entropy and constructs a corresponding star homophily network to perform label correction. Our experimental results on six real-world datasets not only show that our HaCAIE performs well, but also demonstrate that HaCAIE can collaborate well with different inference algorithms in the field of crowdsourcing.