{"title":"Three cobblers worth the mastermind? The potential of ensemble in crowdsourced classification problems","authors":"Wangcheng Yan, Paolo Letizia, Wenjun Zhou","doi":"10.1111/deci.12516","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Classification problems, where the objective is to identify the class labels of given data points, are most often the subject of open contests, in which solvers compete for awards offered by solution seekers. Extant literature in open contests has studied both the winner-takes-all and top-K award schemes, in which the award is granted to the best one and the best K solutions, respectively. However, in comparing these two schemes, researchers have never considered that under a top-K award scheme, seekers may often benefit from aggregating the solutions as an ensemble, which could achieve a performance that is superior to any individual solution. The practice of ensemble is very common especially in classification and other data science problems, and in this work, we are the first to model it with the scope of deriving the optimal award scheme in open contests. Our results formally show that, given the option of ensemble, a top-K award scheme may have the advantage to grant a higher profit to the seeker than the winner-takes-all award scheme. Further, the optimal number of awards is positively affected by contest parameters including the solvers' expertise, the number of solvers, and the return on effort, whereas it is negatively affected by the technical uncertainty when such uncertainty is sufficiently large. These findings are consistent with the increasing adoption of the top-K award scheme in contests held on Kaggle and other similar platforms.</p>\n </div>","PeriodicalId":48256,"journal":{"name":"DECISION SCIENCES","volume":"53 2","pages":"223-238"},"PeriodicalIF":2.8000,"publicationDate":"2021-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1111/deci.12516","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DECISION SCIENCES","FirstCategoryId":"91","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/deci.12516","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MANAGEMENT","Score":null,"Total":0}
引用次数: 1
Abstract
Classification problems, where the objective is to identify the class labels of given data points, are most often the subject of open contests, in which solvers compete for awards offered by solution seekers. Extant literature in open contests has studied both the winner-takes-all and top-K award schemes, in which the award is granted to the best one and the best K solutions, respectively. However, in comparing these two schemes, researchers have never considered that under a top-K award scheme, seekers may often benefit from aggregating the solutions as an ensemble, which could achieve a performance that is superior to any individual solution. The practice of ensemble is very common especially in classification and other data science problems, and in this work, we are the first to model it with the scope of deriving the optimal award scheme in open contests. Our results formally show that, given the option of ensemble, a top-K award scheme may have the advantage to grant a higher profit to the seeker than the winner-takes-all award scheme. Further, the optimal number of awards is positively affected by contest parameters including the solvers' expertise, the number of solvers, and the return on effort, whereas it is negatively affected by the technical uncertainty when such uncertainty is sufficiently large. These findings are consistent with the increasing adoption of the top-K award scheme in contests held on Kaggle and other similar platforms.
期刊介绍:
Decision Sciences, a premier journal of the Decision Sciences Institute, publishes scholarly research about decision making within the boundaries of an organization, as well as decisions involving inter-firm coordination. The journal promotes research advancing decision making at the interfaces of business functions and organizational boundaries. The journal also seeks articles extending established lines of work assuming the results of the research have the potential to substantially impact either decision making theory or industry practice. Ground-breaking research articles that enhance managerial understanding of decision making processes and stimulate further research in multi-disciplinary domains are particularly encouraged.