Improving a Gold Standard: Treating Human Relevance Judgments of MEDLINE Document Pairs.

Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications Pub Date : 2011-02-04 DOI:10.1109/ICMLA.2010.79

Won Kim, W John Wilbur

{"title":"Improving a Gold Standard: Treating Human Relevance Judgments of MEDLINE Document Pairs.","authors":"Won Kim, W John Wilbur","doi":"10.1109/ICMLA.2010.79","DOIUrl":null,"url":null,"abstract":"<p><p>Given prior human judgments of the condition of an object it is possible to use these judgments to make a maximal likelihood estimate of what future human judgments of the condition of that object will be. However, if one has a reasonably large collection of similar objects and the prior human judgments of a number of judges regarding the condition of each object in the collection, then it is possible to make predictions of future human judgments for the whole collection that are superior to the simple maximal likelihood estimate for each object in isolation. This is possible because the multiple judgments over the collection allow an analysis to determine the relative value of a judge as compared with the other judges in the group and this value can be used to augment or diminish a particular judge's influence in predicting future judgments. Here we study and compare five different methods for making such improved predictions and show that each is superior to simple maximal likelihood estimates.</p>","PeriodicalId":74528,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications","volume":"2010 ","pages":"491-498"},"PeriodicalIF":0.0000,"publicationDate":"2011-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3086285/pdf/nihms238822.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2010.79","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Given prior human judgments of the condition of an object it is possible to use these judgments to make a maximal likelihood estimate of what future human judgments of the condition of that object will be. However, if one has a reasonably large collection of similar objects and the prior human judgments of a number of judges regarding the condition of each object in the collection, then it is possible to make predictions of future human judgments for the whole collection that are superior to the simple maximal likelihood estimate for each object in isolation. This is possible because the multiple judgments over the collection allow an analysis to determine the relative value of a judge as compared with the other judges in the group and this value can be used to augment or diminish a particular judge's influence in predicting future judgments. Here we study and compare five different methods for making such improved predictions and show that each is superior to simple maximal likelihood estimates.

查看原文本刊更多论文

改进金标准：处理MEDLINE文档对的人类相关性判断。

给定人类先前对一个物体的状态的判断，就有可能利用这些判断来对未来人类对该物体的状态的判断做出最大似然估计。然而，如果一个人有一个相当大的类似物体的集合，以及许多法官对集合中每个物体的状况的先前人类判断，那么就有可能对整个集合的未来人类判断做出预测，这种预测优于对孤立的每个物体的简单最大似然估计。这是可能的，因为收集的多个判决允许分析确定法官与组中其他法官相比的相对价值，这个价值可以用来增加或减少特定法官在预测未来判决方面的影响。在这里，我们研究和比较了五种不同的方法来做出这种改进的预测，并表明每一种方法都优于简单的最大似然估计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications

自引率

0.00%

发文量