估计检索系统排名的可靠性

2016 International Conference on Software Networking (ICSN) Pub Date : 2016-05-23 DOI:10.1109/ICSN.2016.7501924

Sri Devi Ravana, Zhang Shuxiang

{"title":"估计检索系统排名的可靠性","authors":"Sri Devi Ravana, Zhang Shuxiang","doi":"10.1109/ICSN.2016.7501924","DOIUrl":null,"url":null,"abstract":"Information retrieval evaluation based on the pooling method inherently poses biasness towards systems that contributed to the pool of judged documents. This may distort the results about the relative effectiveness of different retrieval strategies, or rather, the retrieval systems and thus result in unreliable system rankings. The purpose of this study is to suggest a technique to estimate the reliability of the retrieval system effectiveness rank in a list of ranked systems based on its performance in previous experiments. This can be also defined as the strength of rank for the individual retrieval system. By doing so, we will be able to predict the performance of each system in future experiments. To validate the proposed rank strength estimation method, an alternative systems ranking method is proposed to generate a new list of systems rankings which is used together with the proposed rank strength estimation method. The experimentation shows that the correlation coefficients remain above 0.8 across different number of experiments which means the new systems ranking is highly correlated with the gold standard. It suggests that the rank reliability estimation methods have effectively predicted the strength of the system ranks. And also, the results from both TREC 2004 and TREC 8 show the similar outcome which further confirms the effectiveness of the proposed rank reliability estimation method.","PeriodicalId":282295,"journal":{"name":"2016 International Conference on Software Networking (ICSN)","volume":"160 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Estimating the Reliability of the Retrieval Systems Rankings\",\"authors\":\"Sri Devi Ravana, Zhang Shuxiang\",\"doi\":\"10.1109/ICSN.2016.7501924\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Information retrieval evaluation based on the pooling method inherently poses biasness towards systems that contributed to the pool of judged documents. This may distort the results about the relative effectiveness of different retrieval strategies, or rather, the retrieval systems and thus result in unreliable system rankings. The purpose of this study is to suggest a technique to estimate the reliability of the retrieval system effectiveness rank in a list of ranked systems based on its performance in previous experiments. This can be also defined as the strength of rank for the individual retrieval system. By doing so, we will be able to predict the performance of each system in future experiments. To validate the proposed rank strength estimation method, an alternative systems ranking method is proposed to generate a new list of systems rankings which is used together with the proposed rank strength estimation method. The experimentation shows that the correlation coefficients remain above 0.8 across different number of experiments which means the new systems ranking is highly correlated with the gold standard. It suggests that the rank reliability estimation methods have effectively predicted the strength of the system ranks. And also, the results from both TREC 2004 and TREC 8 show the similar outcome which further confirms the effectiveness of the proposed rank reliability estimation method.\",\"PeriodicalId\":282295,\"journal\":{\"name\":\"2016 International Conference on Software Networking (ICSN)\",\"volume\":\"160 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-05-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Software Networking (ICSN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSN.2016.7501924\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Software Networking (ICSN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSN.2016.7501924","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

基于池化方法的信息检索评估固有地对那些对被判断文档池有贡献的系统造成了偏见。这可能会扭曲关于不同检索策略的相对有效性的结果，或者更确切地说，是检索系统，从而导致不可靠的系统排名。本研究的目的是提出一种基于检索系统在先前实验中的表现来估计排序系统列表中检索系统有效性排名的可靠性的技术。这也可以定义为单个检索系统的等级强度。通过这样做，我们将能够预测未来实验中每个系统的性能。为了验证所提出的排名强度估计方法，提出了一种替代的系统排名方法来生成新的系统排名列表，该列表与所提出的排名强度估计方法一起使用。实验表明，在不同次数的实验中，相关系数保持在0.8以上，这意味着新系统排名与金标准高度相关。结果表明，秩信度估计方法能够有效地预测系统秩的强度。此外，TREC 2004和TREC 8的结果也显示出相似的结果，进一步证实了所提出的秩信度估计方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Estimating the Reliability of the Retrieval Systems Rankings

Information retrieval evaluation based on the pooling method inherently poses biasness towards systems that contributed to the pool of judged documents. This may distort the results about the relative effectiveness of different retrieval strategies, or rather, the retrieval systems and thus result in unreliable system rankings. The purpose of this study is to suggest a technique to estimate the reliability of the retrieval system effectiveness rank in a list of ranked systems based on its performance in previous experiments. This can be also defined as the strength of rank for the individual retrieval system. By doing so, we will be able to predict the performance of each system in future experiments. To validate the proposed rank strength estimation method, an alternative systems ranking method is proposed to generate a new list of systems rankings which is used together with the proposed rank strength estimation method. The experimentation shows that the correlation coefficients remain above 0.8 across different number of experiments which means the new systems ranking is highly correlated with the gold standard. It suggests that the rank reliability estimation methods have effectively predicted the strength of the system ranks. And also, the results from both TREC 2004 and TREC 8 show the similar outcome which further confirms the effectiveness of the proposed rank reliability estimation method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 International Conference on Software Networking (ICSN)

自引率

0.00%

发文量