{"title":"估计检索系统排名的可靠性","authors":"Sri Devi Ravana, Zhang Shuxiang","doi":"10.1109/ICSN.2016.7501924","DOIUrl":null,"url":null,"abstract":"Information retrieval evaluation based on the pooling method inherently poses biasness towards systems that contributed to the pool of judged documents. This may distort the results about the relative effectiveness of different retrieval strategies, or rather, the retrieval systems and thus result in unreliable system rankings. The purpose of this study is to suggest a technique to estimate the reliability of the retrieval system effectiveness rank in a list of ranked systems based on its performance in previous experiments. This can be also defined as the strength of rank for the individual retrieval system. By doing so, we will be able to predict the performance of each system in future experiments. To validate the proposed rank strength estimation method, an alternative systems ranking method is proposed to generate a new list of systems rankings which is used together with the proposed rank strength estimation method. The experimentation shows that the correlation coefficients remain above 0.8 across different number of experiments which means the new systems ranking is highly correlated with the gold standard. It suggests that the rank reliability estimation methods have effectively predicted the strength of the system ranks. And also, the results from both TREC 2004 and TREC 8 show the similar outcome which further confirms the effectiveness of the proposed rank reliability estimation method.","PeriodicalId":282295,"journal":{"name":"2016 International Conference on Software Networking (ICSN)","volume":"160 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Estimating the Reliability of the Retrieval Systems Rankings\",\"authors\":\"Sri Devi Ravana, Zhang Shuxiang\",\"doi\":\"10.1109/ICSN.2016.7501924\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Information retrieval evaluation based on the pooling method inherently poses biasness towards systems that contributed to the pool of judged documents. This may distort the results about the relative effectiveness of different retrieval strategies, or rather, the retrieval systems and thus result in unreliable system rankings. The purpose of this study is to suggest a technique to estimate the reliability of the retrieval system effectiveness rank in a list of ranked systems based on its performance in previous experiments. This can be also defined as the strength of rank for the individual retrieval system. By doing so, we will be able to predict the performance of each system in future experiments. To validate the proposed rank strength estimation method, an alternative systems ranking method is proposed to generate a new list of systems rankings which is used together with the proposed rank strength estimation method. The experimentation shows that the correlation coefficients remain above 0.8 across different number of experiments which means the new systems ranking is highly correlated with the gold standard. It suggests that the rank reliability estimation methods have effectively predicted the strength of the system ranks. And also, the results from both TREC 2004 and TREC 8 show the similar outcome which further confirms the effectiveness of the proposed rank reliability estimation method.\",\"PeriodicalId\":282295,\"journal\":{\"name\":\"2016 International Conference on Software Networking (ICSN)\",\"volume\":\"160 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-05-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Software Networking (ICSN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSN.2016.7501924\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Software Networking (ICSN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSN.2016.7501924","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Estimating the Reliability of the Retrieval Systems Rankings
Information retrieval evaluation based on the pooling method inherently poses biasness towards systems that contributed to the pool of judged documents. This may distort the results about the relative effectiveness of different retrieval strategies, or rather, the retrieval systems and thus result in unreliable system rankings. The purpose of this study is to suggest a technique to estimate the reliability of the retrieval system effectiveness rank in a list of ranked systems based on its performance in previous experiments. This can be also defined as the strength of rank for the individual retrieval system. By doing so, we will be able to predict the performance of each system in future experiments. To validate the proposed rank strength estimation method, an alternative systems ranking method is proposed to generate a new list of systems rankings which is used together with the proposed rank strength estimation method. The experimentation shows that the correlation coefficients remain above 0.8 across different number of experiments which means the new systems ranking is highly correlated with the gold standard. It suggests that the rank reliability estimation methods have effectively predicted the strength of the system ranks. And also, the results from both TREC 2004 and TREC 8 show the similar outcome which further confirms the effectiveness of the proposed rank reliability estimation method.