On the Effectiveness of Query Weighting for Adapting Rank Learners to New Unlabelled Collections

Proceedings of the 25th ACM International on Conference on Information and Knowledge Management Pub Date : 2016-10-24 DOI:10.1145/2983323.2983852

Pengfei Li, M. Sanderson, Mark James Carman, Falk Scholer

{"title":"On the Effectiveness of Query Weighting for Adapting Rank Learners to New Unlabelled Collections","authors":"Pengfei Li, M. Sanderson, Mark James Carman, Falk Scholer","doi":"10.1145/2983323.2983852","DOIUrl":null,"url":null,"abstract":"Query-level instance weighting is a technique for unsupervised transfer ranking, which aims to train a ranker on a source collection so that it also performs effectively on a target collection, even if no judgement information exists for the latter. Past work has shown that this approach can be used to significantly improve effectiveness; in this work, the approach is re-examined on a wide set of publicly available L2R test collections with more advanced learning to rank algorithms. Different query-level weighting strategies are examined against two transfer ranking frameworks: AdaRank and a new weighted LambdaMART algorithm. Our experimental results show that the effectiveness of different weighting strategies, including those shown in past work, vary under different transferring environments. In particular, (i) Kullback-Leibler based density-ratio estimation tends to outperform a classification-based approach and (ii) aggregating document-level weights into query-level weights is likely superior to direct estimation using a query-level representation. The Nemenyi statistical test, applied across multiple datasets, indicates that most weighting transfer learning methods do not significantly outperform baselines, although there is potential for the further development of such techniques.","PeriodicalId":250808,"journal":{"name":"Proceedings of the 25th ACM International on Conference on Information and Knowledge Management","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 25th ACM International on Conference on Information and Knowledge Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2983323.2983852","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

Query-level instance weighting is a technique for unsupervised transfer ranking, which aims to train a ranker on a source collection so that it also performs effectively on a target collection, even if no judgement information exists for the latter. Past work has shown that this approach can be used to significantly improve effectiveness; in this work, the approach is re-examined on a wide set of publicly available L2R test collections with more advanced learning to rank algorithms. Different query-level weighting strategies are examined against two transfer ranking frameworks: AdaRank and a new weighted LambdaMART algorithm. Our experimental results show that the effectiveness of different weighting strategies, including those shown in past work, vary under different transferring environments. In particular, (i) Kullback-Leibler based density-ratio estimation tends to outperform a classification-based approach and (ii) aggregating document-level weights into query-level weights is likely superior to direct estimation using a query-level representation. The Nemenyi statistical test, applied across multiple datasets, indicates that most weighting transfer learning methods do not significantly outperform baselines, although there is potential for the further development of such techniques.

查看原文本刊更多论文

基于查询加权的秩学习器适应新未标记集合的有效性研究

查询级实例加权是一种用于无监督转移排序的技术，其目的是训练源集合上的排序器，以便它在目标集合上也能有效地执行，即使后者不存在判断信息。过去的工作表明，这种方法可以显著提高有效性;在这项工作中，该方法在广泛的公开可用的L2R测试集合上进行了重新检查，并使用了更高级的学习排序算法。针对AdaRank和一种新的LambdaMART加权算法这两种传输排序框架，研究了不同的查询级加权策略。我们的实验结果表明，在不同的迁移环境下，不同的权重策略(包括过去的研究)的有效性是不同的。特别是，(i)基于Kullback-Leibler的密度比估计往往优于基于分类的方法，(ii)将文档级权重聚合到查询级权重可能优于使用查询级表示的直接估计。应用于多个数据集的Nemenyi统计检验表明，大多数加权迁移学习方法并没有显著优于基线，尽管此类技术有进一步发展的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 25th ACM International on Conference on Information and Knowledge Management

自引率

0.00%

发文量