Improved Combination of Multiple Retrieval Systems Using a Dynamic Combinatorial Fusion Algorithm

Hongzhi Liu, Zhonghai Wu, D. Hsu, B. Kristal
{"title":"Improved Combination of Multiple Retrieval Systems Using a Dynamic Combinatorial Fusion Algorithm","authors":"Hongzhi Liu, Zhonghai Wu, D. Hsu, B. Kristal","doi":"10.1109/WI.2016.0102","DOIUrl":null,"url":null,"abstract":"A combination of multiple retrieval systems can outperform its individual component systems, but it remains a challenging problem to predict whether two systems can be beneficially combined and, if so, the optimal means by which they should be merged. The performance of combined systems is affected by many factors, including the performance of individual systems, the diversity between a pair of systems, and the method for combination. In this paper, we undertake the study of these issues using combinatorial fusion algorithm (CFA) utilizing the rank-score characteristic (RSC) function and the notion of a weighted cognitive diversity. Using the selected eight TREC datasets, we demonstrated that: (a) the combination of two retrieval systems performs better than each individual system only when the individual systems have relatively good performance and they are diverse, (b) a dynamic combination method, using rank vs. score combination based on cognitive diversity which does not display a tight correlation with other statistical diversity measures, can improve the performance of the combined system, even when performance of each individual system is not known or in the context of an unsupervised learning environment. Within the TREC datasets, the proposed dynamic approach offers a potential for substantial improvement with no significant risk. Our results provide a new paradigm of dynamic fusion to the study of the combination of multiple retrieval systems.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"1 1","pages":"592-596"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI.2016.0102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

A combination of multiple retrieval systems can outperform its individual component systems, but it remains a challenging problem to predict whether two systems can be beneficially combined and, if so, the optimal means by which they should be merged. The performance of combined systems is affected by many factors, including the performance of individual systems, the diversity between a pair of systems, and the method for combination. In this paper, we undertake the study of these issues using combinatorial fusion algorithm (CFA) utilizing the rank-score characteristic (RSC) function and the notion of a weighted cognitive diversity. Using the selected eight TREC datasets, we demonstrated that: (a) the combination of two retrieval systems performs better than each individual system only when the individual systems have relatively good performance and they are diverse, (b) a dynamic combination method, using rank vs. score combination based on cognitive diversity which does not display a tight correlation with other statistical diversity measures, can improve the performance of the combined system, even when performance of each individual system is not known or in the context of an unsupervised learning environment. Within the TREC datasets, the proposed dynamic approach offers a potential for substantial improvement with no significant risk. Our results provide a new paradigm of dynamic fusion to the study of the combination of multiple retrieval systems.
基于动态组合融合算法的多检索系统改进组合
多个检索系统的组合可以优于其单独的组件系统,但预测两个系统是否可以有益地组合以及如果可以,则合并它们的最佳方法仍然是一个具有挑战性的问题。组合系统的性能受多个因素的影响,包括单个系统的性能、一对系统之间的差异性以及组合方式等。在本文中,我们使用组合融合算法(CFA)利用排名得分特征(RSC)函数和加权认知多样性的概念进行这些问题的研究。使用选定的8个TREC数据集,我们证明了:(a)只有当单个检索系统具有较好的检索性能和多样性时,两个检索系统的组合才比单个检索系统的组合性能更好;(b)采用基于认知多样性的等级与分数组合的动态组合方法可以提高组合系统的性能,该方法与其他统计多样性指标的相关性不强;即使在每个单独系统的性能未知或处于无监督学习环境的情况下。在TREC数据集中,提出的动态方法提供了实质性改进的潜力,而没有重大风险。我们的研究结果为多检索系统的组合研究提供了一种新的动态融合范式。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信