排序信息搜索的多重线性组合方法

Yizheng Huang, L. Zeng
{"title":"排序信息搜索的多重线性组合方法","authors":"Yizheng Huang, L. Zeng","doi":"10.1109/WI-IAT55865.2022.00119","DOIUrl":null,"url":null,"abstract":"Since the well-known BM25 [1] was proposed, BM25 and its enhanced version [2] – [4] have dominated the document/passage ranking task for a long time. However, with the advent of deep learning models like BERT [5] , these pre-trained models have achieved noticeable progress in various information retrieval (IR) tasks. But, as BM25 is a \"bag of words\" retrieval method by matching keywords, it remains a better option for passage ranking in some exceptional cases, like identifying names [6] . Therefore, fusing BM25 with deep learning models is a natural idea to benefit the ranking results. This paper discusses various linear methods of combing BM25 with BERT to see how they affect the final results of the models. We conduct experiments on the MS MARCO V2 dataset, which show convincing results.","PeriodicalId":345445,"journal":{"name":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiple Linear Combination Approaches for Information Search in Ranking\",\"authors\":\"Yizheng Huang, L. Zeng\",\"doi\":\"10.1109/WI-IAT55865.2022.00119\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Since the well-known BM25 [1] was proposed, BM25 and its enhanced version [2] – [4] have dominated the document/passage ranking task for a long time. However, with the advent of deep learning models like BERT [5] , these pre-trained models have achieved noticeable progress in various information retrieval (IR) tasks. But, as BM25 is a \\\"bag of words\\\" retrieval method by matching keywords, it remains a better option for passage ranking in some exceptional cases, like identifying names [6] . Therefore, fusing BM25 with deep learning models is a natural idea to benefit the ranking results. This paper discusses various linear methods of combing BM25 with BERT to see how they affect the final results of the models. We conduct experiments on the MS MARCO V2 dataset, which show convincing results.\",\"PeriodicalId\":345445,\"journal\":{\"name\":\"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WI-IAT55865.2022.00119\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI-IAT55865.2022.00119","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

自著名的BM25[1]被提出以来,BM25及其增强版本[2]-[4]长期主导着文档/段落排序任务。然而,随着BERT[5]等深度学习模型的出现,这些预训练模型在各种信息检索(IR)任务中取得了显著进展。但是,由于BM25是一种通过匹配关键字的“词包”检索方法,因此在某些特殊情况下,例如识别姓名,它仍然是一种更好的选择[6]。因此,将BM25与深度学习模型融合是有利于排名结果的自然想法。本文讨论了BM25与BERT结合的各种线性方法,以了解它们如何影响模型的最终结果。我们在MS MARCO V2数据集上进行了实验,得到了令人信服的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multiple Linear Combination Approaches for Information Search in Ranking
Since the well-known BM25 [1] was proposed, BM25 and its enhanced version [2] – [4] have dominated the document/passage ranking task for a long time. However, with the advent of deep learning models like BERT [5] , these pre-trained models have achieved noticeable progress in various information retrieval (IR) tasks. But, as BM25 is a "bag of words" retrieval method by matching keywords, it remains a better option for passage ranking in some exceptional cases, like identifying names [6] . Therefore, fusing BM25 with deep learning models is a natural idea to benefit the ranking results. This paper discusses various linear methods of combing BM25 with BERT to see how they affect the final results of the models. We conduct experiments on the MS MARCO V2 dataset, which show convincing results.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信