Convolutional Neural Networks for Soft-Matching N-Grams in Ad-hoc Search

Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining Pub Date : 2018-02-02 DOI:10.1145/3159652.3159659

Zhuyun Dai, Chenyan Xiong, Jamie Callan, Zhiyuan Liu

引用次数: 300

Abstract

This paper presents \textttConv-KNRM, a Convolutional Kernel-based Neural Ranking Model that models n-gram soft matches for ad-hoc search. Instead of exact matching query and document n-grams, \textttConv-KNRM uses Convolutional Neural Networks to represent n-grams of various lengths and soft matches them in a unified embedding space. The n-gram soft matches are then utilized by the kernel pooling and learning-to-rank layers to generate the final ranking score. \textttConv-KNRM can be learned end-to-end and fully optimized from user feedback. The learned model»s generalizability is investigated by testing how well it performs in a related domain with small amounts of training data. Experiments on English search logs, Chinese search logs, and TREC Web track tasks demonstrated consistent advantages of \textttConv-KNRM over prior neural IR methods and feature-based methods.

查看原文本刊更多论文

卷积神经网络在自组织搜索中的软匹配N-Grams

本文提出了\ texttconvn - knrm，这是一个基于卷积核的神经排序模型，用于模拟n-gram软匹配以进行自组织搜索。\ texttconvn - knrm不是精确匹配查询和文档n-gram，而是使用卷积神经网络来表示不同长度的n-gram，并在统一的嵌入空间中对它们进行软匹配。然后，内核池和学习排序层利用n-gram软匹配来生成最终的排序分数。\ texttconvr - knrm可以端到端学习，并从用户反馈中充分优化。通过使用少量训练数据测试其在相关领域的表现，来研究学习模型的泛化性。在英文搜索日志、中文搜索日志和TREC Web track任务上的实验表明，\ texttconvr - knrm相对于先前的神经IR方法和基于特征的方法具有一致的优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining

自引率

0.00%

发文量