Semantic Augmentation Transformer Model for Ad-hoc Retrieval
Chongyang Li
2022 IEEE 6th Information Technology and Mechatronics Engineering Conference (ITOEC), published 2022-03-04
DOI: 10.1109/ITOEC53115.2022.9734522
Citations: 0
Abstract
We propose SATM, a Semantic Augmentation Transformer Model for ad-hoc retrieval. SATM adopts contrastive learning, which improves semantic-similarity performance and the ranking results of ad-hoc retrieval, and also augments the semantic representation of sentence embeddings. Specifically, we first use an unsupervised contrastive learning augmentation module to learn query similarity, so that the projection head can accurately capture query semantics. We then use the trained encoder network to map queries and perform semantic-similarity calculation and ranking against document embeddings. We use BERT to extract the contextual representation of each sentence, and use the augmentation module to enhance the semantics and eliminate the anisotropy of the sentence embeddings. Experimental results show that on the TrecQA dataset, SATM achieves improvements of 4% in MRR and 1.4% in MAP over BERT-base.
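The pipeline the abstract describes has two stages: an unsupervised contrastive objective that pulls two augmented views of the same query together in embedding space (with other in-batch queries as negatives), then cosine-similarity ranking of document embeddings against the trained query encoding. The sketch below illustrates those two stages with NumPy; it is a minimal illustration under stated assumptions, not the paper's implementation — the real model uses a BERT encoder and a learned projection head, whereas here random vectors stand in for encoder outputs and the dropout-style noise is a hypothetical augmentation.

```python
import numpy as np

def l2_normalize(x, axis=-1):
    """Scale rows (or a single vector) to unit L2 norm."""
    return x / np.linalg.norm(x, axis=axis, keepdims=True)

def info_nce_loss(view_a, view_b, temperature=0.05):
    """In-batch contrastive (InfoNCE) loss: row i of view_a should
    match row i of view_b; the other rows in the batch are negatives."""
    a = l2_normalize(view_a)
    b = l2_normalize(view_b)
    logits = a @ b.T / temperature  # (batch, batch) cosine similarities
    # Row-wise log-softmax; the diagonal entries are the positive pairs.
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))

def rank_documents(query_vec, doc_vecs):
    """Rank documents by cosine similarity to a query embedding."""
    sims = l2_normalize(doc_vecs) @ l2_normalize(query_vec)
    return np.argsort(-sims)  # indices of documents, best match first

rng = np.random.default_rng(0)
queries = rng.normal(size=(8, 16))  # stand-in for encoder outputs
# Two noisy "views" of the same queries (e.g. dropout noise, SimCSE-style).
view_a = queries + 0.01 * rng.normal(size=queries.shape)
view_b = queries + 0.01 * rng.normal(size=queries.shape)
loss = info_nce_loss(view_a, view_b)

# Retrieval stage: rank candidate documents for one query.
query = rng.normal(size=16)
docs = rng.normal(size=(5, 16))
order = rank_documents(query, docs)
```

In training, the gradient of this loss would update the encoder and projection head so that matched views score higher than in-batch negatives; at retrieval time only the cosine-similarity ranking is needed.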