Sentence Semantic Matching with Hierarchical CNN Based on Dimension-augmented Representation

Rui Yu, Wenpeng Lu, Yifeng Li, Jiguo Yu, Guoqiang Zhang, Xu Zhang
{"title":"Sentence Semantic Matching with Hierarchical CNN Based on Dimension-augmented Representation","authors":"Rui Yu, Wenpeng Lu, Yifeng Li, Jiguo Yu, Guoqiang Zhang, Xu Zhang","doi":"10.1109/IJCNN52387.2021.9533498","DOIUrl":null,"url":null,"abstract":"As a fundamental task in natural language processing, sentence semantic matching (SSM) is critical yet challenging due to difficulties in learning expressive sentence representation while capturing complex interactions between sentences. Recent work has shown the great potential of deep neural models in improving the performance of SSM task. However, existing work usually employs recurrent neural networks (RNNs) or 1D (one-dimensional) convolutional neural networks (CNNs) to learn sentence representation, leading to limited performance improvement. Benefiting from the multi-dimensional structure, 2D convolutional neural networks are expected to be more powerful to learn expressive sentence representation by capturing the implicit inter-sentence interactions and thus can further improve the performance of SSM. To this end, in this paper, we propose a novel sentence semantic matching model named Hierarchical CNN based on Dimension-augmented Representation (HiDR). In HiDR, first, bidirectional long short-term memory networks (LSTMs) are utilized to generate dimension-augmented representation for each of the input sentences; then, a hierarchical 2D CNN is devised to learn sentence representation while capturing the inter-sentence interactions, followed by a prediction layer based on sigmoid function to output the matching degree between sentences. To evaluate the performance of our proposed model, we conducted extensive experiments on two public real-world data sets. The empirical results show that HiDR has achieved remarkable results, which demonstrates either better or comparable performance w.r.t. BERT-based models.","PeriodicalId":396583,"journal":{"name":"2021 International Joint Conference on Neural Networks (IJCNN)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Joint Conference on Neural Networks (IJCNN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN52387.2021.9533498","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

As a fundamental task in natural language processing, sentence semantic matching (SSM) is critical yet challenging due to difficulties in learning expressive sentence representation while capturing complex interactions between sentences. Recent work has shown the great potential of deep neural models in improving the performance of SSM task. However, existing work usually employs recurrent neural networks (RNNs) or 1D (one-dimensional) convolutional neural networks (CNNs) to learn sentence representation, leading to limited performance improvement. Benefiting from the multi-dimensional structure, 2D convolutional neural networks are expected to be more powerful to learn expressive sentence representation by capturing the implicit inter-sentence interactions and thus can further improve the performance of SSM. To this end, in this paper, we propose a novel sentence semantic matching model named Hierarchical CNN based on Dimension-augmented Representation (HiDR). In HiDR, first, bidirectional long short-term memory networks (LSTMs) are utilized to generate dimension-augmented representation for each of the input sentences; then, a hierarchical 2D CNN is devised to learn sentence representation while capturing the inter-sentence interactions, followed by a prediction layer based on sigmoid function to output the matching degree between sentences. To evaluate the performance of our proposed model, we conducted extensive experiments on two public real-world data sets. The empirical results show that HiDR has achieved remarkable results, which demonstrates either better or comparable performance w.r.t. BERT-based models.
基于维增表示的分层CNN句子语义匹配
作为自然语言处理的一项基本任务,句子语义匹配(SSM)是一项关键而又具有挑战性的任务,因为在捕捉句子之间复杂的相互作用的同时,很难学习到表达性的句子表示。近年来的研究表明,深度神经模型在提高SSM任务性能方面具有巨大的潜力。然而,现有的工作通常使用递归神经网络(rnn)或一维卷积神经网络(cnn)来学习句子表示,导致性能提高有限。得益于二维卷积神经网络的多维结构,二维卷积神经网络有望更强大地通过捕获隐含的句子间相互作用来学习表达性句子表示,从而进一步提高SSM的性能。为此,本文提出了一种新的基于维度增强表示(HiDR)的句子语义匹配模型——分层CNN。在HiDR中,首先,利用双向长短期记忆网络(LSTMs)为每个输入句子生成维度增强表示;然后,设计了一种分层二维CNN来学习句子表示,同时捕捉句子间的相互作用,然后基于sigmoid函数的预测层输出句子之间的匹配程度。为了评估我们提出的模型的性能,我们在两个公开的真实世界数据集上进行了广泛的实验。实证结果表明,HiDR取得了显著的效果,这表明基于bert的模型具有更好或相当的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信