具有丰富特征的轻量级文本匹配方法

Changhua Ji, Zhang Tao, Jiayi Mao, Li Zhang
{"title":"具有丰富特征的轻量级文本匹配方法","authors":"Changhua Ji, Zhang Tao, Jiayi Mao, Li Zhang","doi":"10.1145/3532213.3532262","DOIUrl":null,"url":null,"abstract":"Text matching is one of the research hotspots in Natural Language Processing (NLP). The study of text matching is of great practical importance for applications such as text de-duplication, web retrieval, and question answering systems. A lightweight text matching method with rich features is proposed for the problem of large number of model parameters and low efficiency of text matching tasks in natural language processing. The whole model architecture is based on Siamese neural networks with shared parameters. Furthermore, the method utilizes an improved residual Network and attention mechanism for the extraction and alignment of vector representations. Only three key features for alignment operations are retained. In addition, an averaging operation is added to the fusion layer to provide vector representations with rich information for the prediction layer. Experimental results on the paraphrase identification dataset and two natural language inference datasets show that the proposed approach not only effectively reduces the number of parameters compared with existing models but also ensures good text matching performance. Experiments demonstrate that this method can be used in general text matching tasks.","PeriodicalId":333199,"journal":{"name":"Proceedings of the 8th International Conference on Computing and Artificial Intelligence","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lightweight Text Matching Method with Rich Features\",\"authors\":\"Changhua Ji, Zhang Tao, Jiayi Mao, Li Zhang\",\"doi\":\"10.1145/3532213.3532262\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text matching is one of the research hotspots in Natural Language Processing (NLP). The study of text matching is of great practical importance for applications such as text de-duplication, web retrieval, and question answering systems. A lightweight text matching method with rich features is proposed for the problem of large number of model parameters and low efficiency of text matching tasks in natural language processing. The whole model architecture is based on Siamese neural networks with shared parameters. Furthermore, the method utilizes an improved residual Network and attention mechanism for the extraction and alignment of vector representations. Only three key features for alignment operations are retained. In addition, an averaging operation is added to the fusion layer to provide vector representations with rich information for the prediction layer. Experimental results on the paraphrase identification dataset and two natural language inference datasets show that the proposed approach not only effectively reduces the number of parameters compared with existing models but also ensures good text matching performance. Experiments demonstrate that this method can be used in general text matching tasks.\",\"PeriodicalId\":333199,\"journal\":{\"name\":\"Proceedings of the 8th International Conference on Computing and Artificial Intelligence\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 8th International Conference on Computing and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3532213.3532262\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 8th International Conference on Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3532213.3532262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

文本匹配是自然语言处理(NLP)中的研究热点之一。文本匹配的研究对于文本去重复、网络检索和问答系统等应用具有重要的实际意义。针对自然语言处理中模型参数多、文本匹配任务效率低的问题,提出了一种具有丰富特征的轻量级文本匹配方法。整个模型架构基于具有共享参数的Siamese神经网络。此外,该方法利用改进的残差网络和注意机制来提取和对齐向量表示。只保留了对齐操作的三个关键特性。此外,在融合层中加入平均运算,为预测层提供具有丰富信息的向量表示。在释义识别数据集和两个自然语言推理数据集上的实验结果表明,与现有模型相比,该方法不仅有效地减少了参数数量,而且保证了良好的文本匹配性能。实验表明,该方法可用于一般的文本匹配任务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Lightweight Text Matching Method with Rich Features
Text matching is one of the research hotspots in Natural Language Processing (NLP). The study of text matching is of great practical importance for applications such as text de-duplication, web retrieval, and question answering systems. A lightweight text matching method with rich features is proposed for the problem of large number of model parameters and low efficiency of text matching tasks in natural language processing. The whole model architecture is based on Siamese neural networks with shared parameters. Furthermore, the method utilizes an improved residual Network and attention mechanism for the extraction and alignment of vector representations. Only three key features for alignment operations are retained. In addition, an averaging operation is added to the fusion layer to provide vector representations with rich information for the prediction layer. Experimental results on the paraphrase identification dataset and two natural language inference datasets show that the proposed approach not only effectively reduces the number of parameters compared with existing models but also ensures good text matching performance. Experiments demonstrate that this method can be used in general text matching tasks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信