T. Zhang, Zhe Zhang, Xiang Li, Yulin Wu, Bo Peng, Yurong Qian, Mengnan Ma, Hongyong Leng
{"title":"基于Bert和对抗网络的短文本语义匹配模型","authors":"T. Zhang, Zhe Zhang, Xiang Li, Yulin Wu, Bo Peng, Yurong Qian, Mengnan Ma, Hongyong Leng","doi":"10.1109/NaNA56854.2022.00073","DOIUrl":null,"url":null,"abstract":"Short text semantic matching plays an important role in the fields of natural language processing such as fast retrieval, intelligent question answering, and information matching. Aiming at the problem of word polysemy that is difficult to solve by conventional models, this paper proposes a short text semantic matching model BERT-GAN based on the BERT (Bidirectional Encoder Representations from Transformers) pre-training model and combined with the adversarial network in the fine-tuning stage. The basic idea is as follows: Using BERT to extract text features, and then introducing an adversarial network in the fine-tuning stage of the task to add perturbation to the embedding layer to improve the generalization ability and robustness of the model. The experimental results show that the BERT-GAN short text semantic matching model is better than the comparison model, and the F1 value is improved by 10.5%, 6.6% and 0.9% respectively compared with the comparison model.","PeriodicalId":113743,"journal":{"name":"2022 International Conference on Networking and Network Applications (NaNA)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Short Text Semantic Matching Model based on Bert and Adversarial Network\",\"authors\":\"T. Zhang, Zhe Zhang, Xiang Li, Yulin Wu, Bo Peng, Yurong Qian, Mengnan Ma, Hongyong Leng\",\"doi\":\"10.1109/NaNA56854.2022.00073\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Short text semantic matching plays an important role in the fields of natural language processing such as fast retrieval, intelligent question answering, and information matching. Aiming at the problem of word polysemy that is difficult to solve by conventional models, this paper proposes a short text semantic matching model BERT-GAN based on the BERT (Bidirectional Encoder Representations from Transformers) pre-training model and combined with the adversarial network in the fine-tuning stage. The basic idea is as follows: Using BERT to extract text features, and then introducing an adversarial network in the fine-tuning stage of the task to add perturbation to the embedding layer to improve the generalization ability and robustness of the model. The experimental results show that the BERT-GAN short text semantic matching model is better than the comparison model, and the F1 value is improved by 10.5%, 6.6% and 0.9% respectively compared with the comparison model.\",\"PeriodicalId\":113743,\"journal\":{\"name\":\"2022 International Conference on Networking and Network Applications (NaNA)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Networking and Network Applications (NaNA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NaNA56854.2022.00073\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Networking and Network Applications (NaNA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NaNA56854.2022.00073","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
短文本语义匹配在快速检索、智能问答、信息匹配等自然语言处理领域发挥着重要作用。针对传统模型难以解决的词多义问题,本文在BERT (Bidirectional Encoder Representations from Transformers)预训练模型的基础上,结合微调阶段的对抗网络,提出了一种短文本语义匹配模型BERT- gan。其基本思路是:利用BERT提取文本特征,然后在任务的微调阶段引入对抗网络,在嵌入层中加入扰动,提高模型的泛化能力和鲁棒性。实验结果表明,BERT-GAN短文本语义匹配模型优于对比模型,F1值分别比对比模型提高了10.5%、6.6%和0.9%。
Short Text Semantic Matching Model based on Bert and Adversarial Network
Short text semantic matching plays an important role in the fields of natural language processing such as fast retrieval, intelligent question answering, and information matching. Aiming at the problem of word polysemy that is difficult to solve by conventional models, this paper proposes a short text semantic matching model BERT-GAN based on the BERT (Bidirectional Encoder Representations from Transformers) pre-training model and combined with the adversarial network in the fine-tuning stage. The basic idea is as follows: Using BERT to extract text features, and then introducing an adversarial network in the fine-tuning stage of the task to add perturbation to the embedding layer to improve the generalization ability and robustness of the model. The experimental results show that the BERT-GAN short text semantic matching model is better than the comparison model, and the F1 value is improved by 10.5%, 6.6% and 0.9% respectively compared with the comparison model.