Deep Semantic Understanding and Sequence Relevance Learning for Question Routing in Community Question Answering

IF 2 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS
Hong Li, Jianjun Li, Guohui Li, Chunzhi Wang, Wenjun Cao, Zixuan Chen
{"title":"Deep Semantic Understanding and Sequence Relevance Learning for Question Routing in Community Question Answering","authors":"Hong Li, Jianjun Li, Guohui Li, Chunzhi Wang, Wenjun Cao, Zixuan Chen","doi":"10.5755/j01.itc.52.3.33449","DOIUrl":null,"url":null,"abstract":"Question routing (QR) aims to route newly submitted questions to the potential experts most likely to provide answers. Many previous works formalize the question routing task as a text matching and ranking problem between questions and user profiles, focusing on text representation and semantic similarity computation. However, these works often fail to extract matching features efficiently and lack deep contextual textual understanding. Moreover, we argue that in addition to the semantic similarity between terms, the interactive relationship between question sequences and user profile sequences also plays an important role in matching. In this paper, we proposed two BERT-based models called QR-BERTrep and QR-tBERTint to address these issues from different perspectives. QR-BERTrep is a representation-based feature ensemble model in which we integrated a weighted sum of BERT layer outputs as an extra feature into a Siamese deep matching network, aiming to address the non-context-aware word embedding and limited semantic understanding. QR-tBERTint is an interaction-based model that explores the interactive relationships between sequences as well as the semantic similarity of terms through a topic-enhanced BERT model. Specifically, it fuses a short-text-friendly topic model to capture corpus-level semantic information. Experimental results on real-world data demonstrate that QR-BERTrep significantly outperforms other traditional representation-based models. Meanwhile, QR-tBERTint exceeds QR-BERTrep and QR-BERTint with a maximum increase of 17.26% and 11.52% in MAP, respectively, showing that combining global topic information and exploring interactive relationships between sequences is quite effective for question routing tasks.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"2012 1","pages":"0"},"PeriodicalIF":2.0000,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Technology and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5755/j01.itc.52.3.33449","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Question routing (QR) aims to route newly submitted questions to the potential experts most likely to provide answers. Many previous works formalize the question routing task as a text matching and ranking problem between questions and user profiles, focusing on text representation and semantic similarity computation. However, these works often fail to extract matching features efficiently and lack deep contextual textual understanding. Moreover, we argue that in addition to the semantic similarity between terms, the interactive relationship between question sequences and user profile sequences also plays an important role in matching. In this paper, we proposed two BERT-based models called QR-BERTrep and QR-tBERTint to address these issues from different perspectives. QR-BERTrep is a representation-based feature ensemble model in which we integrated a weighted sum of BERT layer outputs as an extra feature into a Siamese deep matching network, aiming to address the non-context-aware word embedding and limited semantic understanding. QR-tBERTint is an interaction-based model that explores the interactive relationships between sequences as well as the semantic similarity of terms through a topic-enhanced BERT model. Specifically, it fuses a short-text-friendly topic model to capture corpus-level semantic information. Experimental results on real-world data demonstrate that QR-BERTrep significantly outperforms other traditional representation-based models. Meanwhile, QR-tBERTint exceeds QR-BERTrep and QR-BERTint with a maximum increase of 17.26% and 11.52% in MAP, respectively, showing that combining global topic information and exploring interactive relationships between sequences is quite effective for question routing tasks.
社区问答中问题路由的深度语义理解和序列相关学习
问题路由(QR)旨在将新提交的问题路由到最有可能提供答案的潜在专家。许多先前的工作将问题路由任务形式化为问题与用户配置文件之间的文本匹配和排序问题,重点关注文本表示和语义相似度计算。然而,这些作品往往不能有效地提取匹配特征,缺乏深入的上下文文本理解。此外,我们认为除了术语之间的语义相似度外,问题序列和用户档案序列之间的交互关系也在匹配中起着重要作用。在本文中,我们提出了两个基于bert的模型,即QR-BERTrep和QR-tBERTint,以从不同的角度解决这些问题。QR-BERTrep是一种基于表示的特征集成模型,我们将BERT层输出的加权和作为额外的特征集成到Siamese深度匹配网络中,旨在解决非上下文感知的词嵌入和有限的语义理解问题。QR-tBERTint是一个基于交互的模型,它通过主题增强的BERT模型来探索序列之间的交互关系以及术语的语义相似性。具体来说,它融合了一个短文本友好的主题模型来捕获语料库级的语义信息。实际数据的实验结果表明,QR-BERTrep显著优于其他传统的基于表示的模型。同时,QR-tBERTint在MAP上的最大增幅分别为17.26%和11.52%,超过了QR-BERTrep和QR-BERTint,表明结合全局主题信息和探索序列之间的交互关系对于问题路由任务是非常有效的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Information Technology and Control
Information Technology and Control 工程技术-计算机:人工智能
CiteScore
2.70
自引率
9.10%
发文量
36
审稿时长
12 months
期刊介绍: Periodical journal covers a wide field of computer science and control systems related problems including: -Software and hardware engineering; -Management systems engineering; -Information systems and databases; -Embedded systems; -Physical systems modelling and application; -Computer networks and cloud computing; -Data visualization; -Human-computer interface; -Computer graphics, visual analytics, and multimedia systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信