{"title":"Leveraging Perturbation Consistency to Improve Multi-hop Knowledge Base Question Answering","authors":"Xin Wang, Hongbin Shi","doi":"10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00196","DOIUrl":null,"url":null,"abstract":"Multi-hop knowledge base question answering aims to answer natural language questions through multi-hop relation reasoning in the knowledge base. An important challenge of the task is the lack of labels for reasoning paths, which leads to the possibility to produce correct answers through incorrect paths in the training, and cannot generalize well in testing. Recently research has attempted to handle the challenge by devising reward shaping or introducing additional information to generate supervision signals of intermediate paths. But they required extra expert experience and label information. To address this situation, we propose a novel method under the teacher-student framework, it leverages perturbation consistency to learn intermediate paths. In the teacher network, we construct close data points for intermediate path prediction by applying random perturbations. Inspired by the data smoothing assumption that labels of close data points should be the same, a consistency loss over predictions of constructed data points and original ones is evaluated. The student network is used to answer questions more precisely by leveraging the intermediate distribution learned from the teacher network. Extensive experiments on two benchmark datasets are conducted, and the results have demonstrated the effectiveness of the proposed method.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":null,"pages":null},"PeriodicalIF":0.9000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scalable Computing-Practice and Experience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 1
Abstract
Multi-hop knowledge base question answering aims to answer natural language questions through multi-hop relation reasoning in the knowledge base. An important challenge of the task is the lack of labels for reasoning paths, which leads to the possibility to produce correct answers through incorrect paths in the training, and cannot generalize well in testing. Recently research has attempted to handle the challenge by devising reward shaping or introducing additional information to generate supervision signals of intermediate paths. But they required extra expert experience and label information. To address this situation, we propose a novel method under the teacher-student framework, it leverages perturbation consistency to learn intermediate paths. In the teacher network, we construct close data points for intermediate path prediction by applying random perturbations. Inspired by the data smoothing assumption that labels of close data points should be the same, a consistency loss over predictions of constructed data points and original ones is evaluated. The student network is used to answer questions more precisely by leveraging the intermediate distribution learned from the teacher network. Extensive experiments on two benchmark datasets are conducted, and the results have demonstrated the effectiveness of the proposed method.
期刊介绍:
The area of scalable computing has matured and reached a point where new issues and trends require a professional forum. SCPE will provide this avenue by publishing original refereed papers that address the present as well as the future of parallel and distributed computing. The journal will focus on algorithm development, implementation and execution on real-world parallel architectures, and application of parallel and distributed computing to the solution of real-life problems.