{"title":"A Retriever-Reasoner Method for Multi-Document Reading Comprehension","authors":"Yu Fan, Qingwen Liu","doi":"10.1109/ICCWAMTIP56608.2022.10016488","DOIUrl":null,"url":null,"abstract":"Machine reading comprehension (MRC) is one of the main research of question answering system tasks. In real-world scenarios, due to the large scale of document data, and many questions that need to be answered urgently, traditional extractive MRC method faces low efficiency, low accuracy, and has poor reasoning ability. In order to efficiently and accurately extract knowledge from massive unstructured text data, we propose a multi-document MRC model based on a two-stage approach. Through document retrieval and answer reasoning, the model can hierarchically match the relevant passages from coarse-to-fine. Experimental results show that our method outperforms baseline and improves Rouge-L and BLEU-4 by 2.5% and 1.9%, respectively. In addition, the reasoning process is interpretable and can provide support for information understanding.","PeriodicalId":159508,"journal":{"name":"2022 19th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)","volume":"221 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 19th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCWAMTIP56608.2022.10016488","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Machine reading comprehension (MRC) is one of the main research of question answering system tasks. In real-world scenarios, due to the large scale of document data, and many questions that need to be answered urgently, traditional extractive MRC method faces low efficiency, low accuracy, and has poor reasoning ability. In order to efficiently and accurately extract knowledge from massive unstructured text data, we propose a multi-document MRC model based on a two-stage approach. Through document retrieval and answer reasoning, the model can hierarchically match the relevant passages from coarse-to-fine. Experimental results show that our method outperforms baseline and improves Rouge-L and BLEU-4 by 2.5% and 1.9%, respectively. In addition, the reasoning process is interpretable and can provide support for information understanding.