{"title":"Label-Enhanced Reading Comprehension Model","authors":"苏立新, 郭嘉丰, 范意兴, 兰艳艳, 程学旗","doi":"10.16451/J.CNKI.ISSN1003-6059.202002002","DOIUrl":null,"url":null,"abstract":"In the existing extractive reading comprehension models,only the boundary of answers is utilized as the supervision signal and the labeling processed by human is ignored.Consequently,learned models are prone to learn the superficial features and the generalization performance is degraded.In this paper,a label-enhanced reading comprehension model is proposed to imitate human activity.The answer-bearing sentence,the content and the boundary of the answer are learned simultaneously.The answer-bearing sentence and the content of the answer can be derived from the boundary of the answer and these three types of labels are regarded as supervision signals.The model is trained by multitask learning.During prediction,the probabilities from three predictions are merged to determine the answer,and thus the generalization performance is improved.Experiments on SQuAD dataset demonstrate the effectiveness of LE-Reader model.","PeriodicalId":34917,"journal":{"name":"模式识别与人工智能","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"模式识别与人工智能","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.16451/J.CNKI.ISSN1003-6059.202002002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0
Abstract
In the existing extractive reading comprehension models,only the boundary of answers is utilized as the supervision signal and the labeling processed by human is ignored.Consequently,learned models are prone to learn the superficial features and the generalization performance is degraded.In this paper,a label-enhanced reading comprehension model is proposed to imitate human activity.The answer-bearing sentence,the content and the boundary of the answer are learned simultaneously.The answer-bearing sentence and the content of the answer can be derived from the boundary of the answer and these three types of labels are regarded as supervision signals.The model is trained by multitask learning.During prediction,the probabilities from three predictions are merged to determine the answer,and thus the generalization performance is improved.Experiments on SQuAD dataset demonstrate the effectiveness of LE-Reader model.