ViMRC - VLSP 2021: Improving Retrospective Reader for Vietnamese Machine Reading Comprehension

Quan Quoc Chu, Vi Van Ngo, N. H. Le, Duc Sy Nguyen
{"title":"ViMRC - VLSP 2021: Improving Retrospective Reader for Vietnamese Machine Reading Comprehension","authors":"Quan Quoc Chu, Vi Van Ngo, N. H. Le, Duc Sy Nguyen","doi":"10.25073/2588-1086/vnucsce.346","DOIUrl":null,"url":null,"abstract":"In recent years, there are multiple systems (eg. search engines and dialogue systems) that require machines to be able to read and understand human text to serve several tasks in application. Machine Reading Comprehension (MRC) has posed a challenge to the Natural Language Processing (NLP) community in teaching machines to understand the meaning of human text in order to answer questions provided. Specifically in this challenge, the dataset contains questions that can be unanswerable, otherwise the answers can be extracted from the given passages. To deal with this challenge, our works mainly based on a recent approach, known as Retrospective Reader, to confronting unanswerable questions. Additionally, we focuses on enhancing the ability of answer extraction by applying properly attention mechanism and improving the representation ability through semantic information. Besides, we also present an ensemble way to acquire significant improvement in results provided by single models. Our method achieves 1$^{st}$ place on Vietnamese MRC shared task at the $8^{th}$ International Workshop on Vietnamese Language and Speech Processing (VLSP) with F1-score of \\textbf{0.77241} and exact match (EM) of \\textbf{0.66137} on the private test phase. For research purpose, our source code is available at \\url{https://github.com/NamCyan/MRC\\_VLSP2021}","PeriodicalId":416488,"journal":{"name":"VNU Journal of Science: Computer Science and Communication Engineering","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"VNU Journal of Science: Computer Science and Communication Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25073/2588-1086/vnucsce.346","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, there are multiple systems (eg. search engines and dialogue systems) that require machines to be able to read and understand human text to serve several tasks in application. Machine Reading Comprehension (MRC) has posed a challenge to the Natural Language Processing (NLP) community in teaching machines to understand the meaning of human text in order to answer questions provided. Specifically in this challenge, the dataset contains questions that can be unanswerable, otherwise the answers can be extracted from the given passages. To deal with this challenge, our works mainly based on a recent approach, known as Retrospective Reader, to confronting unanswerable questions. Additionally, we focuses on enhancing the ability of answer extraction by applying properly attention mechanism and improving the representation ability through semantic information. Besides, we also present an ensemble way to acquire significant improvement in results provided by single models. Our method achieves 1$^{st}$ place on Vietnamese MRC shared task at the $8^{th}$ International Workshop on Vietnamese Language and Speech Processing (VLSP) with F1-score of \textbf{0.77241} and exact match (EM) of \textbf{0.66137} on the private test phase. For research purpose, our source code is available at \url{https://github.com/NamCyan/MRC\_VLSP2021}
ViMRC - VLSP 2021:改进越南语机器阅读理解的回顾性阅读器
近年来,有多种系统(例如。搜索引擎和对话系统)要求机器能够阅读和理解人类文本,以服务于应用程序中的若干任务。机器阅读理解(MRC)对自然语言处理(NLP)领域提出了挑战,即教机器理解人类文本的含义以回答所提供的问题。具体来说,在这个挑战中,数据集包含无法回答的问题,否则可以从给定的段落中提取答案。为了应对这一挑战,我们的作品主要基于最近的一种方法,即回顾性读者,来面对无法回答的问题。此外,我们着重于通过适当的注意机制来提高答案抽取能力,并通过语义信息来提高答案的表示能力。此外,我们还提出了一种集成方法,以获得单一模型提供的结果的显着改进。我们的方法在$8^{th}$越南语语言和语音处理(VLSP)国际研讨会上的越南语MRC共享任务中获得了1 $^{st}$的成绩,f1得分为\textbf{0.77241},在私人测试阶段的精确匹配(EM)为\textbf{0.66137}。出于研究目的,我们的源代码可在 \url{https://github.com/NamCyan/MRC\_VLSP2021}
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信