ViMRC - VLSP 2021: Improving Retrospective Reader for Vietnamese Machine Reading Comprehension

VNU Journal of Science: Computer Science and Communication Engineering Pub Date : 2022-12-16 DOI:10.25073/2588-1086/vnucsce.346

Quan Quoc Chu, Vi Van Ngo, N. H. Le, Duc Sy Nguyen

{"title":"ViMRC - VLSP 2021: Improving Retrospective Reader for Vietnamese Machine Reading Comprehension","authors":"Quan Quoc Chu, Vi Van Ngo, N. H. Le, Duc Sy Nguyen","doi":"10.25073/2588-1086/vnucsce.346","DOIUrl":null,"url":null,"abstract":"In recent years, there are multiple systems (eg. search engines and dialogue systems) that require machines to be able to read and understand human text to serve several tasks in application. Machine Reading Comprehension (MRC) has posed a challenge to the Natural Language Processing (NLP) community in teaching machines to understand the meaning of human text in order to answer questions provided. Specifically in this challenge, the dataset contains questions that can be unanswerable, otherwise the answers can be extracted from the given passages. To deal with this challenge, our works mainly based on a recent approach, known as Retrospective Reader, to confronting unanswerable questions. Additionally, we focuses on enhancing the ability of answer extraction by applying properly attention mechanism and improving the representation ability through semantic information. Besides, we also present an ensemble way to acquire significant improvement in results provided by single models. Our method achieves 1$^{st}$ place on Vietnamese MRC shared task at the $8^{th}$ International Workshop on Vietnamese Language and Speech Processing (VLSP) with F1-score of \\textbf{0.77241} and exact match (EM) of \\textbf{0.66137} on the private test phase. For research purpose, our source code is available at \\url{https://github.com/NamCyan/MRC\\_VLSP2021}","PeriodicalId":416488,"journal":{"name":"VNU Journal of Science: Computer Science and Communication Engineering","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"VNU Journal of Science: Computer Science and Communication Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25073/2588-1086/vnucsce.346","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In recent years, there are multiple systems (eg. search engines and dialogue systems) that require machines to be able to read and understand human text to serve several tasks in application. Machine Reading Comprehension (MRC) has posed a challenge to the Natural Language Processing (NLP) community in teaching machines to understand the meaning of human text in order to answer questions provided. Specifically in this challenge, the dataset contains questions that can be unanswerable, otherwise the answers can be extracted from the given passages. To deal with this challenge, our works mainly based on a recent approach, known as Retrospective Reader, to confronting unanswerable questions. Additionally, we focuses on enhancing the ability of answer extraction by applying properly attention mechanism and improving the representation ability through semantic information. Besides, we also present an ensemble way to acquire significant improvement in results provided by single models. Our method achieves 1$^{st}$ place on Vietnamese MRC shared task at the $8^{th}$ International Workshop on Vietnamese Language and Speech Processing (VLSP) with F1-score of \textbf{0.77241} and exact match (EM) of \textbf{0.66137} on the private test phase. For research purpose, our source code is available at \url{https://github.com/NamCyan/MRC\_VLSP2021}

查看原文本刊更多论文

ViMRC - VLSP 2021:改进越南语机器阅读理解的回顾性阅读器

近年来，有多种系统(例如。搜索引擎和对话系统)要求机器能够阅读和理解人类文本，以服务于应用程序中的若干任务。机器阅读理解(MRC)对自然语言处理(NLP)领域提出了挑战，即教机器理解人类文本的含义以回答所提供的问题。具体来说，在这个挑战中，数据集包含无法回答的问题，否则可以从给定的段落中提取答案。为了应对这一挑战，我们的作品主要基于最近的一种方法，即回顾性读者，来面对无法回答的问题。此外，我们着重于通过适当的注意机制来提高答案抽取能力，并通过语义信息来提高答案的表示能力。此外，我们还提出了一种集成方法，以获得单一模型提供的结果的显着改进。我们的方法在$8^{th}$越南语语言和语音处理(VLSP)国际研讨会上的越南语MRC共享任务中获得了1 $^{st}$的成绩，f1得分为\textbf{0.77241}，在私人测试阶段的精确匹配(EM)为\textbf{0.66137}。出于研究目的，我们的源代码可在 \url{https://github.com/NamCyan/MRC\_VLSP2021}

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

VNU Journal of Science: Computer Science and Communication Engineering

自引率

0.00%

发文量