Cloze Evaluation for Deeper Understanding of Commonsense Stories in Indonesian

Proceedings of the First Workshop on Commonsense Representation and Reasoning (CSRR 2022) Pub Date: 2022 DOI: 10.18653/v1/2022.csrr-1.2

Fajri Koto, Timothy Baldwin, Jey Han Lau

Citations: 3

Abstract

Story comprehension that involves complex causal and temporal relations is a critical task in NLP, but previous studies have focused predominantly on English, leaving open the question of how the findings generalize to other languages, such as Indonesian. In this paper, we follow the Story Cloze Test framework of Mostafazadeh et al. (2016) in evaluating story understanding in Indonesian, constructing four-sentence stories that each come with one correct ending and one incorrect ending. To investigate commonsense knowledge acquisition in language models, we experiment with: (1) a classification task to predict the correct ending; and (2) a generation task to complete the story with a single sentence. We investigate these tasks in two settings: (i) monolingual training and (ii) zero-shot cross-lingual transfer between Indonesian and English.
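
The classification task described above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the names `ClozeItem`, `predict`, `accuracy`, and `overlap_score` are hypothetical, the toy story is invented and not taken from the paper's dataset, and the word-overlap scorer is only a stand-in for a trained language model, which is what the paper actually evaluates.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class ClozeItem:
    """One Story Cloze instance (hypothetical structure, for illustration)."""
    context: List[str]      # the four-sentence story
    endings: Sequence[str]  # two candidate endings
    label: int              # index of the correct ending (0 or 1)


def predict(item: ClozeItem, score: Callable[[str, str], float]) -> int:
    """Return the index of the ending the scorer rates highest."""
    story = " ".join(item.context)
    scores = [score(story, ending) for ending in item.endings]
    return max(range(len(scores)), key=scores.__getitem__)


def accuracy(items: Sequence[ClozeItem],
             score: Callable[[str, str], float]) -> float:
    """Fraction of items for which the predicted ending is the correct one."""
    correct = sum(predict(item, score) == item.label for item in items)
    return correct / len(items)


def overlap_score(story: str, ending: str) -> float:
    """Toy stand-in scorer: share of ending tokens that also occur in the story.

    A real system would replace this with a model-based plausibility score.
    """
    story_tokens = set(story.lower().split())
    ending_tokens = ending.lower().split()
    hits = sum(token in story_tokens for token in ending_tokens)
    return hits / max(len(ending_tokens), 1)


# Invented example, not drawn from the paper's data.
toy_item = ClozeItem(
    context=[
        "Rina bought a new bicycle.",
        "She rode the bicycle to school every day.",
        "One morning the bicycle had a flat tire.",
        "Rina took the bicycle to a repair shop.",
    ],
    endings=[
        "The shop fixed the bicycle tire.",
        "Rina baked a cake for dinner.",
    ],
    label=0,
)

if __name__ == "__main__":
    print(predict(toy_item, overlap_score))
    print(accuracy([toy_item], overlap_score))
```

Passing the scorer as a function keeps the evaluation loop separate from the model, so the same `accuracy` routine could in principle be reused for both the monolingual and the zero-shot cross-lingual settings by swapping in a different scorer.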
