Cloze Evaluation for Deeper Understanding of Commonsense Stories in Indonesian

Fajri Koto, Timothy Baldwin, Jey Han Lau
Published in: Proceedings of the First Workshop on Commonsense Representation and Reasoning (CSRR 2022)
DOI: 10.18653/v1/2022.csrr-1.2 (https://doi.org/10.18653/v1/2022.csrr-1.2)
Citation count: 3

Abstract

Story comprehension that involves complex causal and temporal relations is a critical task in NLP, but previous studies have focused predominantly on English, leaving open the question of how the findings generalize to other languages, such as Indonesian. In this paper, we follow the Story Cloze Test framework of Mostafazadeh et al. (2016) in evaluating story understanding in Indonesian, by constructing a four-sentence story with one correct ending and one incorrect ending. To investigate commonsense knowledge acquisition in language models, we experiment with: (1) a classification task to predict the correct ending; and (2) a generation task to complete the story with a single sentence. We investigate these tasks in two settings: (i) monolingual training and (ii) zero-shot cross-lingual transfer between Indonesian and English.
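
The classification setup described in the abstract (a four-sentence story, two candidate endings, predict the correct one) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: `ClozeItem`, `predict`, `accuracy`, and the toy `overlap_score` scorer are hypothetical and are not the authors' code, models, or data. In practice the scorer would be a trained language model, and the example story below is invented, not taken from the dataset.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass(frozen=True)
class ClozeItem:
    """One Story Cloze instance (hypothetical container, not the paper's format)."""
    context: Tuple[str, str, str, str]  # the four-sentence story
    endings: Tuple[str, str]            # one correct and one incorrect ending
    label: int                          # index (0 or 1) of the correct ending


Scorer = Callable[[str, str], float]    # (story text, ending) -> plausibility score


def predict(item: ClozeItem, score: Scorer) -> int:
    """Return the index of the ending the scorer rates highest (ties pick the first)."""
    story = " ".join(item.context)
    scores = [score(story, ending) for ending in item.endings]
    return max(range(len(scores)), key=scores.__getitem__)


def accuracy(items: Sequence[ClozeItem], score: Scorer) -> float:
    """Fraction of items for which the predicted ending matches the label."""
    if not items:
        raise ValueError("accuracy is undefined for an empty item list")
    return sum(predict(it, score) == it.label for it in items) / len(items)


def overlap_score(story: str, ending: str) -> float:
    """Toy stand-in scorer: number of ending tokens that also occur in the story."""
    story_tokens = set(story.lower().split())
    return float(sum(tok in story_tokens for tok in ending.lower().split()))


# Invented example item, used only to exercise the functions above.
example = ClozeItem(
    context=(
        "Rina planted a mango seed.",
        "She watered the seed every day.",
        "A small sprout appeared.",
        "Rina kept watering the sprout.",
    ),
    endings=("Rina was happy the seed grew.", "The bus arrived late."),
    label=0,
)
print(accuracy([example], overlap_score))
```

Keeping the scorer as a plain function argument means the same evaluation loop could be reused for either setting in the abstract, for instance a scorer trained on Indonesian data or one trained on English and applied zero-shot.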