Exploring collocational complexity in L2 Russian: A corpus-driven contrastive analysis

IF 1.3 2区 文学 0 LANGUAGE & LINGUISTICS
Mikhail Kopotev, Aleksandr Klimov, Olesya Kisselev
{"title":"Exploring collocational complexity in L2 Russian: A corpus-driven contrastive analysis","authors":"Mikhail Kopotev, Aleksandr Klimov, Olesya Kisselev","doi":"10.1177/13670069231168247","DOIUrl":null,"url":null,"abstract":"Objective: The objective of this article is to discuss the pedagogical and practical need for automated assessment tools that enable teachers, researchers, and other language practitioners to relatively quickly and automatically assess the general proficiency of second language (L2) speakers according to a number of different linguistic parameters, specifically the use of collocations. Introduction: The Introduction discusses existing approaches to the concept of collocation and its role in L2 corpus studies. Methods: In the main part, Toward the automatic detection of collocational strength, we describe an approach to assessing collocations in Russian L2 written production, which is based on the corpus-driven contrastive analysis. In brief, this approach proposes ranking L2 collocations in accordance with the “collocational scores” of bigrams extracted from a reference (L1) corpus, with the assumption that the greater number of native-like bigrams in an L2 text and the higher collocational scores associated with them, the more proficient is the learner who created the text. To test this hypothesis, we conducted a series of experiments based on an L1 reference corpus and split the L2 datasets into four proficiency groups (Beginner, Intermediate, Advanced, and Superior). Results and evaluation: The findings, presented in the section Results and evaluation of the methods, established an empirical minimum and maximum of mean reciprocal rank (MRR) values, and the variation of the MRR across the four proficiency levels. In the last sections, we discuss shortcomings of the methods and propose further steps for overcoming them. We believe that the proposed method for assessing the collocational skill of Russian L2 writers can be successfully integrated into a bigger system capable of multifaceted automatic assessment of Russian learner texts.","PeriodicalId":47574,"journal":{"name":"International Journal of Bilingualism","volume":"23 1","pages":"0"},"PeriodicalIF":1.3000,"publicationDate":"2023-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Bilingualism","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/13670069231168247","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: The objective of this article is to discuss the pedagogical and practical need for automated assessment tools that enable teachers, researchers, and other language practitioners to relatively quickly and automatically assess the general proficiency of second language (L2) speakers according to a number of different linguistic parameters, specifically the use of collocations. Introduction: The Introduction discusses existing approaches to the concept of collocation and its role in L2 corpus studies. Methods: In the main part, Toward the automatic detection of collocational strength, we describe an approach to assessing collocations in Russian L2 written production, which is based on the corpus-driven contrastive analysis. In brief, this approach proposes ranking L2 collocations in accordance with the “collocational scores” of bigrams extracted from a reference (L1) corpus, with the assumption that the greater number of native-like bigrams in an L2 text and the higher collocational scores associated with them, the more proficient is the learner who created the text. To test this hypothesis, we conducted a series of experiments based on an L1 reference corpus and split the L2 datasets into four proficiency groups (Beginner, Intermediate, Advanced, and Superior). Results and evaluation: The findings, presented in the section Results and evaluation of the methods, established an empirical minimum and maximum of mean reciprocal rank (MRR) values, and the variation of the MRR across the four proficiency levels. In the last sections, we discuss shortcomings of the methods and propose further steps for overcoming them. We believe that the proposed method for assessing the collocational skill of Russian L2 writers can be successfully integrated into a bigger system capable of multifaceted automatic assessment of Russian learner texts.
二语俄语搭配复杂性研究:语料库驱动的对比分析
目的:本文的目的是讨论自动化评估工具的教学和实践需求,使教师,研究人员和其他语言从业者能够根据许多不同的语言参数,特别是搭配的使用,相对快速和自动地评估第二语言(L2)使用者的一般熟练程度。引言:引言部分讨论了搭配概念的现有研究方法及其在二语语料库研究中的作用。方法:在主要部分,为了自动检测搭配强度,我们描述了一种基于语料库驱动的对比分析的方法来评估俄语第二语言书面生产中的搭配。简而言之,该方法提出根据从参考(L1)语料库中提取的双元词的“搭配分数”对第二语言搭配进行排名,并假设第二语言文本中类似母语的双元词越多,搭配分数越高,则创造文本的学习者就越熟练。为了验证这一假设,我们基于一个L1参考语料库进行了一系列实验,并将L2数据集分为四个熟练程度组(初级、中级、高级和高级)。结果和评价:在方法的结果和评价一节中提出的研究结果,建立了平均倒数秩(MRR)值的经验最小值和最大值,以及MRR在四个熟练程度上的变化。在最后几节中,我们讨论了这些方法的缺点,并提出了克服它们的进一步步骤。我们相信,所提出的评估俄罗斯第二语言作者搭配技能的方法可以成功地整合到一个更大的系统中,该系统能够对俄罗斯学习者的文本进行多方面的自动评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.20
自引率
6.70%
发文量
76
期刊介绍: The International Journal of Bilingualism is an international forum for the dissemination of original research on the linguistic, psychological, neurological, and social issues which emerge from language contact. While stressing interdisciplinary links, the focus of the Journal is on the language behavior of the bi- and multilingual individual.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信