L2 英语口语句法复杂性:数据预处理问题、自动分析的可靠性以及熟练程度、第一语言背景和话题的影响

Minjin Kim, Xiaofei Lu
{"title":"L2 英语口语句法复杂性:数据预处理问题、自动分析的可靠性以及熟练程度、第一语言背景和话题的影响","authors":"Minjin Kim, Xiaofei Lu","doi":"10.1111/modl.12907","DOIUrl":null,"url":null,"abstract":"The effects of learner‐ and task‐related variables on second language (L2) writing syntactic complexity (SC) have been extensively investigated. However, previous research has rarely assessed the reliability of computational tools for analyzing the SC of L2 spoken production, and we know less about the effects of such variables on L2 speaking SC. Using data from the International Corpus Network of Asian Learners of English, this study explores data preprocessing issues for preparing L2 English speech samples for automated SC analysis, evaluates the reliability of L2 Syntactic Complexity Analyzer on preprocessed L2 English speech samples, and examines the effects of proficiency, first language (L1) background, and topic on L2 speaking SC. Our manual analysis of 30 random speech samples identified several issues that can be addressed through preprocessing to improve the accuracy of automated SC analysis. Results from multiple linear mixed‐effects models revealed significant effects of proficiency, L1 background, and topic on the mean length of clause, the number of complex AS‐units per AS‐unit, and the number of dependent clauses and complex nominals per clause in L2 learners’ spoken production. Our findings have useful implications for L2 speaking pedagogy and assessment as well as future L2 speaking SC research.","PeriodicalId":510718,"journal":{"name":"The Modern Language Journal","volume":"74 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"L2 English speaking syntactic complexity: Data preprocessing issues, reliability of automated analysis, and the effects of proficiency, L1 background, and topic\",\"authors\":\"Minjin Kim, Xiaofei Lu\",\"doi\":\"10.1111/modl.12907\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The effects of learner‐ and task‐related variables on second language (L2) writing syntactic complexity (SC) have been extensively investigated. However, previous research has rarely assessed the reliability of computational tools for analyzing the SC of L2 spoken production, and we know less about the effects of such variables on L2 speaking SC. Using data from the International Corpus Network of Asian Learners of English, this study explores data preprocessing issues for preparing L2 English speech samples for automated SC analysis, evaluates the reliability of L2 Syntactic Complexity Analyzer on preprocessed L2 English speech samples, and examines the effects of proficiency, first language (L1) background, and topic on L2 speaking SC. Our manual analysis of 30 random speech samples identified several issues that can be addressed through preprocessing to improve the accuracy of automated SC analysis. Results from multiple linear mixed‐effects models revealed significant effects of proficiency, L1 background, and topic on the mean length of clause, the number of complex AS‐units per AS‐unit, and the number of dependent clauses and complex nominals per clause in L2 learners’ spoken production. Our findings have useful implications for L2 speaking pedagogy and assessment as well as future L2 speaking SC research.\",\"PeriodicalId\":510718,\"journal\":{\"name\":\"The Modern Language Journal\",\"volume\":\"74 5\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Modern Language Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1111/modl.12907\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Modern Language Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1111/modl.12907","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

与学习者和任务相关的变量对第二语言(L2)写作句法复杂性(SC)的影响已被广泛研究。然而,以往的研究很少评估用于分析第二语言口语句法复杂性的计算工具的可靠性,我们对这些变量对第二语言口语句法复杂性的影响也知之甚少。本研究利用亚洲英语学习者国际语料库网络的数据,探讨了为自动 SC 分析准备 L2 英语口语样本的数据预处理问题,评估了 L2 句法复杂性分析器在预处理 L2 英语口语样本上的可靠性,并研究了熟练程度、第一语言(L1)背景和话题对 L2 口语 SC 的影响。我们对 30 个随机语音样本进行了人工分析,发现了几个可以通过预处理来提高自动 SC 分析准确性的问题。多重线性混合效应模型的结果显示,在 L2 学习者的口语表达中,熟练程度、L1 背景和话题对句子的平均长度、每个 AS 单元的复杂 AS 数量以及每个句子的从句和复杂名词的数量都有显著影响。我们的研究结果对 L2 口语教学法和评估以及未来的 L2 口语 SC 研究都有有益的启示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

L2 English speaking syntactic complexity: Data preprocessing issues, reliability of automated analysis, and the effects of proficiency, L1 background, and topic

L2 English speaking syntactic complexity: Data preprocessing issues, reliability of automated analysis, and the effects of proficiency, L1 background, and topic
The effects of learner‐ and task‐related variables on second language (L2) writing syntactic complexity (SC) have been extensively investigated. However, previous research has rarely assessed the reliability of computational tools for analyzing the SC of L2 spoken production, and we know less about the effects of such variables on L2 speaking SC. Using data from the International Corpus Network of Asian Learners of English, this study explores data preprocessing issues for preparing L2 English speech samples for automated SC analysis, evaluates the reliability of L2 Syntactic Complexity Analyzer on preprocessed L2 English speech samples, and examines the effects of proficiency, first language (L1) background, and topic on L2 speaking SC. Our manual analysis of 30 random speech samples identified several issues that can be addressed through preprocessing to improve the accuracy of automated SC analysis. Results from multiple linear mixed‐effects models revealed significant effects of proficiency, L1 background, and topic on the mean length of clause, the number of complex AS‐units per AS‐unit, and the number of dependent clauses and complex nominals per clause in L2 learners’ spoken production. Our findings have useful implications for L2 speaking pedagogy and assessment as well as future L2 speaking SC research.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信