Sentiment and Sentence Similarity as Predictors of Integrated and Independent L2 Writing Performance

Acuity Pub Date : 2021-06-28 DOI:10.35974/acuity.v7i2.2529

Kutay Uzun, Ö. Ulum

{"title":"Sentiment and Sentence Similarity as Predictors of Integrated and Independent L2 Writing Performance","authors":"Kutay Uzun, Ö. Ulum","doi":"10.35974/acuity.v7i2.2529","DOIUrl":null,"url":null,"abstract":"This study aimed to utilize sentiment and sentence similarity analyses, two Natural Language Processing techniques, to see if and how well they could predict L2 Writing Performance in integrated and independent task conditions. The data sources were an integrated L2 writing corpus of 185 literary analysis essays and an independent L2 writing corpus of 500 argumentative essays, both of which were compiled in higher education contexts. Both essay groups were scored between 0 and 100. Two Python libraries, TextBlob and SpaCy, were used to generate sentiment and sentence similarity data. Using sentiment (polarity and subjectivity) and sentence similarity variables, regression models were built and 95% prediction intervals were compared for integrated and independent corpora. The results showed that integrated L2 writing performance could be predicted by subjectivity and sentence similarity. However, only subjectivity predicted independent L2 writing performance. The prediction interval of subjectivity for independent writing model was found to be narrower than the same interval for integrated writing. The results show that the sentiment and sentence similarity analysis algorithms can be used to generate complementary data to improve more complex multivariate L2 writing performance prediction models.","PeriodicalId":33744,"journal":{"name":"Acuity","volume":"202 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acuity","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35974/acuity.v7i2.2529","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This study aimed to utilize sentiment and sentence similarity analyses, two Natural Language Processing techniques, to see if and how well they could predict L2 Writing Performance in integrated and independent task conditions. The data sources were an integrated L2 writing corpus of 185 literary analysis essays and an independent L2 writing corpus of 500 argumentative essays, both of which were compiled in higher education contexts. Both essay groups were scored between 0 and 100. Two Python libraries, TextBlob and SpaCy, were used to generate sentiment and sentence similarity data. Using sentiment (polarity and subjectivity) and sentence similarity variables, regression models were built and 95% prediction intervals were compared for integrated and independent corpora. The results showed that integrated L2 writing performance could be predicted by subjectivity and sentence similarity. However, only subjectivity predicted independent L2 writing performance. The prediction interval of subjectivity for independent writing model was found to be narrower than the same interval for integrated writing. The results show that the sentiment and sentence similarity analysis algorithms can be used to generate complementary data to improve more complex multivariate L2 writing performance prediction models.

查看原文本刊更多论文

情感和句子相似度作为综合和独立二语写作表现的预测因子

本研究旨在利用情感和句子相似度分析这两种自然语言处理技术，看看它们是否以及在多大程度上可以预测二语写作在综合和独立任务条件下的表现。数据来源是一个包含185篇文学分析文章的综合第二语言写作语料库和一个包含500篇议论文的独立第二语言写作语料库，两者都是在高等教育背景下编制的。两个作文组的得分都在0到100分之间。两个Python库TextBlob和SpaCy被用来生成情感和句子相似度数据。利用情感(极性和主观性)和句子相似度变量建立回归模型，并对独立语料库和综合语料库的95%预测区间进行比较。结果表明，主观性和句子相似度可以预测二语写作的综合表现。然而，只有主观性可以预测独立的二语写作表现。独立写作模式的主观性预测区间比综合写作模式的主观性预测区间窄。结果表明，情感和句子相似度分析算法可用于生成互补数据，以改进更复杂的多元二语写作性能预测模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Acuity

自引率

0.00%

发文量

审稿时长

8 weeks