Title: Automated analysis of common errors in L2 learner production: Prototype web application development
Author: Atsushi Mizumoto
DOI: 10.1017/s0272263125100934
Journal: Studies in Second Language Acquisition (Q1, Linguistics; Impact Factor 4.2)
Publication date: 2025-06-19
Publication type: Journal Article
Citations: 0
Abstract
This research report presents the development and validation of Auto Error Analyzer, a prototype web application designed to automate the calculation of accuracy and related metrics for measuring second language (L2) production. Building on recent advancements in natural language processing (NLP) and artificial intelligence (AI), Auto Error Analyzer introduces an automated accuracy measurement component, bridging a gap in existing assessment tools, which have traditionally required human judgment for accuracy evaluation. By using a state-of-the-art generative AI model (Llama 3.3) for error detection, Auto Error Analyzer analyzes L2 texts efficiently and cost-effectively, producing accuracy metrics (e.g., errors per 100 words). Validation results show high agreement between the tool's error counts and human rater judgments (r = .94), with similarly high micro-average precision and recall in error detection (.96 and .94, respectively; F1 = .95), and its T-unit and clause counts matched the output of established tools such as L2SCA. Developed under open science principles to ensure transparency and replicability, the tool aims to support researchers and educators while emphasizing the complementary role of human expertise in language assessment. The possibilities of Auto Error Analyzer for efficient and scalable error analysis, as well as its limitations in detecting context-dependent and first-language (L1)-influenced errors, are also discussed.
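The metrics named in the abstract (errors per 100 words; micro-averaged precision, recall, and F1 for error detection) can be sketched as below. This is an illustrative computation only, not the tool's actual code; all function names and the example counts are hypothetical.

```python
def errors_per_100_words(error_count: int, word_count: int) -> float:
    """Errors per 100 words, a common accuracy metric for L2 production."""
    return 100.0 * error_count / word_count


def micro_prf(tp: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Micro-averaged precision, recall, and F1 computed from counts
    pooled across all texts: true positives (tp), false positives (fp),
    and false negatives (fn)."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical example: 12 detected errors in a 300-word essay
rate = errors_per_100_words(12, 300)  # 4.0 errors per 100 words

# Hypothetical pooled detection counts
p, r, f1 = micro_prf(tp=94, fp=4, fn=6)
```

Micro-averaging pools the raw counts before computing the ratios, so texts with more errors weigh more heavily than they would under macro-averaging, which averages per-text scores instead.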
Journal description:
Studies in Second Language Acquisition is a refereed journal of international scope devoted to the scientific discussion of the acquisition or use of non-native and heritage languages. Each volume (five issues) contains research articles of a quantitative, qualitative, or mixed-methods nature, in addition to essays on current theoretical matters. Other rubrics include shorter articles such as Replication Studies, Critical Commentaries, and Research Reports.