{"title":"Integrating move analysis and sentence reconstruction in automated writing evaluation for L2 academic writers","authors":"Bo-Ren Mau , Hui-Hsien Feng","doi":"10.1016/j.asw.2025.100984","DOIUrl":null,"url":null,"abstract":"<div><div>Artificial intelligence has been widely utilized to assist L2 writers through automated writing evaluation (AWE) systems, which offer grammatical feedback. However, for English academic writing, such feedback is apparently insufficient to address the complexities of academic discourse. While genre-based AWE systems employ move analysis, they offer move detections as corrective feedback (CF) without addressing language use issues and are developed using limited datasets. Additionally, general-purpose large language models (LLMs; e.g., ChatGPT) may lack specialized mechanisms for accurately identifying rhetorical moves and providing genre-specific feedback in academic writing contexts. To address these limitations, this study proposes GURUS, a genre-based AWE system grounded in second language acquisition theories. It provides indirect CF by classifying moves with probabilistic scores, and direct CF through sentence reconstruction. GURUS is implemented as a web-based application using ensemble learning model and transformer-based LLMs. By offering indirect and direct CF, GURUS promotes learner-machine interaction, prompting learners to notice discrepancies between their writing and the reconstructed sentences. GURUS was trained on over one million sentences with OMRC moves. Its classification performance was assessed using <em>F1</em>-score and Brier score; furthermore, semantic and rhetorical production were evaluated using BERTscore and human assessment. The results show that GURUS sufficiently classifies sentence moves and reconstructs sentences while retaining semantic integrity. Given GURUS holds promise in academic writing instruction, this study also discusses its implementation to bolster learners’ genre awareness and proficiency in move-based abstract writing.</div></div>","PeriodicalId":46865,"journal":{"name":"Assessing Writing","volume":"66 ","pages":"Article 100984"},"PeriodicalIF":5.5000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Assessing Writing","FirstCategoryId":"98","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1075293525000716","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0
Abstract
Artificial intelligence has been widely utilized to assist L2 writers through automated writing evaluation (AWE) systems, which offer grammatical feedback. However, for English academic writing, such feedback is apparently insufficient to address the complexities of academic discourse. While genre-based AWE systems employ move analysis, they offer move detections as corrective feedback (CF) without addressing language use issues and are developed using limited datasets. Additionally, general-purpose large language models (LLMs; e.g., ChatGPT) may lack specialized mechanisms for accurately identifying rhetorical moves and providing genre-specific feedback in academic writing contexts. To address these limitations, this study proposes GURUS, a genre-based AWE system grounded in second language acquisition theories. It provides indirect CF by classifying moves with probabilistic scores, and direct CF through sentence reconstruction. GURUS is implemented as a web-based application using ensemble learning model and transformer-based LLMs. By offering indirect and direct CF, GURUS promotes learner-machine interaction, prompting learners to notice discrepancies between their writing and the reconstructed sentences. GURUS was trained on over one million sentences with OMRC moves. Its classification performance was assessed using F1-score and Brier score; furthermore, semantic and rhetorical production were evaluated using BERTscore and human assessment. The results show that GURUS sufficiently classifies sentence moves and reconstructs sentences while retaining semantic integrity. Given GURUS holds promise in academic writing instruction, this study also discusses its implementation to bolster learners’ genre awareness and proficiency in move-based abstract writing.
期刊介绍:
Assessing Writing is a refereed international journal providing a forum for ideas, research and practice on the assessment of written language. Assessing Writing publishes articles, book reviews, conference reports, and academic exchanges concerning writing assessments of all kinds, including traditional (direct and standardised forms of) testing of writing, alternative performance assessments (such as portfolios), workplace sampling and classroom assessment. The journal focuses on all stages of the writing assessment process, including needs evaluation, assessment creation, implementation, and validation, and test development.