{"title":"Language assessment in the era of generative artificial intelligence: Opportunities, challenges, and future directions","authors":"Ping-Lin Chuang , Xun Yan","doi":"10.1016/j.system.2025.103846","DOIUrl":null,"url":null,"abstract":"<div><div>Recent breakthroughs in generative artificial intelligence (GenAI) have shaken the field of language assessment in unprecedented ways. On one hand, large-scale testing companies and organizations have spearheaded AI-based assessment models, infusing GenAI elements into various test development and quality management procedures. On the other hand, in local institutional contexts, although the interest in incorporating AI-based elements in instructional programs has also been soaring, little discussion has been made regarding the affordances and affordability of AI-based assessment systems in local contexts, and the impact of AI on language assessment across contexts remains underexplored. This paper presents a systematic review of 77 articles published on GenAI use in language testing and assessment, to synthesize assessment challenges, opportunities, and solutions involving GenAI. We review issues of GenAI in the domains of reliability, validity, fairness, and practicality in large-scale, local, and classroom-based assessments. Based on the findings from the systematic review, we identify principles and best practices of language assessment in the age of GenAI. Additionally, we provide future directions for assessment research and practice related to GenAI, with a focus on impact and sustainability of GenAI in language assessment across contexts. We caution researchers and practitioners about the danger of following the trend of incorporating GenAI in an unmeasured and unselective manner.</div></div>","PeriodicalId":48185,"journal":{"name":"System","volume":"134 ","pages":"Article 103846"},"PeriodicalIF":5.6000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"System","FirstCategoryId":"98","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0346251X25002568","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0
Abstract
Recent breakthroughs in generative artificial intelligence (GenAI) have shaken the field of language assessment in unprecedented ways. On one hand, large-scale testing companies and organizations have spearheaded AI-based assessment models, infusing GenAI elements into various test development and quality management procedures. On the other hand, in local institutional contexts, although the interest in incorporating AI-based elements in instructional programs has also been soaring, little discussion has been made regarding the affordances and affordability of AI-based assessment systems in local contexts, and the impact of AI on language assessment across contexts remains underexplored. This paper presents a systematic review of 77 articles published on GenAI use in language testing and assessment, to synthesize assessment challenges, opportunities, and solutions involving GenAI. We review issues of GenAI in the domains of reliability, validity, fairness, and practicality in large-scale, local, and classroom-based assessments. Based on the findings from the systematic review, we identify principles and best practices of language assessment in the age of GenAI. Additionally, we provide future directions for assessment research and practice related to GenAI, with a focus on impact and sustainability of GenAI in language assessment across contexts. We caution researchers and practitioners about the danger of following the trend of incorporating GenAI in an unmeasured and unselective manner.
期刊介绍:
This international journal is devoted to the applications of educational technology and applied linguistics to problems of foreign language teaching and learning. Attention is paid to all languages and to problems associated with the study and teaching of English as a second or foreign language. The journal serves as a vehicle of expression for colleagues in developing countries. System prefers its contributors to provide articles which have a sound theoretical base with a visible practical application which can be generalized. The review section may take up works of a more theoretical nature to broaden the background.