{"title":"AlphaLexChinese:测量汉语文本的词汇复杂性及其对二语写作分数的预测效度","authors":"Haobo Zhang, Lei Lei","doi":"10.1016/j.system.2025.103809","DOIUrl":null,"url":null,"abstract":"<div><div>The study introduces AlphaLexChinese (ALC), the first tool that is designed to measure the lexical complexity of Chinese texts. ALC incorporates 50 metrics across three dimensions, i.e., lexical density, lexical sophistication, and lexical variation. To test the applicability and validity of ALC, we analyzed 11,485 scored essays from a corpus of L2 Chinese writing. The multiple regression analysis revealed that nine metrics significantly predicted the scores of the L2 Chinese writing, which accounts for 14.2 % of the variance in scores. These metrics include three metrics of lexical sophistication (i.e., the Mean CPG Score, the Moving Average Verb Sophistication, and the Mean AoA Score), and six metrics of lexical variation (i.e., the Moving Average Lexical Word Variation, the Measure of Textual Lexical Diversity, the Moving Average Entropy of Lexical Words, the Moving Average TTR, the Moving Average Verb Variation 2, and the Moving Average Verb Variation 1). Pedagogical and research implications of ALC are discussed from the perspectives of pedagogical meanings of the metrics, tracking diachronic changes in language, L1 and L2 Chinese language teaching and learning, and possible applications in automated essay scoring systems.</div></div>","PeriodicalId":48185,"journal":{"name":"System","volume":"134 ","pages":"Article 103809"},"PeriodicalIF":5.6000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AlphaLexChinese: Measuring lexical complexity in Chinese texts and its predictive validity for L2 writing scores\",\"authors\":\"Haobo Zhang, Lei Lei\",\"doi\":\"10.1016/j.system.2025.103809\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The study introduces AlphaLexChinese (ALC), the first tool that is designed to measure the lexical complexity of Chinese texts. ALC incorporates 50 metrics across three dimensions, i.e., lexical density, lexical sophistication, and lexical variation. To test the applicability and validity of ALC, we analyzed 11,485 scored essays from a corpus of L2 Chinese writing. The multiple regression analysis revealed that nine metrics significantly predicted the scores of the L2 Chinese writing, which accounts for 14.2 % of the variance in scores. These metrics include three metrics of lexical sophistication (i.e., the Mean CPG Score, the Moving Average Verb Sophistication, and the Mean AoA Score), and six metrics of lexical variation (i.e., the Moving Average Lexical Word Variation, the Measure of Textual Lexical Diversity, the Moving Average Entropy of Lexical Words, the Moving Average TTR, the Moving Average Verb Variation 2, and the Moving Average Verb Variation 1). Pedagogical and research implications of ALC are discussed from the perspectives of pedagogical meanings of the metrics, tracking diachronic changes in language, L1 and L2 Chinese language teaching and learning, and possible applications in automated essay scoring systems.</div></div>\",\"PeriodicalId\":48185,\"journal\":{\"name\":\"System\",\"volume\":\"134 \",\"pages\":\"Article 103809\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2025-08-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"System\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0346251X25002192\",\"RegionNum\":1,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"EDUCATION & EDUCATIONAL RESEARCH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"System","FirstCategoryId":"98","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0346251X25002192","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
AlphaLexChinese: Measuring lexical complexity in Chinese texts and its predictive validity for L2 writing scores
The study introduces AlphaLexChinese (ALC), the first tool that is designed to measure the lexical complexity of Chinese texts. ALC incorporates 50 metrics across three dimensions, i.e., lexical density, lexical sophistication, and lexical variation. To test the applicability and validity of ALC, we analyzed 11,485 scored essays from a corpus of L2 Chinese writing. The multiple regression analysis revealed that nine metrics significantly predicted the scores of the L2 Chinese writing, which accounts for 14.2 % of the variance in scores. These metrics include three metrics of lexical sophistication (i.e., the Mean CPG Score, the Moving Average Verb Sophistication, and the Mean AoA Score), and six metrics of lexical variation (i.e., the Moving Average Lexical Word Variation, the Measure of Textual Lexical Diversity, the Moving Average Entropy of Lexical Words, the Moving Average TTR, the Moving Average Verb Variation 2, and the Moving Average Verb Variation 1). Pedagogical and research implications of ALC are discussed from the perspectives of pedagogical meanings of the metrics, tracking diachronic changes in language, L1 and L2 Chinese language teaching and learning, and possible applications in automated essay scoring systems.
期刊介绍:
This international journal is devoted to the applications of educational technology and applied linguistics to problems of foreign language teaching and learning. Attention is paid to all languages and to problems associated with the study and teaching of English as a second or foreign language. The journal serves as a vehicle of expression for colleagues in developing countries. System prefers its contributors to provide articles which have a sound theoretical base with a visible practical application which can be generalized. The review section may take up works of a more theoretical nature to broaden the background.