回到语法：利用语法纠错自动评估 L2 口语水平

IF 2.4 3区计算机科学 Q2 ACOUSTICS

Speech Communication Pub Date : 2023-12-12 DOI:10.1016/j.specom.2023.103025

Stefano Bannò , Marco Matassoni

{"title":"回到语法：利用语法纠错自动评估 L2 口语水平","authors":"Stefano Bannò , Marco Matassoni","doi":"10.1016/j.specom.2023.103025","DOIUrl":null,"url":null,"abstract":"<div><p>In an interconnected world where English has become the lingua franca of culture, entertainment, business, and academia, the growing demand for learning English as a second language (L2) has led to an increasing interest in automatic approaches for assessing spoken language proficiency. In this regard, mastering grammar is one of the key elements of L2 proficiency.</p><p>In this paper, we illustrate an approach to L2 proficiency assessment and feedback based on grammatical features using only publicly available data for training and a small proprietary dataset for testing. Specifically, we implement it in a cascaded fashion, starting from learners’ utterances, investigating disfluency detection, exploring spoken grammatical error correction (GEC), and finally using grammatical features extracted with the spoken GEC module for proficiency assessment.</p><p>We compare this grading system to a BERT-based grader and find that the two systems have similar performances when using manual transcriptions, but their combinations bring significant improvements to the assessment performance and enhance validity and explainability. Instead, when using automatic transcriptions, the GEC-based grader obtains better results than the BERT-based grader.</p><p>The results obtained are discussed and evaluated with appropriate metrics across the proposed pipeline.</p></div>","PeriodicalId":49485,"journal":{"name":"Speech Communication","volume":"157 ","pages":"Article 103025"},"PeriodicalIF":2.4000,"publicationDate":"2023-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Back to grammar: Using grammatical error correction to automatically assess L2 speaking proficiency\",\"authors\":\"Stefano Bannò , Marco Matassoni\",\"doi\":\"10.1016/j.specom.2023.103025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In an interconnected world where English has become the lingua franca of culture, entertainment, business, and academia, the growing demand for learning English as a second language (L2) has led to an increasing interest in automatic approaches for assessing spoken language proficiency. In this regard, mastering grammar is one of the key elements of L2 proficiency.</p><p>In this paper, we illustrate an approach to L2 proficiency assessment and feedback based on grammatical features using only publicly available data for training and a small proprietary dataset for testing. Specifically, we implement it in a cascaded fashion, starting from learners’ utterances, investigating disfluency detection, exploring spoken grammatical error correction (GEC), and finally using grammatical features extracted with the spoken GEC module for proficiency assessment.</p><p>We compare this grading system to a BERT-based grader and find that the two systems have similar performances when using manual transcriptions, but their combinations bring significant improvements to the assessment performance and enhance validity and explainability. Instead, when using automatic transcriptions, the GEC-based grader obtains better results than the BERT-based grader.</p><p>The results obtained are discussed and evaluated with appropriate metrics across the proposed pipeline.</p></div>\",\"PeriodicalId\":49485,\"journal\":{\"name\":\"Speech Communication\",\"volume\":\"157 \",\"pages\":\"Article 103025\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2023-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Speech Communication\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167639323001590\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Speech Communication","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167639323001590","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}

引用次数: 0

摘要

在一个相互联系的世界里，英语已成为文化、娱乐、商业和学术界的通用语言，人们对英语作为第二语言（L2）的学习需求日益增长，这导致人们对口语能力自动评估方法的兴趣与日俱增。在本文中，我们展示了一种基于语法特征的 L2 能力评估和反馈方法，该方法仅使用公开数据进行训练，并使用一个小型专有数据集进行测试。具体来说，我们以级联的方式实施这种方法，从学习者的语篇开始，研究不流畅检测，探索口语语法错误纠正（GEC），最后使用口语语法错误纠正模块提取的语法特征进行能力评估。我们将这种评分系统与基于 BERT 的评分系统进行了比较，发现这两种系统在使用人工转录时具有相似的性能，但它们的组合能显著提高评估性能，并增强有效性和可解释性。相反，在使用自动转录时，基于 GEC 的评分器比基于 BERT 的评分器获得了更好的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Back to grammar: Using grammatical error correction to automatically assess L2 speaking proficiency

查看原文本刊更多论文

Back to grammar: Using grammatical error correction to automatically assess L2 speaking proficiency

In an interconnected world where English has become the lingua franca of culture, entertainment, business, and academia, the growing demand for learning English as a second language (L2) has led to an increasing interest in automatic approaches for assessing spoken language proficiency. In this regard, mastering grammar is one of the key elements of L2 proficiency.

In this paper, we illustrate an approach to L2 proficiency assessment and feedback based on grammatical features using only publicly available data for training and a small proprietary dataset for testing. Specifically, we implement it in a cascaded fashion, starting from learners’ utterances, investigating disfluency detection, exploring spoken grammatical error correction (GEC), and finally using grammatical features extracted with the spoken GEC module for proficiency assessment.

We compare this grading system to a BERT-based grader and find that the two systems have similar performances when using manual transcriptions, but their combinations bring significant improvements to the assessment performance and enhance validity and explainability. Instead, when using automatic transcriptions, the GEC-based grader obtains better results than the BERT-based grader.

The results obtained are discussed and evaluated with appropriate metrics across the proposed pipeline.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Speech Communication 工程技术-计算机：跨学科应用

CiteScore

6.80

自引率

6.20%

发文量

审稿时长

19.2 weeks

期刊介绍： Speech Communication is an interdisciplinary journal whose primary objective is to fulfil the need for the rapid dissemination and thorough discussion of basic and applied research results. The journal''s primary objectives are: • to present a forum for the advancement of human and human-machine speech communication science; • to stimulate cross-fertilization between different fields of this domain; • to contribute towards the rapid and wide diffusion of scientifically sound contributions in this domain.