Heterogeneous models ensemble for Chinese grammatical error correction

International Conference on Artificial Intelligence, Virtual Reality, and Visualization Pub Date : 2023-03-01 DOI:10.1117/12.2667512

Yeling Liang, Lin Li

引用次数: 1

Abstract

Grammatical error correction (GEC) aims to automatically identify and correct grammatical errors in a sentence. Neural machine translation (NMT) models are the mainstream approaches for the GEC task. However, the models require a large amount of data to be adequately trained, the variety of grammatical errors and the dependencies between errors in a sentence make it difficult for a single NMT model to correct multiple errors at once. In the work, we propose an ensemble approach for heterogeneous models, which integrates rule-based, NMT, and pre-trained language model-based GEC models through the recurrent generation approach, the approach can exploit the strengths of each model and cover a wider range of errors in a sentence. We also mitigate the scarcity of task-specific data for the GEC task through the data augmentation approach. We conduct extensive experiments on the NLPCC2018 shared task dataset to demonstrate the effectiveness of our proposed methods, and reaches the F0.5 value of 37.26, outperforming the best model in the shared task.

查看原文本刊更多论文

汉语语法错误纠错的异构模型集成

语法错误纠正(GEC)旨在自动识别和纠正句子中的语法错误。神经机器翻译(NMT)模型是解决GEC任务的主流方法。然而，这些模型需要大量的数据进行充分的训练，语法错误的多样性以及句子中错误之间的依赖性使得单个NMT模型很难同时纠正多个错误。在这项工作中，我们提出了一种异构模型的集成方法，该方法通过循环生成方法集成了基于规则的、NMT的和基于预训练语言模型的GEC模型，该方法可以利用每个模型的优势，并覆盖句子中更大范围的错误。我们还通过数据增强方法缓解了GEC任务特定于任务的数据的稀缺性。我们在NLPCC2018共享任务数据集上进行了大量的实验，证明了我们提出的方法的有效性，F0.5值达到37.26，优于共享任务中的最佳模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference on Artificial Intelligence, Virtual Reality, and Visualization

自引率

0.00%

发文量