Efficient Grammatical Error Correction with Hierarchical Error Detections and Correction

Fayu Pan, Bin Cao
{"title":"Efficient Grammatical Error Correction with Hierarchical Error Detections and Correction","authors":"Fayu Pan, Bin Cao","doi":"10.1109/ICWS53863.2021.00073","DOIUrl":null,"url":null,"abstract":"Noisy text is common in semantic services and can have bad effects. Grammatical Error Correction (GEC) can be used to improve text quality, but traditional neural machine translation approaches need hundreds of milliseconds to correct a single text, which is unacceptable to time-sensitive services. To improve the efficiency of GEC, we choose to detect errors first and then make corrections. We present an intuitive multitask learning approach by checking if the text contains errors, finding errors' positions, and finally generating corrections. Two classifiers are introduced to serially detect sentence-level and token-level errors as errors only take a few parts in common corpora. Different from traditional approaches, we adapt a non-autoregressive decoder and only generate needed words to correct those detected errors, making the correction stage efficient. Experiments show that our approach can be ten times faster than the traditional approach in inference, and can achieve a comparable GEC performance.","PeriodicalId":213320,"journal":{"name":"2021 IEEE International Conference on Web Services (ICWS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Web Services (ICWS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWS53863.2021.00073","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Noisy text is common in semantic services and can have bad effects. Grammatical Error Correction (GEC) can be used to improve text quality, but traditional neural machine translation approaches need hundreds of milliseconds to correct a single text, which is unacceptable to time-sensitive services. To improve the efficiency of GEC, we choose to detect errors first and then make corrections. We present an intuitive multitask learning approach by checking if the text contains errors, finding errors' positions, and finally generating corrections. Two classifiers are introduced to serially detect sentence-level and token-level errors as errors only take a few parts in common corpora. Different from traditional approaches, we adapt a non-autoregressive decoder and only generate needed words to correct those detected errors, making the correction stage efficient. Experiments show that our approach can be ten times faster than the traditional approach in inference, and can achieve a comparable GEC performance.
基于层次错误检测和纠错的高效语法纠错
噪声文本在语义服务中很常见,可能会产生不良影响。语法错误纠正(GEC)可以用来提高文本质量,但传统的神经机器翻译方法需要数百毫秒来纠正单个文本,这对于时间敏感的服务来说是不可接受的。为了提高GEC的效率,我们选择先检测错误再进行修正。我们提出了一种直观的多任务学习方法,通过检查文本是否包含错误,找到错误的位置,最后生成更正。由于错误在常见语料库中只占很少的部分,因此引入了两个分类器来串行检测句子级和记号级错误。与传统方法不同,我们采用非自回归解码器,只生成需要的单词来纠正检测到的错误,使纠正阶段效率高。实验表明,该方法的推理速度比传统方法快10倍,并且可以达到相当的GEC性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信