{"title":"Efficient Grammatical Error Correction with Hierarchical Error Detections and Correction","authors":"Fayu Pan, Bin Cao","doi":"10.1109/ICWS53863.2021.00073","DOIUrl":null,"url":null,"abstract":"Noisy text is common in semantic services and can have bad effects. Grammatical Error Correction (GEC) can be used to improve text quality, but traditional neural machine translation approaches need hundreds of milliseconds to correct a single text, which is unacceptable to time-sensitive services. To improve the efficiency of GEC, we choose to detect errors first and then make corrections. We present an intuitive multitask learning approach by checking if the text contains errors, finding errors' positions, and finally generating corrections. Two classifiers are introduced to serially detect sentence-level and token-level errors as errors only take a few parts in common corpora. Different from traditional approaches, we adapt a non-autoregressive decoder and only generate needed words to correct those detected errors, making the correction stage efficient. Experiments show that our approach can be ten times faster than the traditional approach in inference, and can achieve a comparable GEC performance.","PeriodicalId":213320,"journal":{"name":"2021 IEEE International Conference on Web Services (ICWS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Web Services (ICWS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWS53863.2021.00073","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Noisy text is common in semantic services and can have bad effects. Grammatical Error Correction (GEC) can be used to improve text quality, but traditional neural machine translation approaches need hundreds of milliseconds to correct a single text, which is unacceptable to time-sensitive services. To improve the efficiency of GEC, we choose to detect errors first and then make corrections. We present an intuitive multitask learning approach by checking if the text contains errors, finding errors' positions, and finally generating corrections. Two classifiers are introduced to serially detect sentence-level and token-level errors as errors only take a few parts in common corpora. Different from traditional approaches, we adapt a non-autoregressive decoder and only generate needed words to correct those detected errors, making the correction stage efficient. Experiments show that our approach can be ten times faster than the traditional approach in inference, and can achieve a comparable GEC performance.