{"title":"Gated Character-aware Convolutional Neural Network for Effective Automated Essay Scoring","authors":"Huanyu Bai, Zhilin Huang, Anran Hao, S. Hui","doi":"10.1145/3486622.3493945","DOIUrl":null,"url":null,"abstract":"Automated Essay Scoring (AES) is a challenging topic in Natural Language Processing. Many current state-of-the-art approaches are based on deep learning models. However, most AES models overlook the importance of character-level information, which is important to both the performance and the fairness. The character-level information is able to provide orthographic knowledge (e.g., spelling) and help the learning of infrequent and ⟨UNK⟩ tokens. In this paper, we propose a Gated Character-aware Convolutional Neural Network (GCCNN) model for the AES task. The proposed GCCNN model incorporates character-level information by a character-level encoder and a gated fusion mechanism. First, the character-level encoder learns word embeddings from sequences of characters by a hierarchical convolutional neural network. Next, the gated fusion mechanism adaptively controls the amount of word-level and character-level information to be fused using vector gating. Then, the essay-level encoder learns an essay representation based on the fused word embeddings. Finally, the fully connected layer maps the essay representation into its corresponding score. The experimental results show that our GCCNN model outperforms the baseline deep learning models. In addition, our qualitative analysis also demonstrates the importance of character-level information for tackling the out-of-vocabulary problem in grading essays.","PeriodicalId":89230,"journal":{"name":"Proceedings. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":"96 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3486622.3493945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Automated Essay Scoring (AES) is a challenging topic in Natural Language Processing. Many current state-of-the-art approaches are based on deep learning models. However, most AES models overlook the importance of character-level information, which is important to both the performance and the fairness. The character-level information is able to provide orthographic knowledge (e.g., spelling) and help the learning of infrequent and ⟨UNK⟩ tokens. In this paper, we propose a Gated Character-aware Convolutional Neural Network (GCCNN) model for the AES task. The proposed GCCNN model incorporates character-level information by a character-level encoder and a gated fusion mechanism. First, the character-level encoder learns word embeddings from sequences of characters by a hierarchical convolutional neural network. Next, the gated fusion mechanism adaptively controls the amount of word-level and character-level information to be fused using vector gating. Then, the essay-level encoder learns an essay representation based on the fused word embeddings. Finally, the fully connected layer maps the essay representation into its corresponding score. The experimental results show that our GCCNN model outperforms the baseline deep learning models. In addition, our qualitative analysis also demonstrates the importance of character-level information for tackling the out-of-vocabulary problem in grading essays.