{"title":"面向一次性学习的日语作文评分统计学习模型","authors":"Chihiro Ejima, Koichi Takeuchi","doi":"10.1109/IIAIAAI55812.2022.00070","DOIUrl":null,"url":null,"abstract":"A lot of studies of automatic essay scoring are conducted using machine learning models. The previous studies show high performance for scoring large scale essays with machine learning models, however, more than hundreds of scored answers are required to train the neural network models. In this paper we discuss the possibility of one-shot learning, that is, using only one model essay as a training sample of a highest score. For this purpose, we apply regression models to estimate essay scores with different embedding models, that are, BERT and bag-of-words based encoding models. In preliminary experiments, feature analyses of one-shot learning with UMAP for the two embedding models reveal that the bag-of-words based model has more potential to score the test essays comparing to the BERT encoding model. Thus, to clarify the performance of the bag-of-words based encoding model, we conduct two experiments: firstly, we evaluate the performance of models to estimate the scores of test essays using 80% of score essays are used as training data; secondly, one-shot learning is applied to the models. The experimental results show that the proposed bag-of-words based encoding model is promising.","PeriodicalId":156230,"journal":{"name":"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Statistical Learning Models for Japanese Essay Scoring Toward One-shot Learning\",\"authors\":\"Chihiro Ejima, Koichi Takeuchi\",\"doi\":\"10.1109/IIAIAAI55812.2022.00070\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A lot of studies of automatic essay scoring are conducted using machine learning models. The previous studies show high performance for scoring large scale essays with machine learning models, however, more than hundreds of scored answers are required to train the neural network models. In this paper we discuss the possibility of one-shot learning, that is, using only one model essay as a training sample of a highest score. For this purpose, we apply regression models to estimate essay scores with different embedding models, that are, BERT and bag-of-words based encoding models. In preliminary experiments, feature analyses of one-shot learning with UMAP for the two embedding models reveal that the bag-of-words based model has more potential to score the test essays comparing to the BERT encoding model. Thus, to clarify the performance of the bag-of-words based encoding model, we conduct two experiments: firstly, we evaluate the performance of models to estimate the scores of test essays using 80% of score essays are used as training data; secondly, one-shot learning is applied to the models. The experimental results show that the proposed bag-of-words based encoding model is promising.\",\"PeriodicalId\":156230,\"journal\":{\"name\":\"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IIAIAAI55812.2022.00070\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAIAAI55812.2022.00070","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Statistical Learning Models for Japanese Essay Scoring Toward One-shot Learning
A lot of studies of automatic essay scoring are conducted using machine learning models. The previous studies show high performance for scoring large scale essays with machine learning models, however, more than hundreds of scored answers are required to train the neural network models. In this paper we discuss the possibility of one-shot learning, that is, using only one model essay as a training sample of a highest score. For this purpose, we apply regression models to estimate essay scores with different embedding models, that are, BERT and bag-of-words based encoding models. In preliminary experiments, feature analyses of one-shot learning with UMAP for the two embedding models reveal that the bag-of-words based model has more potential to score the test essays comparing to the BERT encoding model. Thus, to clarify the performance of the bag-of-words based encoding model, we conduct two experiments: firstly, we evaluate the performance of models to estimate the scores of test essays using 80% of score essays are used as training data; secondly, one-shot learning is applied to the models. The experimental results show that the proposed bag-of-words based encoding model is promising.