Maricela Pinargote Ortega, Lorena Bowen Mendoza, J. M. Hormaza, S. Soto
{"title":"Accuracy' Measures of Sentiment Analysis Algorithms for Spanish Corpus generated in Peer Assessment","authors":"Maricela Pinargote Ortega, Lorena Bowen Mendoza, J. M. Hormaza, S. Soto","doi":"10.1145/3410352.3410838","DOIUrl":null,"url":null,"abstract":"The purpose of this study is to test a model that classifies some sentiment as positive or negative from some feedback in Spanish that are generated through peer assessment in Higher Education. The Supervised Machine Learning method is implemented. Several experiments are performed with a manually tagged data set to test different combinations of N-grams with Term Frequency-Inverse Document Frequency (TF-IDF), and classification algorithms: Multinomial Naive Bayes, Support Vector Machine, Logistic Regression, and also Random Forest, in order to obtain the right combination that gives the best performance. The simulation results displayed that the Support Vector Machine classifier with the combination of 1-grams + 2-grams + TF-IDF is the best model in Precision, Recall and F-Measure.","PeriodicalId":178037,"journal":{"name":"Proceedings of the 6th International Conference on Engineering & MIS 2020","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Engineering & MIS 2020","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3410352.3410838","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The purpose of this study is to test a model that classifies some sentiment as positive or negative from some feedback in Spanish that are generated through peer assessment in Higher Education. The Supervised Machine Learning method is implemented. Several experiments are performed with a manually tagged data set to test different combinations of N-grams with Term Frequency-Inverse Document Frequency (TF-IDF), and classification algorithms: Multinomial Naive Bayes, Support Vector Machine, Logistic Regression, and also Random Forest, in order to obtain the right combination that gives the best performance. The simulation results displayed that the Support Vector Machine classifier with the combination of 1-grams + 2-grams + TF-IDF is the best model in Precision, Recall and F-Measure.