Maricela Pinargote Ortega, Lorena Bowen Mendoza, J. M. Hormaza, S. Soto
{"title":"在同行评估中生成的西班牙语语料库的情感分析算法的准确性度量","authors":"Maricela Pinargote Ortega, Lorena Bowen Mendoza, J. M. Hormaza, S. Soto","doi":"10.1145/3410352.3410838","DOIUrl":null,"url":null,"abstract":"The purpose of this study is to test a model that classifies some sentiment as positive or negative from some feedback in Spanish that are generated through peer assessment in Higher Education. The Supervised Machine Learning method is implemented. Several experiments are performed with a manually tagged data set to test different combinations of N-grams with Term Frequency-Inverse Document Frequency (TF-IDF), and classification algorithms: Multinomial Naive Bayes, Support Vector Machine, Logistic Regression, and also Random Forest, in order to obtain the right combination that gives the best performance. The simulation results displayed that the Support Vector Machine classifier with the combination of 1-grams + 2-grams + TF-IDF is the best model in Precision, Recall and F-Measure.","PeriodicalId":178037,"journal":{"name":"Proceedings of the 6th International Conference on Engineering & MIS 2020","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Accuracy' Measures of Sentiment Analysis Algorithms for Spanish Corpus generated in Peer Assessment\",\"authors\":\"Maricela Pinargote Ortega, Lorena Bowen Mendoza, J. M. Hormaza, S. Soto\",\"doi\":\"10.1145/3410352.3410838\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The purpose of this study is to test a model that classifies some sentiment as positive or negative from some feedback in Spanish that are generated through peer assessment in Higher Education. The Supervised Machine Learning method is implemented. Several experiments are performed with a manually tagged data set to test different combinations of N-grams with Term Frequency-Inverse Document Frequency (TF-IDF), and classification algorithms: Multinomial Naive Bayes, Support Vector Machine, Logistic Regression, and also Random Forest, in order to obtain the right combination that gives the best performance. The simulation results displayed that the Support Vector Machine classifier with the combination of 1-grams + 2-grams + TF-IDF is the best model in Precision, Recall and F-Measure.\",\"PeriodicalId\":178037,\"journal\":{\"name\":\"Proceedings of the 6th International Conference on Engineering & MIS 2020\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th International Conference on Engineering & MIS 2020\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3410352.3410838\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Engineering & MIS 2020","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3410352.3410838","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Accuracy' Measures of Sentiment Analysis Algorithms for Spanish Corpus generated in Peer Assessment
The purpose of this study is to test a model that classifies some sentiment as positive or negative from some feedback in Spanish that are generated through peer assessment in Higher Education. The Supervised Machine Learning method is implemented. Several experiments are performed with a manually tagged data set to test different combinations of N-grams with Term Frequency-Inverse Document Frequency (TF-IDF), and classification algorithms: Multinomial Naive Bayes, Support Vector Machine, Logistic Regression, and also Random Forest, in order to obtain the right combination that gives the best performance. The simulation results displayed that the Support Vector Machine classifier with the combination of 1-grams + 2-grams + TF-IDF is the best model in Precision, Recall and F-Measure.