在同行评估中生成的西班牙语语料库的情感分析算法的准确性度量

Proceedings of the 6th International Conference on Engineering & MIS 2020 Pub Date : 2020-09-14 DOI:10.1145/3410352.3410838

Maricela Pinargote Ortega, Lorena Bowen Mendoza, J. M. Hormaza, S. Soto

{"title":"在同行评估中生成的西班牙语语料库的情感分析算法的准确性度量","authors":"Maricela Pinargote Ortega, Lorena Bowen Mendoza, J. M. Hormaza, S. Soto","doi":"10.1145/3410352.3410838","DOIUrl":null,"url":null,"abstract":"The purpose of this study is to test a model that classifies some sentiment as positive or negative from some feedback in Spanish that are generated through peer assessment in Higher Education. The Supervised Machine Learning method is implemented. Several experiments are performed with a manually tagged data set to test different combinations of N-grams with Term Frequency-Inverse Document Frequency (TF-IDF), and classification algorithms: Multinomial Naive Bayes, Support Vector Machine, Logistic Regression, and also Random Forest, in order to obtain the right combination that gives the best performance. The simulation results displayed that the Support Vector Machine classifier with the combination of 1-grams + 2-grams + TF-IDF is the best model in Precision, Recall and F-Measure.","PeriodicalId":178037,"journal":{"name":"Proceedings of the 6th International Conference on Engineering & MIS 2020","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Accuracy' Measures of Sentiment Analysis Algorithms for Spanish Corpus generated in Peer Assessment\",\"authors\":\"Maricela Pinargote Ortega, Lorena Bowen Mendoza, J. M. Hormaza, S. Soto\",\"doi\":\"10.1145/3410352.3410838\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The purpose of this study is to test a model that classifies some sentiment as positive or negative from some feedback in Spanish that are generated through peer assessment in Higher Education. The Supervised Machine Learning method is implemented. Several experiments are performed with a manually tagged data set to test different combinations of N-grams with Term Frequency-Inverse Document Frequency (TF-IDF), and classification algorithms: Multinomial Naive Bayes, Support Vector Machine, Logistic Regression, and also Random Forest, in order to obtain the right combination that gives the best performance. The simulation results displayed that the Support Vector Machine classifier with the combination of 1-grams + 2-grams + TF-IDF is the best model in Precision, Recall and F-Measure.\",\"PeriodicalId\":178037,\"journal\":{\"name\":\"Proceedings of the 6th International Conference on Engineering & MIS 2020\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th International Conference on Engineering & MIS 2020\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3410352.3410838\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Engineering & MIS 2020","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3410352.3410838","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

本研究的目的是测试一个模型，该模型从西班牙语的一些反馈中将一些情绪分类为积极或消极，这些反馈是通过高等教育的同行评估产生的。实现了监督式机器学习方法。使用手动标记的数据集进行了几个实验，以测试n -gram的不同组合，包括术语频率-逆文档频率(TF-IDF)和分类算法:多项朴素贝叶斯，支持向量机，逻辑回归和随机森林，以获得最佳性能的正确组合。仿真结果表明，1-g + 2-g + TF-IDF组合的支持向量机分类器在精度、召回率和F-Measure方面是最好的模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Accuracy' Measures of Sentiment Analysis Algorithms for Spanish Corpus generated in Peer Assessment

The purpose of this study is to test a model that classifies some sentiment as positive or negative from some feedback in Spanish that are generated through peer assessment in Higher Education. The Supervised Machine Learning method is implemented. Several experiments are performed with a manually tagged data set to test different combinations of N-grams with Term Frequency-Inverse Document Frequency (TF-IDF), and classification algorithms: Multinomial Naive Bayes, Support Vector Machine, Logistic Regression, and also Random Forest, in order to obtain the right combination that gives the best performance. The simulation results displayed that the Support Vector Machine classifier with the combination of 1-grams + 2-grams + TF-IDF is the best model in Precision, Recall and F-Measure.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 6th International Conference on Engineering & MIS 2020

自引率

0.00%

发文量