利用机器学习技术增强师生评价中文本反馈的情感分析

Caren A. Pacol, T. Palaoag
{"title":"利用机器学习技术增强师生评价中文本反馈的情感分析","authors":"Caren A. Pacol, T. Palaoag","doi":"10.33422/ejest.v4i1.604","DOIUrl":null,"url":null,"abstract":"Sentiment Analysis has been an interesting and popular research area encouraging researchers and practitioners to adopt this tool in various fields such as the government, health care and education.  In education, instruction evaluation is one of the activities that sentiment analysis has served.  Though, it is a common practice that educational institutions periodically evaluate their teachers’ performance, students’ comments which are rich in insights are not easily taken into account because of lack of automated text analytics methods. In this study, supervised machine learning algorithms were used. Experiments were conducted to evaluate base models employing naïve bayes, support vector machines and logistic regression in comparison to ensemble combining the three. Random forest, an ensemble learning algorithm was also experimented. Machine learning techniques such as term-frequency – inverse document frequency (TF-IDF) and ngram were also explored to devise a model with the highest possible accuracy. Results show that in this case, tf-idf vectorization does not show significant improvement in sentiment classification. On the other hand, ngram vectorization improve performance of base models and has potential to improve ensemble models. Random forest showed higher performance measures than base models and ensemble of the three base models. However, it did not outperform ngram combined with support vector machines. In future work the model with highest accuracy found can be embedded in a sentiment analysis tool for students’ feedback on teaching performance. More advanced transformation techniques and other ensemble techniques may be explored to further improve accuracy in sentiment classification.","PeriodicalId":143710,"journal":{"name":"European Journal of Engineering Science and Technology","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Enhancing Sentiment Analysis of Textual Feedback in the Student-Faculty Evaluation using Machine Learning Techniques\",\"authors\":\"Caren A. Pacol, T. Palaoag\",\"doi\":\"10.33422/ejest.v4i1.604\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sentiment Analysis has been an interesting and popular research area encouraging researchers and practitioners to adopt this tool in various fields such as the government, health care and education.  In education, instruction evaluation is one of the activities that sentiment analysis has served.  Though, it is a common practice that educational institutions periodically evaluate their teachers’ performance, students’ comments which are rich in insights are not easily taken into account because of lack of automated text analytics methods. In this study, supervised machine learning algorithms were used. Experiments were conducted to evaluate base models employing naïve bayes, support vector machines and logistic regression in comparison to ensemble combining the three. Random forest, an ensemble learning algorithm was also experimented. Machine learning techniques such as term-frequency – inverse document frequency (TF-IDF) and ngram were also explored to devise a model with the highest possible accuracy. Results show that in this case, tf-idf vectorization does not show significant improvement in sentiment classification. On the other hand, ngram vectorization improve performance of base models and has potential to improve ensemble models. Random forest showed higher performance measures than base models and ensemble of the three base models. However, it did not outperform ngram combined with support vector machines. In future work the model with highest accuracy found can be embedded in a sentiment analysis tool for students’ feedback on teaching performance. More advanced transformation techniques and other ensemble techniques may be explored to further improve accuracy in sentiment classification.\",\"PeriodicalId\":143710,\"journal\":{\"name\":\"European Journal of Engineering Science and Technology\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Engineering Science and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33422/ejest.v4i1.604\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Engineering Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33422/ejest.v4i1.604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

情感分析一直是一个有趣且受欢迎的研究领域,鼓励研究人员和从业者在政府、医疗保健和教育等各个领域采用这一工具。在教育中,教学评价是情感分析服务的活动之一。虽然,教育机构定期评估教师的表现是一种常见的做法,但由于缺乏自动文本分析方法,学生的富有洞察力的评论不容易被考虑在内。在本研究中,使用了监督机器学习算法。通过实验来评估使用naïve贝叶斯、支持向量机和逻辑回归的基本模型,并与三者结合的集成模型进行比较。随机森林,一个集成学习算法也进行了实验。还探索了术语频率-逆文档频率(TF-IDF)和ngram等机器学习技术,以设计具有最高准确性的模型。结果表明,在这种情况下,tf-idf矢量化在情感分类方面没有显着改善。另一方面,图矢量化提高了基本模型的性能,并有潜力改进集成模型。随机森林比基本模型和三个基本模型的集合表现出更高的性能指标。然而,它并没有超过ngram与支持向量机的结合。在未来的工作中,发现的准确率最高的模型可以嵌入到学生对教学绩效反馈的情感分析工具中。可以探索更先进的转换技术和其他集成技术,以进一步提高情感分类的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Enhancing Sentiment Analysis of Textual Feedback in the Student-Faculty Evaluation using Machine Learning Techniques
Sentiment Analysis has been an interesting and popular research area encouraging researchers and practitioners to adopt this tool in various fields such as the government, health care and education.  In education, instruction evaluation is one of the activities that sentiment analysis has served.  Though, it is a common practice that educational institutions periodically evaluate their teachers’ performance, students’ comments which are rich in insights are not easily taken into account because of lack of automated text analytics methods. In this study, supervised machine learning algorithms were used. Experiments were conducted to evaluate base models employing naïve bayes, support vector machines and logistic regression in comparison to ensemble combining the three. Random forest, an ensemble learning algorithm was also experimented. Machine learning techniques such as term-frequency – inverse document frequency (TF-IDF) and ngram were also explored to devise a model with the highest possible accuracy. Results show that in this case, tf-idf vectorization does not show significant improvement in sentiment classification. On the other hand, ngram vectorization improve performance of base models and has potential to improve ensemble models. Random forest showed higher performance measures than base models and ensemble of the three base models. However, it did not outperform ngram combined with support vector machines. In future work the model with highest accuracy found can be embedded in a sentiment analysis tool for students’ feedback on teaching performance. More advanced transformation techniques and other ensemble techniques may be explored to further improve accuracy in sentiment classification.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信