A hybrid machine learning model for sentiment analysis and satisfaction assessment with Turkish universities using Twitter data

Abdulfattah Ba Alawi, Ferhat Bozkurt
{"title":"A hybrid machine learning model for sentiment analysis and satisfaction assessment with Turkish universities using Twitter data","authors":"Abdulfattah Ba Alawi,&nbsp;Ferhat Bozkurt","doi":"10.1016/j.dajour.2024.100473","DOIUrl":null,"url":null,"abstract":"<div><p>Social media platforms, such as X (Twitter), contain enormous amounts of data, and collecting valuable information from that data assists in making more informed decisions. In recent years, governments and institutions have begun to explore the possibilities of utilizing social networks as a platform to supply and enhance the quality of their services. Consequently, there is an increased demand to estimate people’s satisfaction with Turkish Universities written in Turkish wherever it is challenging and requires more effort to cope with its rich morphological structure to perform a Sentiment Analysis task. This study proposes a Turkish text sentiment analysis methodology for estimating people’s satisfaction with Turkish Universities by employing nine conventional machine learning models, deep learning techniques, and BERT-based transformers upon an original manually annotated dataset consisting of 17,793 tweets in Turkish. An innovative hybrid architecture, named BERT-BiLSTM-CNN, integrates Bidirectional Encoder Representations from Transformers (BERT), Bidirectional Long Short-Term Memory (BiLSTM), and a triple parallel Convolutional Neural Network (CNN) branch, achieving exceptional accuracy in sentiment analysis of Turkish tweets. During testing, the proposed architecture revealed an impressive accuracy rate of over 0.9101, an F1 Score of 0.8801, and a Receiver Operating Characteristic (ROC) of 0.9632 for analyzing sentiment, demonstrating that the model outperformed the state-of-the-art models, and it is capable of coping with the linguistic complexities of Turkish sentiment analysis. This work provides new insights into sentiment analysis by proposing a hybrid model that combines several computational methodologies to improve the understanding of attitudes.</p></div>","PeriodicalId":100357,"journal":{"name":"Decision Analytics Journal","volume":"11 ","pages":"Article 100473"},"PeriodicalIF":0.0000,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772662224000778/pdfft?md5=1a3bb3730d164f65bda14895f575f5c7&pid=1-s2.0-S2772662224000778-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision Analytics Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772662224000778","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Social media platforms, such as X (Twitter), contain enormous amounts of data, and collecting valuable information from that data assists in making more informed decisions. In recent years, governments and institutions have begun to explore the possibilities of utilizing social networks as a platform to supply and enhance the quality of their services. Consequently, there is an increased demand to estimate people’s satisfaction with Turkish Universities written in Turkish wherever it is challenging and requires more effort to cope with its rich morphological structure to perform a Sentiment Analysis task. This study proposes a Turkish text sentiment analysis methodology for estimating people’s satisfaction with Turkish Universities by employing nine conventional machine learning models, deep learning techniques, and BERT-based transformers upon an original manually annotated dataset consisting of 17,793 tweets in Turkish. An innovative hybrid architecture, named BERT-BiLSTM-CNN, integrates Bidirectional Encoder Representations from Transformers (BERT), Bidirectional Long Short-Term Memory (BiLSTM), and a triple parallel Convolutional Neural Network (CNN) branch, achieving exceptional accuracy in sentiment analysis of Turkish tweets. During testing, the proposed architecture revealed an impressive accuracy rate of over 0.9101, an F1 Score of 0.8801, and a Receiver Operating Characteristic (ROC) of 0.9632 for analyzing sentiment, demonstrating that the model outperformed the state-of-the-art models, and it is capable of coping with the linguistic complexities of Turkish sentiment analysis. This work provides new insights into sentiment analysis by proposing a hybrid model that combines several computational methodologies to improve the understanding of attitudes.

利用 Twitter 数据对土耳其大学进行情感分析和满意度评估的混合机器学习模型
X (Twitter) 等社交媒体平台包含大量数据,从这些数据中收集有价值的信息有助于做出更明智的决策。近年来,政府和机构已开始探索利用社交网络作为提供和提高服务质量的平台的可能性。因此,人们对用土耳其语撰写的土耳其大学满意度的评估需求日益增加,因为用土耳其语撰写大学具有挑战性,需要花费更多精力来应对其丰富的形态结构,以执行情感分析任务。本研究提出了一种土耳其语文本情感分析方法,通过采用九种传统机器学习模型、深度学习技术和基于 BERT 的转换器,对由 17,793 条土耳其语推文组成的原始人工标注数据集进行分析,以估算人们对土耳其大学的满意度。一种名为BERT-BiLSTM-CNN的创新型混合架构集成了变压器双向编码器表示(BERT)、双向长短期记忆(BiLSTM)和三重并行卷积神经网络(CNN)分支,在土耳其语推文的情感分析中取得了卓越的准确性。在测试过程中,所提出的架构在情感分析方面的准确率超过了 0.9101,F1 得分为 0.8801,ROC 为 0.9632。这项研究提出了一种结合多种计算方法的混合模型,为情感分析提供了新的见解,从而提高了对态度的理解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.90
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信