使用深度学习和机器学习方法分析社交媒体用户观点：航空公司案例研究

Turkish Journal of Mathematics and Computer Science Pub Date : 2023-10-31 DOI:10.47000/tjmcs.1368430

Ömer Ayberk Şencan, I. Atacak

{"title":"使用深度学习和机器学习方法分析社交媒体用户观点：航空公司案例研究","authors":"Ömer Ayberk Şencan, I. Atacak","doi":"10.47000/tjmcs.1368430","DOIUrl":null,"url":null,"abstract":"ABsTRACT. The rapid surge in social media usage has augmented the significance and value of data available on these platforms. As a result, analyzing community sentiment and opinions related to various topics and events using social media data has become increasingly crucial. However, the sheer volume of data produced on social media platforms surpasses human processing capabilities. Consequently, artificial intelligence-based models became frequently employed in social media analysis. In this study, deep learning (DL) and machine learning (ML) methods are applied to assess user opinions regarding airlines, and the effectiveness of these methods in social media analysis is comparatively discussed based on the performance results obtained. Due to the imbalanced nature of the dataset, synthetic data is produced using the Synthetic Minority Over-Sampling Technique (SMOTE) to enhance model performance. Before the SMOTE process, the dataset containing 14640 data points expanded to 27534 data points after the SMOTE process. The experimental results demonstrate that Support Vector Machines (SVM) achieved the highest performance among all methods with accuracy, precision, recall, and F-score values of 0.79 in the pre-SMOTE (imbalanced dataset). In contrast, Random Forest (RF) obtained the best performance among all methods, with accuracy, precision, recall, and F-score values of 0.88 in the post-SMOTE (balanced data set). Moreover, experimental findings demonstrate that SMOTE led to performance improvements in ML and DL models, ranging from a minimum of 3% to a maximum of 24% increase in F-Score metric.","PeriodicalId":506513,"journal":{"name":"Turkish Journal of Mathematics and Computer Science","volume":"31 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Social Media User Opinion Analysis Using Deep Learning and Machine Learning Methods: A Case Study on Airlines\",\"authors\":\"Ömer Ayberk Şencan, I. Atacak\",\"doi\":\"10.47000/tjmcs.1368430\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABsTRACT. The rapid surge in social media usage has augmented the significance and value of data available on these platforms. As a result, analyzing community sentiment and opinions related to various topics and events using social media data has become increasingly crucial. However, the sheer volume of data produced on social media platforms surpasses human processing capabilities. Consequently, artificial intelligence-based models became frequently employed in social media analysis. In this study, deep learning (DL) and machine learning (ML) methods are applied to assess user opinions regarding airlines, and the effectiveness of these methods in social media analysis is comparatively discussed based on the performance results obtained. Due to the imbalanced nature of the dataset, synthetic data is produced using the Synthetic Minority Over-Sampling Technique (SMOTE) to enhance model performance. Before the SMOTE process, the dataset containing 14640 data points expanded to 27534 data points after the SMOTE process. The experimental results demonstrate that Support Vector Machines (SVM) achieved the highest performance among all methods with accuracy, precision, recall, and F-score values of 0.79 in the pre-SMOTE (imbalanced dataset). In contrast, Random Forest (RF) obtained the best performance among all methods, with accuracy, precision, recall, and F-score values of 0.88 in the post-SMOTE (balanced data set). Moreover, experimental findings demonstrate that SMOTE led to performance improvements in ML and DL models, ranging from a minimum of 3% to a maximum of 24% increase in F-Score metric.\",\"PeriodicalId\":506513,\"journal\":{\"name\":\"Turkish Journal of Mathematics and Computer Science\",\"volume\":\"31 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Turkish Journal of Mathematics and Computer Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47000/tjmcs.1368430\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Mathematics and Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47000/tjmcs.1368430","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

摘要。社交媒体使用量的快速激增增强了这些平台上可用数据的重要性和价值。因此，利用社交媒体数据分析与各种话题和事件相关的社区情绪和观点变得越来越重要。然而，社交媒体平台上产生的大量数据超出了人类的处理能力。因此，基于人工智能的模型经常被用于社交媒体分析。本研究将深度学习（DL）和机器学习（ML）方法应用于评估用户对航空公司的意见，并根据所获得的性能结果比较讨论了这些方法在社交媒体分析中的有效性。由于数据集的不平衡性，我们使用合成少数群体过度采样技术（SMOTE）生成合成数据，以提高模型性能。在进行 SMOTE 处理之前，数据集包含 14640 个数据点，经过 SMOTE 处理后，数据集扩大到 27534 个数据点。实验结果表明，在所有方法中，支持向量机（SVM）在 SMOTE 前（不平衡数据集）的准确度、精确度、召回率和 F 分数值均为 0.79，取得了最高的性能。相比之下，随机森林（RF）在所有方法中表现最佳，在后 SMOTE（平衡数据集）中的准确度、精确度、召回率和 F 分数均为 0.88。此外，实验结果表明，SMOTE 提高了 ML 和 DL 模型的性能，F-Score 指标的提高幅度最小为 3%，最大为 24%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Social Media User Opinion Analysis Using Deep Learning and Machine Learning Methods: A Case Study on Airlines

ABsTRACT. The rapid surge in social media usage has augmented the significance and value of data available on these platforms. As a result, analyzing community sentiment and opinions related to various topics and events using social media data has become increasingly crucial. However, the sheer volume of data produced on social media platforms surpasses human processing capabilities. Consequently, artificial intelligence-based models became frequently employed in social media analysis. In this study, deep learning (DL) and machine learning (ML) methods are applied to assess user opinions regarding airlines, and the effectiveness of these methods in social media analysis is comparatively discussed based on the performance results obtained. Due to the imbalanced nature of the dataset, synthetic data is produced using the Synthetic Minority Over-Sampling Technique (SMOTE) to enhance model performance. Before the SMOTE process, the dataset containing 14640 data points expanded to 27534 data points after the SMOTE process. The experimental results demonstrate that Support Vector Machines (SVM) achieved the highest performance among all methods with accuracy, precision, recall, and F-score values of 0.79 in the pre-SMOTE (imbalanced dataset). In contrast, Random Forest (RF) obtained the best performance among all methods, with accuracy, precision, recall, and F-score values of 0.88 in the post-SMOTE (balanced data set). Moreover, experimental findings demonstrate that SMOTE led to performance improvements in ML and DL models, ranging from a minimum of 3% to a maximum of 24% increase in F-Score metric.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Turkish Journal of Mathematics and Computer Science

自引率

0.00%

发文量