理解在线公众情绪:基于机器学习的2019年中国国庆期间英汉推特话语分析

Yekai Xu, Qingqian He, S. Ni
{"title":"理解在线公众情绪:基于机器学习的2019年中国国庆期间英汉推特话语分析","authors":"Yekai Xu, Qingqian He, S. Ni","doi":"10.1109/imitec50163.2020.9334093","DOIUrl":null,"url":null,"abstract":"As the Internet gradually penetrates people's daily lives, individual citizens are empowered to demonstrate and exchange opinions and sentiments at any time anywhere. Online communities are increasingly participating in the agenda-setting of public affairs and official policies. However, how to depict online public opinion and to what degree does it influence the real world are still unclear. This study addresses the above problems by analyzing Twitter discourse during the 2019 Chinese National Day with a machine learning-based approach. Over 300,000 English and Chinese tweets were collected between Sept 30 and Oct 3, and a hybrid method of support vector machine (SVM) and dictionary was applied to evaluate the sentiments of the collected tweets. This method avoids complex structures while yielding an average accuracy of over 96% in most classifiers used in the study. The results indicate alignment between the time of National Day celebration activities and the peak of sentiments revealed in both English and Chinese tweets, although the sentiments of the two languages tend to be in opposite directions. The sentiments of tweets also diverge from nation to nation, but are generally consistent with the country's official relations with China. The linguistic features of the tweets suggest different concerns for Twitter users who have different sentiments towards China. Future studies should prolong the collecting period and refine the algorithms used. Specific attention can also be paid to important countries pairs like China and the United States.","PeriodicalId":349926,"journal":{"name":"2020 2nd International Multidisciplinary Information Technology and Engineering Conference (IMITEC)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Understanding Online Public Sentiments: A Machine Learning-Based Analysis of English and Chinese Twitter Discourse during the 2019 Chinese National Day\",\"authors\":\"Yekai Xu, Qingqian He, S. Ni\",\"doi\":\"10.1109/imitec50163.2020.9334093\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the Internet gradually penetrates people's daily lives, individual citizens are empowered to demonstrate and exchange opinions and sentiments at any time anywhere. Online communities are increasingly participating in the agenda-setting of public affairs and official policies. However, how to depict online public opinion and to what degree does it influence the real world are still unclear. This study addresses the above problems by analyzing Twitter discourse during the 2019 Chinese National Day with a machine learning-based approach. Over 300,000 English and Chinese tweets were collected between Sept 30 and Oct 3, and a hybrid method of support vector machine (SVM) and dictionary was applied to evaluate the sentiments of the collected tweets. This method avoids complex structures while yielding an average accuracy of over 96% in most classifiers used in the study. The results indicate alignment between the time of National Day celebration activities and the peak of sentiments revealed in both English and Chinese tweets, although the sentiments of the two languages tend to be in opposite directions. The sentiments of tweets also diverge from nation to nation, but are generally consistent with the country's official relations with China. The linguistic features of the tweets suggest different concerns for Twitter users who have different sentiments towards China. Future studies should prolong the collecting period and refine the algorithms used. Specific attention can also be paid to important countries pairs like China and the United States.\",\"PeriodicalId\":349926,\"journal\":{\"name\":\"2020 2nd International Multidisciplinary Information Technology and Engineering Conference (IMITEC)\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 2nd International Multidisciplinary Information Technology and Engineering Conference (IMITEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/imitec50163.2020.9334093\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd International Multidisciplinary Information Technology and Engineering Conference (IMITEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/imitec50163.2020.9334093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

随着互联网逐渐渗透到人们的日常生活中,公民个人可以随时随地表达和交流意见和情绪。网络社区越来越多地参与公共事务和官方政策的议程设置。然而,如何描绘网络民意以及它对现实世界的影响程度仍然不清楚。本研究通过基于机器学习的方法分析2019年中国国庆节期间的推特话语,解决了上述问题。在9月30日至10月3日期间收集了30多万条中英文推文,并采用支持向量机(SVM)和字典的混合方法对收集到的推文的情感进行评估。该方法避免了复杂的结构,同时在研究中使用的大多数分类器中产生超过96%的平均准确率。结果表明,国庆庆祝活动的时间与英语和中文推特上显示的情绪高峰是一致的,尽管两种语言的情绪倾向于相反的方向。推特上的情绪也因国而异,但总体上与该国与中国的官方关系一致。这些推文的语言特征表明,对中国有不同看法的推特用户关注的问题不同。未来的研究应延长收集周期并改进所使用的算法。还可以特别关注像中国和美国这样的重要国家。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Understanding Online Public Sentiments: A Machine Learning-Based Analysis of English and Chinese Twitter Discourse during the 2019 Chinese National Day
As the Internet gradually penetrates people's daily lives, individual citizens are empowered to demonstrate and exchange opinions and sentiments at any time anywhere. Online communities are increasingly participating in the agenda-setting of public affairs and official policies. However, how to depict online public opinion and to what degree does it influence the real world are still unclear. This study addresses the above problems by analyzing Twitter discourse during the 2019 Chinese National Day with a machine learning-based approach. Over 300,000 English and Chinese tweets were collected between Sept 30 and Oct 3, and a hybrid method of support vector machine (SVM) and dictionary was applied to evaluate the sentiments of the collected tweets. This method avoids complex structures while yielding an average accuracy of over 96% in most classifiers used in the study. The results indicate alignment between the time of National Day celebration activities and the peak of sentiments revealed in both English and Chinese tweets, although the sentiments of the two languages tend to be in opposite directions. The sentiments of tweets also diverge from nation to nation, but are generally consistent with the country's official relations with China. The linguistic features of the tweets suggest different concerns for Twitter users who have different sentiments towards China. Future studies should prolong the collecting period and refine the algorithms used. Specific attention can also be paid to important countries pairs like China and the United States.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信