根据推特数据预测法国总统选举结果

Taras Rudnyk, O. Chertov
{"title":"根据推特数据预测法国总统选举结果","authors":"Taras Rudnyk, O. Chertov","doi":"10.31891/csit-2022-4-4","DOIUrl":null,"url":null,"abstract":"This paper presents the study to collect, store and analyze data from Twitter to forecast French presidential election results, compared to sociological polls. The first and probably the most important step of the research is to collect, store and clean data, the whole result depends on the amount and quality of data. In the next step of research, datasets are analyzed. Lastly, complete report and visualizations are provided. In the study, we propose modern technics, mathematical algorithms, and machine learning approaches to analyze big amounts of data from the Twitter social network in order to forecast the 2022 French presidential election results. The determined outcome is compared with sociological polls and the real results of elections. \nIn the conducted research modern types of media are compared to select the best one for election prediction. Selected Twitter social network as the one with the most appropriate data and availability to download big amounts of useful information. The approach based on the usage of Python programming language, Selenium browser emulation and MongoDB database was used to collect, store and clean data about the main French election candidates – Emmanuel Macron and Marine Le Pen. The research was made from August 2021 until the election itself in April 2022. The determined outcome is compared with sociological polls and the results of elections and showed that analysis of social network data could be a good alternative to traditional sociological polls as it shows the same trends month by month and well predicted the win of Emmanuel Macron in elections. Moreover, the proposed approach has its benefits compared to sociological polls such as always being fresh, and close to real-time information, the price of research is much lower and could be reused for the next parliamentary or presidential elections with a small modification. \nThe research could be extended and adapted for other countries. Currently, the proposed algorithms and mathematical models showed good results in the French and Ukraine elections. It works well with English, French, Ukrainian and Russian languages. This allows us to claim that it will also work fine with other Latin or Cyrillic alphabets but for Asian or Arabic languages more research would be needed. Twitter is a good choice for European and American countries. In the future, other social networks should be considered for the countries in which it is not so popular.","PeriodicalId":353631,"journal":{"name":"Computer systems and information technologies","volume":"29 5","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FORECASTING THE RESULTS OF THE PRESIDENTIAL ELECTIONS IN FRANCE BASED ON TWITTER DATA\",\"authors\":\"Taras Rudnyk, O. Chertov\",\"doi\":\"10.31891/csit-2022-4-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents the study to collect, store and analyze data from Twitter to forecast French presidential election results, compared to sociological polls. The first and probably the most important step of the research is to collect, store and clean data, the whole result depends on the amount and quality of data. In the next step of research, datasets are analyzed. Lastly, complete report and visualizations are provided. In the study, we propose modern technics, mathematical algorithms, and machine learning approaches to analyze big amounts of data from the Twitter social network in order to forecast the 2022 French presidential election results. The determined outcome is compared with sociological polls and the real results of elections. \\nIn the conducted research modern types of media are compared to select the best one for election prediction. Selected Twitter social network as the one with the most appropriate data and availability to download big amounts of useful information. The approach based on the usage of Python programming language, Selenium browser emulation and MongoDB database was used to collect, store and clean data about the main French election candidates – Emmanuel Macron and Marine Le Pen. The research was made from August 2021 until the election itself in April 2022. The determined outcome is compared with sociological polls and the results of elections and showed that analysis of social network data could be a good alternative to traditional sociological polls as it shows the same trends month by month and well predicted the win of Emmanuel Macron in elections. Moreover, the proposed approach has its benefits compared to sociological polls such as always being fresh, and close to real-time information, the price of research is much lower and could be reused for the next parliamentary or presidential elections with a small modification. \\nThe research could be extended and adapted for other countries. Currently, the proposed algorithms and mathematical models showed good results in the French and Ukraine elections. It works well with English, French, Ukrainian and Russian languages. This allows us to claim that it will also work fine with other Latin or Cyrillic alphabets but for Asian or Arabic languages more research would be needed. Twitter is a good choice for European and American countries. In the future, other social networks should be considered for the countries in which it is not so popular.\",\"PeriodicalId\":353631,\"journal\":{\"name\":\"Computer systems and information technologies\",\"volume\":\"29 5\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer systems and information technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31891/csit-2022-4-4\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer systems and information technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31891/csit-2022-4-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出的研究收集,存储和分析数据从Twitter预测法国总统选举结果,与社会学民意调查。研究的第一步,可能也是最重要的一步是收集、存储和清理数据,整个结果取决于数据的数量和质量。在下一步的研究中,对数据集进行分析。最后,提供了完整的报告和可视化。在这项研究中,我们提出了现代技术、数学算法和机器学习方法来分析来自Twitter社交网络的大量数据,以预测2022年法国总统选举的结果。将确定的结果与社会学民意调查和实际选举结果进行比较。在进行的研究中,比较了现代媒体类型,以选择最佳的选举预测媒介。选择Twitter社交网络作为最合适的数据和可用性下载大量有用的信息。该方法基于Python编程语言、Selenium浏览器仿真和MongoDB数据库的使用,用于收集、存储和清理法国主要选举候选人埃马努尔·马克龙和马琳·勒庞的数据。这项研究从2021年8月开始,一直进行到2022年4月大选。将确定的结果与社会学民意调查和选举结果进行比较,表明对社交网络数据的分析可能是传统社会学民意调查的一个很好的替代方案,因为它每月显示相同的趋势,并很好地预测了埃马纽埃尔·马克龙在选举中的胜利。此外,与社会学民意调查相比,该方法有其优点,如始终保持新鲜,接近实时信息,研究成本低得多,并且可以在稍加修改后重新用于下届议会或总统选举。这项研究可以扩展并适用于其他国家。目前,所提出的算法和数学模型在法国和乌克兰选举中显示出良好的效果。它适用于英语、法语、乌克兰语和俄语。这让我们可以声称,它也可以很好地工作与其他拉丁或西里尔字母,但对于亚洲或阿拉伯语言将需要更多的研究。对于欧美国家来说,Twitter是一个不错的选择。在未来,应该考虑其他的社交网络,在那些不太受欢迎的国家。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
FORECASTING THE RESULTS OF THE PRESIDENTIAL ELECTIONS IN FRANCE BASED ON TWITTER DATA
This paper presents the study to collect, store and analyze data from Twitter to forecast French presidential election results, compared to sociological polls. The first and probably the most important step of the research is to collect, store and clean data, the whole result depends on the amount and quality of data. In the next step of research, datasets are analyzed. Lastly, complete report and visualizations are provided. In the study, we propose modern technics, mathematical algorithms, and machine learning approaches to analyze big amounts of data from the Twitter social network in order to forecast the 2022 French presidential election results. The determined outcome is compared with sociological polls and the real results of elections. In the conducted research modern types of media are compared to select the best one for election prediction. Selected Twitter social network as the one with the most appropriate data and availability to download big amounts of useful information. The approach based on the usage of Python programming language, Selenium browser emulation and MongoDB database was used to collect, store and clean data about the main French election candidates – Emmanuel Macron and Marine Le Pen. The research was made from August 2021 until the election itself in April 2022. The determined outcome is compared with sociological polls and the results of elections and showed that analysis of social network data could be a good alternative to traditional sociological polls as it shows the same trends month by month and well predicted the win of Emmanuel Macron in elections. Moreover, the proposed approach has its benefits compared to sociological polls such as always being fresh, and close to real-time information, the price of research is much lower and could be reused for the next parliamentary or presidential elections with a small modification. The research could be extended and adapted for other countries. Currently, the proposed algorithms and mathematical models showed good results in the French and Ukraine elections. It works well with English, French, Ukrainian and Russian languages. This allows us to claim that it will also work fine with other Latin or Cyrillic alphabets but for Asian or Arabic languages more research would be needed. Twitter is a good choice for European and American countries. In the future, other social networks should be considered for the countries in which it is not so popular.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信