CandiDATA: an enhanced dataset for data analysis of elections in Brazil from 1945 to 2020

Felipe F. Vasconcelos, João V. S. Tavares, Matheus G. S. Oliveira, Fábio Coutinho, João Paulo Clarindo
{"title":"CandiDATA: an enhanced dataset for data analysis of elections in Brazil from 1945 to 2020","authors":"Felipe F. Vasconcelos, João V. S. Tavares, Matheus G. S. Oliveira, Fábio Coutinho, João Paulo Clarindo","doi":"10.5753/jidm.2022.2361","DOIUrl":null,"url":null,"abstract":"The Brazilian Superior Electoral Court (TSE) keeps data on elections that have taken place in Brazil since 1933. These data constitute an important collection serving as a reference for works in several research areas. However, this collection is not fully exploited due to some problems, such as missing and non-standard data, making analysis and integration with external databases difficult. Previous works built limited datasets and tools because of these problems as they only include data since the 1998 election, disregarding the election years from 1945 and 1996. This work discusses the steps to create CandiDATA – a standardized and enhanced dataset from TSE data, including a toolkit of webscrapping and data visualization. CandiDATA is available in open format and covers the election period between 1945 and 2020.","PeriodicalId":301338,"journal":{"name":"J. Inf. Data Manag.","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Inf. Data Manag.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5753/jidm.2022.2361","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The Brazilian Superior Electoral Court (TSE) keeps data on elections that have taken place in Brazil since 1933. These data constitute an important collection serving as a reference for works in several research areas. However, this collection is not fully exploited due to some problems, such as missing and non-standard data, making analysis and integration with external databases difficult. Previous works built limited datasets and tools because of these problems as they only include data since the 1998 election, disregarding the election years from 1945 and 1996. This work discusses the steps to create CandiDATA – a standardized and enhanced dataset from TSE data, including a toolkit of webscrapping and data visualization. CandiDATA is available in open format and covers the election period between 1945 and 2020.
CandiDATA:一个增强的数据集,用于分析1945年至2020年巴西选举的数据
巴西高级选举法院(TSE)保存着自1933年以来巴西举行的选举数据。这些数据构成了一个重要的集合,为几个研究领域的工作提供了参考。然而,由于一些问题,例如缺少和非标准的数据,使得分析和与外部数据库的集成变得困难,因此没有充分利用这个集合。由于这些问题,以前的工作建立了有限的数据集和工具,因为它们只包括1998年选举以来的数据,而忽略了1945年和1996年的选举年。本文讨论了创建CandiDATA的步骤,这是一个基于TSE数据的标准化和增强数据集,包括一个web抓取和数据可视化工具包。CandiDATA以开放格式提供,涵盖1945年至2020年的选举期间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信