Developing a data pipeline to improve accessibility and utilization of Charlottesville's Open Data Portal

L. Beane, Elena Gillis, Raf Alvarado, C. Wylie
Published in: 2019 Systems and Information Engineering Design Symposium (SIEDS), 2019-04-26
DOI: https://doi.org/10.1109/SIEDS.2019.8735653

Abstract

To improve democratic engagement between residents and their government, the city of Charlottesville proposed an online portal containing data from city departments that is public by nature. The portal was intended to ease access to data pertinent to ongoing policy debates in the city and to encourage informed public participation in the policy-making process. These efforts, while successful at the start, have gradually stagnated, and the portal's end objective has not been reached. In this paper we identify possible reasons for this stagnation: inconsistent formatting of the datasets, variables that are not human-readable, and limited data with disproportionate representation across city departments. We then propose a data pipeline that serves as a tool to extract utility from the data. It does so by converting the datasets into a consistent format, merging them, and supporting the creation of simple visualizations. The pipeline acts as a link between the raw data published by government units and the city's public: it increases the data's interpretability and legibility and outputs results that relate directly to the policy issues at hand. We demonstrate this by analyzing crime and real estate datasets and relating our findings to the affordable housing debate.
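The three pipeline stages named in the abstract (converting to a consistent format, merging, and simple visualization) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the column names (`Neighborhood Name`, `ASSESSED VALUE`, `Offense`), the join key, and the two tiny inline datasets are all hypothetical, made-up placeholders, and the sketch uses only the Python standard library.

```python
import csv
import io
import re
from collections import Counter


def normalize_header(name):
    """Turn a raw column name such as 'Neighborhood Name' into 'neighborhood_name'."""
    return re.sub(r"[^0-9a-z]+", "_", name.strip().lower()).strip("_")


def read_normalized(csv_text):
    """Parse CSV text into dict rows with normalized headers and trimmed values."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        {normalize_header(k): v.strip() for k, v in row.items()}
        for row in reader
    ]


def merge_counts(parcels, incidents, key):
    """Left join: attach a per-key incident count to every parcel row.

    Keys are compared case-insensitively because the toy inputs disagree on casing.
    """
    counts = Counter(row[key].lower() for row in incidents)
    return [{**p, "incident_count": counts[p[key].lower()]} for p in parcels]


def text_bar_chart(rows, label, value):
    """Render one line per row: a padded label followed by a bar of '#' characters."""
    return "\n".join(f"{r[label]:<12} {'#' * r[value]}" for r in rows)


# Made-up toy inputs (NOT real portal data) with deliberately inconsistent headers.
CRIME_CSV = "Offense,Neighborhood Name\nLarceny,Area A\nVandalism,area a\nLarceny,Area B\n"
PARCEL_CSV = "neighborhood_name,ASSESSED VALUE\nArea A,300000\nArea B,250000\n"

incidents = read_normalized(CRIME_CSV)   # stage 1: consistent format
parcels = read_normalized(PARCEL_CSV)
merged = merge_counts(parcels, incidents, key="neighborhood_name")  # stage 2: merge
chart = text_bar_chart(merged, "neighborhood_name", "incident_count")  # stage 3: visualize
print(chart)
```

Normalizing headers before the join is what lets two datasets with different naming conventions share a key; a real pipeline would also need to reconcile value encodings and units, which this sketch does not attempt.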