{"title":"A Data Lake and Analytics Platform with Application to COVID-19 Dynamic Analysis","authors":"F. Pereira, J. G. F. S. Costa, L. M. Gonçalves","doi":"10.5753/sibgrapi.est.2022.23283","DOIUrl":null,"url":null,"abstract":"We propose a platform consisting of a data lake that has been implemented as a web-based service, to specifically solve the Covid-19 data production and processing problem. The main idea is that it can be used by data scientists working on COVID-19-related projects in order to access as much data as possible in one repository and be able not only to analyze that data but also to manage and contribute to new data. Through this platform, it has been possible to dynamically aggregate different data repositories related to the COVID-19 pandemic, in order to provide users, through a web interface, tools for use, transformations, and collaboration of data, as well as analysis and visualization tools integrated to geographic information systems.","PeriodicalId":182158,"journal":{"name":"Anais Estendidos do XXXV Conference on Graphics, Patterns and Images (SIBGRAPI Estendido 2022)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Anais Estendidos do XXXV Conference on Graphics, Patterns and Images (SIBGRAPI Estendido 2022)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5753/sibgrapi.est.2022.23283","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We propose a platform consisting of a data lake that has been implemented as a web-based service, to specifically solve the Covid-19 data production and processing problem. The main idea is that it can be used by data scientists working on COVID-19-related projects in order to access as much data as possible in one repository and be able not only to analyze that data but also to manage and contribute to new data. Through this platform, it has been possible to dynamically aggregate different data repositories related to the COVID-19 pandemic, in order to provide users, through a web interface, tools for use, transformations, and collaboration of data, as well as analysis and visualization tools integrated to geographic information systems.