Craig A. Knoblock, Aparna R. Joshi, Abhishek Megotia, Minh Pham, Chelsea Ursaner
{"title":"自动时空索引整合和分析组织数据","authors":"Craig A. Knoblock, Aparna R. Joshi, Abhishek Megotia, Minh Pham, Chelsea Ursaner","doi":"10.1145/3152178.3152185","DOIUrl":null,"url":null,"abstract":"Organizations are awash in data. In many cases, they do not know what data exists within the organization and much information is not available when needed, or worse, information gets recreated from other sources. In this paper, we present an automatic approach to spatio-temporal indexing of the datasets within an organization. The indexing process automatically identifies the spatial and temporal fields, normalizes and cleans those fields, and then loads them into a big data store where the information can be efficiently searched, queried, and analyzed. We evaluated our approach on 600 datasets published by the City of Los Angeles and show that we can automatically process their data and can efficiently access and analyze the indexed data.","PeriodicalId":378940,"journal":{"name":"Proceedings of the 3rd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Automatic Spatio-temporal Indexing to Integrate and Analyze the Data of an Organization\",\"authors\":\"Craig A. Knoblock, Aparna R. Joshi, Abhishek Megotia, Minh Pham, Chelsea Ursaner\",\"doi\":\"10.1145/3152178.3152185\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Organizations are awash in data. In many cases, they do not know what data exists within the organization and much information is not available when needed, or worse, information gets recreated from other sources. In this paper, we present an automatic approach to spatio-temporal indexing of the datasets within an organization. The indexing process automatically identifies the spatial and temporal fields, normalizes and cleans those fields, and then loads them into a big data store where the information can be efficiently searched, queried, and analyzed. We evaluated our approach on 600 datasets published by the City of Los Angeles and show that we can automatically process their data and can efficiently access and analyze the indexed data.\",\"PeriodicalId\":378940,\"journal\":{\"name\":\"Proceedings of the 3rd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 3rd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3152178.3152185\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3152178.3152185","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic Spatio-temporal Indexing to Integrate and Analyze the Data of an Organization
Organizations are awash in data. In many cases, they do not know what data exists within the organization and much information is not available when needed, or worse, information gets recreated from other sources. In this paper, we present an automatic approach to spatio-temporal indexing of the datasets within an organization. The indexing process automatically identifies the spatial and temporal fields, normalizes and cleans those fields, and then loads them into a big data store where the information can be efficiently searched, queried, and analyzed. We evaluated our approach on 600 datasets published by the City of Los Angeles and show that we can automatically process their data and can efficiently access and analyze the indexed data.