{"title":"在openstreetmap中寻找等效键:基于扩展定义的语义相似度计算","authors":"I. Majić, S. Winter, Martin Tomko","doi":"10.1145/3149808.3149813","DOIUrl":null,"url":null,"abstract":"Volunteered Geographic Information (VGI) projects, such as Open-StreetMap (OSM) enable the public to contribute to the collection of spatial data. In OSM, users may deviate from spatial feature annotation guidelines and create new tags (i.e. key=value pairs), even if recommended tags exist. This is problematic, as undocumented tags have no set meaning, and they potentially contribute to the dataset heterogeneity and thus reduce usability. This paper proposes an unsupervised approach to identify equivalent documented attribute keys to the used undocumented keys. Based on their extensional definitions through their values, co-occurring keys and geometries of the features they annotate, the semantic similarity of OSM keys is evaluated. The approach has been tested on the OSM dataset for the state of Victoria, Australia. Results have been evaluated against a set of manually detected equivalent keys and show that the method is plausible, but may fail if some assumptions about tag use are not enforced, e.g., semantically unique tags.","PeriodicalId":158183,"journal":{"name":"Proceedings of the 1st Workshop on Artificial Intelligence and Deep Learning for Geographic Knowledge Discovery","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Finding equivalent keys in openstreetmap: semantic similarity computation based on extensional definitions\",\"authors\":\"I. Majić, S. Winter, Martin Tomko\",\"doi\":\"10.1145/3149808.3149813\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Volunteered Geographic Information (VGI) projects, such as Open-StreetMap (OSM) enable the public to contribute to the collection of spatial data. In OSM, users may deviate from spatial feature annotation guidelines and create new tags (i.e. key=value pairs), even if recommended tags exist. This is problematic, as undocumented tags have no set meaning, and they potentially contribute to the dataset heterogeneity and thus reduce usability. This paper proposes an unsupervised approach to identify equivalent documented attribute keys to the used undocumented keys. Based on their extensional definitions through their values, co-occurring keys and geometries of the features they annotate, the semantic similarity of OSM keys is evaluated. The approach has been tested on the OSM dataset for the state of Victoria, Australia. Results have been evaluated against a set of manually detected equivalent keys and show that the method is plausible, but may fail if some assumptions about tag use are not enforced, e.g., semantically unique tags.\",\"PeriodicalId\":158183,\"journal\":{\"name\":\"Proceedings of the 1st Workshop on Artificial Intelligence and Deep Learning for Geographic Knowledge Discovery\",\"volume\":\"2016 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 1st Workshop on Artificial Intelligence and Deep Learning for Geographic Knowledge Discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3149808.3149813\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st Workshop on Artificial Intelligence and Deep Learning for Geographic Knowledge Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3149808.3149813","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Finding equivalent keys in openstreetmap: semantic similarity computation based on extensional definitions
Volunteered Geographic Information (VGI) projects, such as Open-StreetMap (OSM) enable the public to contribute to the collection of spatial data. In OSM, users may deviate from spatial feature annotation guidelines and create new tags (i.e. key=value pairs), even if recommended tags exist. This is problematic, as undocumented tags have no set meaning, and they potentially contribute to the dataset heterogeneity and thus reduce usability. This paper proposes an unsupervised approach to identify equivalent documented attribute keys to the used undocumented keys. Based on their extensional definitions through their values, co-occurring keys and geometries of the features they annotate, the semantic similarity of OSM keys is evaluated. The approach has been tested on the OSM dataset for the state of Victoria, Australia. Results have been evaluated against a set of manually detected equivalent keys and show that the method is plausible, but may fail if some assumptions about tag use are not enforced, e.g., semantically unique tags.