Shivangi Srivastava, John E. Vargas-Muñoz, David Swinkels, D. Tuia
{"title":"基于卷积神经网络的地面图像多标签构建函数分类","authors":"Shivangi Srivastava, John E. Vargas-Muñoz, David Swinkels, D. Tuia","doi":"10.1145/3281548.3281559","DOIUrl":null,"url":null,"abstract":"We approach the problem of multi building function classification for buildings from the city of Amsterdam using a collection of Google Street View (GSV) pictures acquired at multiple zoom levels (field of views, FoV) and the corresponding governmental census data per building. Since buildings can have multiple usages, we cast the problem as multilabel classification task. To do so, we trained a CNN model end-to-end with the task of predicting multiple co-occurring building function classes per building. We fuse the individual features of three FoVs by using volumetric stacking. Our proposed model outperforms baseline CNN models that use either single or multiple FoVs.","PeriodicalId":231184,"journal":{"name":"Proceedings of the 2nd ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Multilabel Building Functions Classification from Ground Pictures using Convolutional Neural Networks\",\"authors\":\"Shivangi Srivastava, John E. Vargas-Muñoz, David Swinkels, D. Tuia\",\"doi\":\"10.1145/3281548.3281559\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We approach the problem of multi building function classification for buildings from the city of Amsterdam using a collection of Google Street View (GSV) pictures acquired at multiple zoom levels (field of views, FoV) and the corresponding governmental census data per building. Since buildings can have multiple usages, we cast the problem as multilabel classification task. To do so, we trained a CNN model end-to-end with the task of predicting multiple co-occurring building function classes per building. We fuse the individual features of three FoVs by using volumetric stacking. Our proposed model outperforms baseline CNN models that use either single or multiple FoVs.\",\"PeriodicalId\":231184,\"journal\":{\"name\":\"Proceedings of the 2nd ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2nd ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3281548.3281559\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3281548.3281559","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multilabel Building Functions Classification from Ground Pictures using Convolutional Neural Networks
We approach the problem of multi building function classification for buildings from the city of Amsterdam using a collection of Google Street View (GSV) pictures acquired at multiple zoom levels (field of views, FoV) and the corresponding governmental census data per building. Since buildings can have multiple usages, we cast the problem as multilabel classification task. To do so, we trained a CNN model end-to-end with the task of predicting multiple co-occurring building function classes per building. We fuse the individual features of three FoVs by using volumetric stacking. Our proposed model outperforms baseline CNN models that use either single or multiple FoVs.