{"title":"Street View Challenge: Identification of Commercial Entities in Street View Imagery","authors":"A. Zamir, A. Darino, M. Shah","doi":"10.1109/ICMLA.2011.181","DOIUrl":null,"url":null,"abstract":"This paper presents our submission to the Street View Challenge of identifying commercial entities in street view imagery. The provided data set of the challenge consists of approximately 129K street view images tagged with GPScoordinates. The problem is to identify different types of businesses visible in these images. Our solution is based on utilizing the textual information. However, the textual content of street view images is challenging in terms of variety and complexity, which limits the success of the approaches that are purely based on processing the content. Therefore, we use a method which leverages both the textual content of the images and business listings, in order to accomplish the identification task successfully. The robustness of our method is due to the fact that the information obtained from the different resources is cross-validated leading to significant improvements compared to the baselines. The experiments show approximately 70% of success rate on the defined problem.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 10th International Conference on Machine Learning and Applications and Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2011.181","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 27
Abstract
This paper presents our submission to the Street View Challenge of identifying commercial entities in street view imagery. The provided data set of the challenge consists of approximately 129K street view images tagged with GPScoordinates. The problem is to identify different types of businesses visible in these images. Our solution is based on utilizing the textual information. However, the textual content of street view images is challenging in terms of variety and complexity, which limits the success of the approaches that are purely based on processing the content. Therefore, we use a method which leverages both the textual content of the images and business listings, in order to accomplish the identification task successfully. The robustness of our method is due to the fact that the information obtained from the different resources is cross-validated leading to significant improvements compared to the baselines. The experiments show approximately 70% of success rate on the defined problem.