{"title":"远距离目标检测的数据驱动街景布局估计","authors":"Donghao Zhang, Xuming He, Hanxi Li","doi":"10.1109/DICTA.2014.7008099","DOIUrl":null,"url":null,"abstract":"We present a street scene layout estimation method based on transferring layout annotation from a (large) image database and its application for distant object detection. Inspired by nonparametric scene labeling approaches, we estimate a scene's geometric layout by matching global image descriptors and retrieving the most similar layout configuration. Our label transfer is done for each sub-region of an image and a tiered scene model is used to integrate all the local label information into a coherent scene layout prediction. Given the geometric layout, we use a super-resolution method to zoom in the distance region and refine the search in object detection. On KITTI dataset, we show that we can reliably generate scene layout and improve the detection of distant cars over the state of the art DPM detector.","PeriodicalId":146695,"journal":{"name":"2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Data-Driven Street Scene Layout Estimation for Distant Object Detection\",\"authors\":\"Donghao Zhang, Xuming He, Hanxi Li\",\"doi\":\"10.1109/DICTA.2014.7008099\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a street scene layout estimation method based on transferring layout annotation from a (large) image database and its application for distant object detection. Inspired by nonparametric scene labeling approaches, we estimate a scene's geometric layout by matching global image descriptors and retrieving the most similar layout configuration. Our label transfer is done for each sub-region of an image and a tiered scene model is used to integrate all the local label information into a coherent scene layout prediction. Given the geometric layout, we use a super-resolution method to zoom in the distance region and refine the search in object detection. On KITTI dataset, we show that we can reliably generate scene layout and improve the detection of distant cars over the state of the art DPM detector.\",\"PeriodicalId\":146695,\"journal\":{\"name\":\"2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA)\",\"volume\":\"83 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DICTA.2014.7008099\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2014.7008099","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data-Driven Street Scene Layout Estimation for Distant Object Detection
We present a street scene layout estimation method based on transferring layout annotation from a (large) image database and its application for distant object detection. Inspired by nonparametric scene labeling approaches, we estimate a scene's geometric layout by matching global image descriptors and retrieving the most similar layout configuration. Our label transfer is done for each sub-region of an image and a tiered scene model is used to integrate all the local label information into a coherent scene layout prediction. Given the geometric layout, we use a super-resolution method to zoom in the distance region and refine the search in object detection. On KITTI dataset, we show that we can reliably generate scene layout and improve the detection of distant cars over the state of the art DPM detector.