Mining Mid-Level Visual Elements for Object Detection in High-Resolution Remote Sensing Images
Xinle Liu, Hui-bin Yan, H. Huo, T. Fang
2018 10th IAPR Workshop on Pattern Recognition in Remote Sensing (PRRS), published 2018-08-01
DOI: 10.1109/PRRS.2018.8486179 (https://doi.org/10.1109/PRRS.2018.8486179)
The goal of mining mid-level visual elements is to discover a set of image patches that are both representative of and discriminative for a target category. Commonly used mid-level feature representations for high-resolution remote sensing (HRS) images, such as bag-of-visual-words (BOW) models and part-based models, seldom consider the discriminability of the visual words or parts used in object detection. To address this problem, we propose a novel and effective HRS image object detection method based on mid-level visual element representations. First, we employ an iterative procedure that alternates between retraining discriminative classifiers and mining for additional patch instances to discover discriminative patches, i.e., discriminative mid-level visual elements. Then, a novel mid-level feature representation for an image is constructed from these visual elements to achieve object detection in HRS images. Experiments on two HRS image datasets demonstrate the effectiveness of the proposed method compared with several state-of-the-art BOW-based and part-based models.
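The alternation between retraining a classifier and mining more patch instances can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract names neither the patch features nor the classifier, so the sketch assumes toy 2-D feature vectors and swaps in a nearest-centroid linear scorer for the discriminative classifier. All function names (`train_detector`, `mine_element`, `encode_image`, `make_patches`) are hypothetical, and the max-pooled image encoding is likewise an assumption about how an element-based representation might be built.

```python
import random

def mean(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def train_detector(members, negatives):
    """Assumed stand-in classifier: nearest-centroid linear scorer.
    w points from the negative mean to the member mean; the bias puts
    the decision boundary at the midpoint between the two means."""
    mu_pos, mu_neg = mean(members), mean(negatives)
    w = [p - q for p, q in zip(mu_pos, mu_neg)]
    midpoint = [(p + q) / 2 for p, q in zip(mu_pos, mu_neg)]
    return w, -dot(w, midpoint)

def score(detector, patch):
    w, b = detector
    return dot(w, patch) + b

def mine_element(seed, pool, negatives, top_k=5, iterations=4):
    """Alternate between (1) mining the top-scoring patches from the pool
    and (2) retraining the detector on those mined patches."""
    members = [seed]
    detector = train_detector(members, negatives)
    for _ in range(iterations):
        ranked = sorted(pool, key=lambda p: score(detector, p), reverse=True)
        members = ranked[:top_k]                        # mining step
        detector = train_detector(members, negatives)   # retraining step
    return detector, members

def encode_image(patches, detectors):
    """Assumed encoding: max response of each element detector over the image's patches."""
    return [max(score(d, p) for p in patches) for d in detectors]

def make_patches(center, n, rng):
    """Synthetic patch features: the center plus bounded uniform jitter."""
    return [[c + rng.uniform(-0.5, 0.5) for c in center] for _ in range(n)]

rng = random.Random(0)
target = make_patches([3.0, 3.0], 10, rng)     # patches showing the target category
clutter = make_patches([0.0, 0.0], 10, rng)    # distractor patches in the same images
negatives = make_patches([0.0, 0.0], 10, rng)  # patches from background-only images
pool = target + clutter

detector, members = mine_element(target[0], pool, negatives)
object_image = make_patches([0.0, 0.0], 4, rng) + make_patches([3.0, 3.0], 1, rng)
background_image = make_patches([0.0, 0.0], 5, rng)

print("mined members:", len(members))
print("object image feature:", encode_image(object_image, [detector]))
print("background image feature:", encode_image(background_image, [detector]))
```

In this toy setting the mined members all come from the target cluster, and the image containing a single target patch receives a positive element response while the background-only image does not. In practice one detector would be mined per seed patch, and the resulting feature vectors would feed a downstream object detector.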