{"title":"一种新的RGB-D数据中目标检测的目标建议生成方法","authors":"Sang-Il Oh, Hang-Bong Kang","doi":"10.1109/SAMI.2017.7880341","DOIUrl":null,"url":null,"abstract":"This paper proposes a modified selective search method that generates object proposals on RGB-D data in indoor scenes. The proposed method first applies color flattening to generate monotonous color variations in RGB image data. Then, from the color-flattened image and depth map data, cost function-based segment grouping and depth segmentation are applied to produce desirable segmentation results. Segment grouping using cost function on image data computes dissimilarities in color, texture, and size between two adjacent regions with pre-learned weights. Depth segmentation uses the height difference of grid cells in the binned depth grid map. The final set of object proposal regions extracted from the RGB image and depth map data is organized by considering the overlapping between two data modalities. Finally, the extracted set of object proposal regions is fed into AlexNet or VGG-16, both of which are widely used for object classification, to evaluate our method on object detection and classification tasks. The proposed segment-based method can precisely detect meaningful object regions using a smaller number of proposals than other methods. Further, its detection and classification performance are better than those of previous methods.","PeriodicalId":105599,"journal":{"name":"2017 IEEE 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A new object proposal generation method for object detection in RGB-D data\",\"authors\":\"Sang-Il Oh, Hang-Bong Kang\",\"doi\":\"10.1109/SAMI.2017.7880341\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a modified selective search method that generates object proposals on RGB-D data in indoor scenes. The proposed method first applies color flattening to generate monotonous color variations in RGB image data. Then, from the color-flattened image and depth map data, cost function-based segment grouping and depth segmentation are applied to produce desirable segmentation results. Segment grouping using cost function on image data computes dissimilarities in color, texture, and size between two adjacent regions with pre-learned weights. Depth segmentation uses the height difference of grid cells in the binned depth grid map. The final set of object proposal regions extracted from the RGB image and depth map data is organized by considering the overlapping between two data modalities. Finally, the extracted set of object proposal regions is fed into AlexNet or VGG-16, both of which are widely used for object classification, to evaluate our method on object detection and classification tasks. The proposed segment-based method can precisely detect meaningful object regions using a smaller number of proposals than other methods. Further, its detection and classification performance are better than those of previous methods.\",\"PeriodicalId\":105599,\"journal\":{\"name\":\"2017 IEEE 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI)\",\"volume\":\"2015 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SAMI.2017.7880341\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SAMI.2017.7880341","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A new object proposal generation method for object detection in RGB-D data
This paper proposes a modified selective search method that generates object proposals on RGB-D data in indoor scenes. The proposed method first applies color flattening to generate monotonous color variations in RGB image data. Then, from the color-flattened image and depth map data, cost function-based segment grouping and depth segmentation are applied to produce desirable segmentation results. Segment grouping using cost function on image data computes dissimilarities in color, texture, and size between two adjacent regions with pre-learned weights. Depth segmentation uses the height difference of grid cells in the binned depth grid map. The final set of object proposal regions extracted from the RGB image and depth map data is organized by considering the overlapping between two data modalities. Finally, the extracted set of object proposal regions is fed into AlexNet or VGG-16, both of which are widely used for object classification, to evaluate our method on object detection and classification tasks. The proposed segment-based method can precisely detect meaningful object regions using a smaller number of proposals than other methods. Further, its detection and classification performance are better than those of previous methods.