一种新的RGB-D数据中目标检测的目标建议生成方法

Sang-Il Oh, Hang-Bong Kang
{"title":"一种新的RGB-D数据中目标检测的目标建议生成方法","authors":"Sang-Il Oh, Hang-Bong Kang","doi":"10.1109/SAMI.2017.7880341","DOIUrl":null,"url":null,"abstract":"This paper proposes a modified selective search method that generates object proposals on RGB-D data in indoor scenes. The proposed method first applies color flattening to generate monotonous color variations in RGB image data. Then, from the color-flattened image and depth map data, cost function-based segment grouping and depth segmentation are applied to produce desirable segmentation results. Segment grouping using cost function on image data computes dissimilarities in color, texture, and size between two adjacent regions with pre-learned weights. Depth segmentation uses the height difference of grid cells in the binned depth grid map. The final set of object proposal regions extracted from the RGB image and depth map data is organized by considering the overlapping between two data modalities. Finally, the extracted set of object proposal regions is fed into AlexNet or VGG-16, both of which are widely used for object classification, to evaluate our method on object detection and classification tasks. The proposed segment-based method can precisely detect meaningful object regions using a smaller number of proposals than other methods. Further, its detection and classification performance are better than those of previous methods.","PeriodicalId":105599,"journal":{"name":"2017 IEEE 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A new object proposal generation method for object detection in RGB-D data\",\"authors\":\"Sang-Il Oh, Hang-Bong Kang\",\"doi\":\"10.1109/SAMI.2017.7880341\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a modified selective search method that generates object proposals on RGB-D data in indoor scenes. The proposed method first applies color flattening to generate monotonous color variations in RGB image data. Then, from the color-flattened image and depth map data, cost function-based segment grouping and depth segmentation are applied to produce desirable segmentation results. Segment grouping using cost function on image data computes dissimilarities in color, texture, and size between two adjacent regions with pre-learned weights. Depth segmentation uses the height difference of grid cells in the binned depth grid map. The final set of object proposal regions extracted from the RGB image and depth map data is organized by considering the overlapping between two data modalities. Finally, the extracted set of object proposal regions is fed into AlexNet or VGG-16, both of which are widely used for object classification, to evaluate our method on object detection and classification tasks. The proposed segment-based method can precisely detect meaningful object regions using a smaller number of proposals than other methods. Further, its detection and classification performance are better than those of previous methods.\",\"PeriodicalId\":105599,\"journal\":{\"name\":\"2017 IEEE 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI)\",\"volume\":\"2015 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SAMI.2017.7880341\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SAMI.2017.7880341","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

本文提出了一种基于室内场景RGB-D数据生成目标建议的改进选择搜索方法。该方法首先利用颜色平坦化技术在RGB图像数据中产生单调的颜色变化。然后,从彩色平面图像和深度图数据出发,采用基于成本函数的分段分组和深度分割,得到理想的分割结果。使用代价函数对图像数据进行分段分组,通过预先学习的权重计算两个相邻区域之间的颜色、纹理和大小的不相似性。深度分割利用分箱深度网格图中网格单元的高度差。通过考虑两种数据模态之间的重叠,从RGB图像和深度图数据中提取出最终的目标建议区域集。最后,将提取的目标建议区域集输入到广泛用于目标分类的AlexNet或VGG-16中,以评估我们的方法在目标检测和分类任务上的性能。与其他方法相比,本文提出的基于片段的方法可以使用较少的建议数量来精确检测有意义的目标区域。此外,该方法的检测和分类性能优于以往的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A new object proposal generation method for object detection in RGB-D data
This paper proposes a modified selective search method that generates object proposals on RGB-D data in indoor scenes. The proposed method first applies color flattening to generate monotonous color variations in RGB image data. Then, from the color-flattened image and depth map data, cost function-based segment grouping and depth segmentation are applied to produce desirable segmentation results. Segment grouping using cost function on image data computes dissimilarities in color, texture, and size between two adjacent regions with pre-learned weights. Depth segmentation uses the height difference of grid cells in the binned depth grid map. The final set of object proposal regions extracted from the RGB image and depth map data is organized by considering the overlapping between two data modalities. Finally, the extracted set of object proposal regions is fed into AlexNet or VGG-16, both of which are widely used for object classification, to evaluate our method on object detection and classification tasks. The proposed segment-based method can precisely detect meaningful object regions using a smaller number of proposals than other methods. Further, its detection and classification performance are better than those of previous methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信