基于快速形状感知聚类和分类的鸟瞰图多模态人体检测

Csaba Beleznai, Daniel Steininger, G. Croonen, Elisabeth Broneder
{"title":"基于快速形状感知聚类和分类的鸟瞰图多模态人体检测","authors":"Csaba Beleznai, Daniel Steininger, G. Croonen, Elisabeth Broneder","doi":"10.1109/PRRS.2018.8486236","DOIUrl":null,"url":null,"abstract":"Recognizing humans from aerial views represents an increasingly relevant endeavor; a trend mainly driven by the widespread use of unmanned aerial vehicles (UAVs). An accurate and real-time visual human recognition task, however, represents a scientific challenge because typical UAV imaging and computational capabilities and conditions introduce complexities and constraints. Motion blur, the non-specific top-view appearance of humans, low-image resolution and limited onboard computational resources are among the most important limiting factors to be considered. In this paper we propose a run-time-efficient multi-modal detection framework performing clustering and recognition on thermal infrared, passive stereo depth and intensity channels in order to cope with the above complexities and to achieve accurate human detection results. Thermal infrared and depth data are used to generate proposals in combination with an explicit, tree-structured shape representation driven clustering scheme. Generated proposals are used as an input for a discriminatively trained deep classification step to recognize humans. The proposed clustering and classification scheme is validated in qualitative and quantitative terms on four large aerial datasets representing complex (small objects, clutter, occlusions) situations.","PeriodicalId":197319,"journal":{"name":"2018 10th IAPR Workshop on Pattern Recognition in Remote Sensing (PRRS)","volume":"105 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Multi-Modal Human Detection from Aerial Views by Fast Shape-Aware Clustering and Classification\",\"authors\":\"Csaba Beleznai, Daniel Steininger, G. Croonen, Elisabeth Broneder\",\"doi\":\"10.1109/PRRS.2018.8486236\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recognizing humans from aerial views represents an increasingly relevant endeavor; a trend mainly driven by the widespread use of unmanned aerial vehicles (UAVs). An accurate and real-time visual human recognition task, however, represents a scientific challenge because typical UAV imaging and computational capabilities and conditions introduce complexities and constraints. Motion blur, the non-specific top-view appearance of humans, low-image resolution and limited onboard computational resources are among the most important limiting factors to be considered. In this paper we propose a run-time-efficient multi-modal detection framework performing clustering and recognition on thermal infrared, passive stereo depth and intensity channels in order to cope with the above complexities and to achieve accurate human detection results. Thermal infrared and depth data are used to generate proposals in combination with an explicit, tree-structured shape representation driven clustering scheme. Generated proposals are used as an input for a discriminatively trained deep classification step to recognize humans. The proposed clustering and classification scheme is validated in qualitative and quantitative terms on four large aerial datasets representing complex (small objects, clutter, occlusions) situations.\",\"PeriodicalId\":197319,\"journal\":{\"name\":\"2018 10th IAPR Workshop on Pattern Recognition in Remote Sensing (PRRS)\",\"volume\":\"105 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 10th IAPR Workshop on Pattern Recognition in Remote Sensing (PRRS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PRRS.2018.8486236\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 10th IAPR Workshop on Pattern Recognition in Remote Sensing (PRRS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PRRS.2018.8486236","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

摘要

从鸟瞰图中识别人类代表着一项日益相关的努力;这一趋势主要是由无人驾驶飞行器的广泛使用推动的。然而,精确和实时的视觉人类识别任务代表了一项科学挑战,因为典型的无人机成像和计算能力和条件引入了复杂性和限制。运动模糊、人类的非特定俯视图外观、低图像分辨率和有限的机载计算资源是需要考虑的最重要的限制因素。本文提出了一种运行时高效的多模态检测框架,对热红外、被动立体深度和强度通道进行聚类和识别,以应对上述复杂性,并获得准确的人体检测结果。利用热红外和深度数据结合明确的树状结构形状表示驱动的聚类方案生成建议。生成的建议被用作判别训练的深度分类步骤的输入,以识别人类。在代表复杂情况(小目标、杂波、遮挡)的四个大型航空数据集上,对所提出的聚类和分类方案进行了定性和定量验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-Modal Human Detection from Aerial Views by Fast Shape-Aware Clustering and Classification
Recognizing humans from aerial views represents an increasingly relevant endeavor; a trend mainly driven by the widespread use of unmanned aerial vehicles (UAVs). An accurate and real-time visual human recognition task, however, represents a scientific challenge because typical UAV imaging and computational capabilities and conditions introduce complexities and constraints. Motion blur, the non-specific top-view appearance of humans, low-image resolution and limited onboard computational resources are among the most important limiting factors to be considered. In this paper we propose a run-time-efficient multi-modal detection framework performing clustering and recognition on thermal infrared, passive stereo depth and intensity channels in order to cope with the above complexities and to achieve accurate human detection results. Thermal infrared and depth data are used to generate proposals in combination with an explicit, tree-structured shape representation driven clustering scheme. Generated proposals are used as an input for a discriminatively trained deep classification step to recognize humans. The proposed clustering and classification scheme is validated in qualitative and quantitative terms on four large aerial datasets representing complex (small objects, clutter, occlusions) situations.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信