Pedestrian recognition through different cross-modality deep learning methods

D. Pop, A. Rogozan, F. Nashashibi, A. Bensrhair
{"title":"行人识别通过不同的跨模态深度学习方法","authors":"D. Pop, A. Rogozan, F. Nashashibi, A. Bensrhair","doi":"10.1109/ICVES.2017.7991914","DOIUrl":null,"url":null,"abstract":"A wide variety of approaches have been proposed for pedestrian detection in the last decay and it still remains an open challenge due to its outstanding importance in the field of automotive. In recent years, deep learning classification methods, in particular convolutional neural networks, combined with multi-modality images applied on different fusion schemes have achieved great performances in computer vision tasks. For the pedestrian recognition task, the late-fusion scheme outperforms the early and intermediate integration of modalities. In this paper, we focus on improving and optimizing the late-fusion scheme for pedestrian classification on the Daimler stereo vision data set. We propose different training methods based on Cross-Modality deep learning of Convolutional Neural Networks (CNNs): (1) a correlated model, (2) an incremental model and, (3) a particular cross-modality model, where each CNN is trained on one modality, but tested on a different one. The experiments show that the incremental cross-modality deep learning of CNNs achieves the best performances. It improves the classification performances not only for each modality classifier, but also for the multi-modality late-fusion scheme. The particular cross-modality model is a promising idea for automated annotation of modality images with a classifier trained on a different modality and/or for cross-dataset training.","PeriodicalId":303389,"journal":{"name":"2017 IEEE International Conference on Vehicular Electronics and Safety (ICVES)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Pedestrian recognition through different cross-modality deep learning methods\",\"authors\":\"D. Pop, A. Rogozan, F. Nashashibi, A. Bensrhair\",\"doi\":\"10.1109/ICVES.2017.7991914\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A wide variety of approaches have been proposed for pedestrian detection in the last decay and it still remains an open challenge due to its outstanding importance in the field of automotive. In recent years, deep learning classification methods, in particular convolutional neural networks, combined with multi-modality images applied on different fusion schemes have achieved great performances in computer vision tasks. For the pedestrian recognition task, the late-fusion scheme outperforms the early and intermediate integration of modalities. In this paper, we focus on improving and optimizing the late-fusion scheme for pedestrian classification on the Daimler stereo vision data set. We propose different training methods based on Cross-Modality deep learning of Convolutional Neural Networks (CNNs): (1) a correlated model, (2) an incremental model and, (3) a particular cross-modality model, where each CNN is trained on one modality, but tested on a different one. The experiments show that the incremental cross-modality deep learning of CNNs achieves the best performances. It improves the classification performances not only for each modality classifier, but also for the multi-modality late-fusion scheme. 
The particular cross-modality model is a promising idea for automated annotation of modality images with a classifier trained on a different modality and/or for cross-dataset training.\",\"PeriodicalId\":303389,\"journal\":{\"name\":\"2017 IEEE International Conference on Vehicular Electronics and Safety (ICVES)\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE International Conference on Vehicular Electronics and Safety (ICVES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICVES.2017.7991914\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Vehicular Electronics and Safety (ICVES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICVES.2017.7991914","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 7

Abstract

A wide variety of approaches to pedestrian detection have been proposed over the last decade, yet it remains an open challenge given its importance to the automotive field. In recent years, deep learning classifiers, in particular convolutional neural networks (CNNs), combined with multi-modality images under different fusion schemes, have achieved strong performance on computer vision tasks. For the pedestrian recognition task, the late-fusion scheme outperforms early and intermediate integration of modalities. In this paper, we focus on improving and optimizing the late-fusion scheme for pedestrian classification on the Daimler stereo vision data set. We propose different training methods based on cross-modality deep learning of CNNs: (1) a correlated model, (2) an incremental model, and (3) a particular cross-modality model, where each CNN is trained on one modality but tested on a different one. The experiments show that incremental cross-modality deep learning of CNNs achieves the best performance: it improves classification not only for each single-modality classifier but also for the multi-modality late-fusion scheme. The particular cross-modality model is a promising idea for automated annotation of modality images with a classifier trained on a different modality and/or for cross-dataset training.
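To make the fusion and training schemes mentioned in the abstract concrete, below is a minimal sketch (not the authors' code) in Python/PyTorch. It assumes two image modalities (e.g. intensity and depth crops from the Daimler set), a small illustrative CNN per modality, score averaging as the late-fusion rule, and sequential fine-tuning over per-modality data loaders as the incremental cross-modality step; the architecture, fusion rule, and hyperparameters are illustrative assumptions, not taken from the paper.

    # Minimal sketch of late fusion and incremental cross-modality training.
    # All architectural details below are assumptions for illustration only.
    import torch
    import torch.nn as nn

    class ModalityCNN(nn.Module):
        """Single-modality pedestrian / non-pedestrian classifier (toy architecture)."""
        def __init__(self, num_classes: int = 2):
            super().__init__()
            self.features = nn.Sequential(
                nn.Conv2d(1, 16, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
                nn.Conv2d(16, 32, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
            )
            self.classifier = nn.Sequential(
                nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(32, num_classes)
            )

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            return self.classifier(self.features(x))

    class LateFusion(nn.Module):
        """Decision-level (late) fusion: average the softmax scores of per-modality CNNs."""
        def __init__(self, cnns):
            super().__init__()
            self.cnns = nn.ModuleList(cnns)

        def forward(self, inputs):
            # inputs: one tensor per modality, in the same order as the CNNs
            scores = [torch.softmax(cnn(x), dim=1) for cnn, x in zip(self.cnns, inputs)]
            return torch.stack(scores).mean(dim=0)

    def incremental_training(cnn, modality_loaders, epochs_per_modality=1, lr=1e-3):
        """Incremental cross-modality training (sketch): train the same CNN weights
        on one modality, then continue on the next, before plugging it into fusion."""
        opt = torch.optim.Adam(cnn.parameters(), lr=lr)
        loss_fn = nn.CrossEntropyLoss()
        for loader in modality_loaders:      # e.g. [intensity_loader, depth_loader]
            for _ in range(epochs_per_modality):
                for x, y in loader:
                    opt.zero_grad()
                    loss_fn(cnn(x), y).backward()
                    opt.step()
        return cnn

Usage would follow the paper's pipeline loosely: train one ModalityCNN per modality (optionally with the incremental schedule above), then wrap the trained classifiers in LateFusion and compare the fused scores against the single-modality outputs.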