使用预训练网络的光照变化下的物体识别

Kalpathy Sivaraman, A. Murthy
{"title":"使用预训练网络的光照变化下的物体识别","authors":"Kalpathy Sivaraman, A. Murthy","doi":"10.1109/AIPR.2018.8707399","DOIUrl":null,"url":null,"abstract":"We report the object-recognition performance of VGG16, ResNet, and SqueezeNet, three state-of-the-art Convolutional Neural Networks (CNNs) trained on ImageNet, across 15 different lighting conditions using the Phos dataset and a ResNet-like network trained on Pascal VOC on the ExDark dataset. The instabilities in the normalized softmax values are used to highlight that pre-trained networks are not robust to lighting variations. Our investigation yields a robustness analysis framework for analyzing the performance of CNNs under different lighting conditions.The Phos dataset consists of 15 scenes captured under different illumination conditions: 9 images captured under various strengths of uniform illumination, and 6 images under different degrees of non-uniform illumination. The ExDARK dataset consists of ten scenes under different illumination conditions. A Keras-based pipeline was developed to study the softmax values output by ImageNet-trained VGG16, ResNet, and SqueezeNet for the same object under the 15 different lighting conditions of the Phos dataset. A ResNet architecture was trained end-to-end on the PASCAL VOC dataset. Large variations observed in the softmax values provide empirical evidence of unstable performance and the need to augment training to account for lighting variations.","PeriodicalId":230582,"journal":{"name":"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Object Recognition under Lighting Variations using Pre-Trained Networks\",\"authors\":\"Kalpathy Sivaraman, A. Murthy\",\"doi\":\"10.1109/AIPR.2018.8707399\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We report the object-recognition performance of VGG16, ResNet, and SqueezeNet, three state-of-the-art Convolutional Neural Networks (CNNs) trained on ImageNet, across 15 different lighting conditions using the Phos dataset and a ResNet-like network trained on Pascal VOC on the ExDark dataset. The instabilities in the normalized softmax values are used to highlight that pre-trained networks are not robust to lighting variations. Our investigation yields a robustness analysis framework for analyzing the performance of CNNs under different lighting conditions.The Phos dataset consists of 15 scenes captured under different illumination conditions: 9 images captured under various strengths of uniform illumination, and 6 images under different degrees of non-uniform illumination. The ExDARK dataset consists of ten scenes under different illumination conditions. A Keras-based pipeline was developed to study the softmax values output by ImageNet-trained VGG16, ResNet, and SqueezeNet for the same object under the 15 different lighting conditions of the Phos dataset. A ResNet architecture was trained end-to-end on the PASCAL VOC dataset. Large variations observed in the softmax values provide empirical evidence of unstable performance and the need to augment training to account for lighting variations.\",\"PeriodicalId\":230582,\"journal\":{\"name\":\"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)\",\"volume\":\"58 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIPR.2018.8707399\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIPR.2018.8707399","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

我们报告了在ImageNet上训练的三个最先进的卷积神经网络(cnn) VGG16、ResNet和SqueezeNet在15种不同光照条件下的目标识别性能,使用Phos数据集和在ExDark数据集上使用Pascal VOC训练的类似ResNet的网络。标准化softmax值的不稳定性被用来强调预训练的网络对光照变化的鲁棒性。我们的研究产生了一个鲁棒性分析框架,用于分析cnn在不同照明条件下的性能。Phos数据集包括在不同光照条件下拍摄的15个场景:9幅在不同强度均匀光照下拍摄的图像,6幅在不同程度非均匀光照下拍摄的图像。ExDARK数据集由10个不同光照条件下的场景组成。开发了一个基于keras的管道来研究imagenet训练的VGG16、ResNet和SqueezeNet在Phos数据集的15种不同光照条件下对同一物体输出的softmax值。在PASCAL VOC数据集上对ResNet架构进行端到端训练。在softmax值中观察到的大变化提供了不稳定性能的经验证据,并且需要增加训练以解释照明变化。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Object Recognition under Lighting Variations using Pre-Trained Networks
We report the object-recognition performance of VGG16, ResNet, and SqueezeNet, three state-of-the-art Convolutional Neural Networks (CNNs) trained on ImageNet, across 15 different lighting conditions using the Phos dataset and a ResNet-like network trained on Pascal VOC on the ExDark dataset. The instabilities in the normalized softmax values are used to highlight that pre-trained networks are not robust to lighting variations. Our investigation yields a robustness analysis framework for analyzing the performance of CNNs under different lighting conditions.The Phos dataset consists of 15 scenes captured under different illumination conditions: 9 images captured under various strengths of uniform illumination, and 6 images under different degrees of non-uniform illumination. The ExDARK dataset consists of ten scenes under different illumination conditions. A Keras-based pipeline was developed to study the softmax values output by ImageNet-trained VGG16, ResNet, and SqueezeNet for the same object under the 15 different lighting conditions of the Phos dataset. A ResNet architecture was trained end-to-end on the PASCAL VOC dataset. Large variations observed in the softmax values provide empirical evidence of unstable performance and the need to augment training to account for lighting variations.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信