{"title":"使用预训练网络的光照变化下的物体识别","authors":"Kalpathy Sivaraman, A. Murthy","doi":"10.1109/AIPR.2018.8707399","DOIUrl":null,"url":null,"abstract":"We report the object-recognition performance of VGG16, ResNet, and SqueezeNet, three state-of-the-art Convolutional Neural Networks (CNNs) trained on ImageNet, across 15 different lighting conditions using the Phos dataset and a ResNet-like network trained on Pascal VOC on the ExDark dataset. The instabilities in the normalized softmax values are used to highlight that pre-trained networks are not robust to lighting variations. Our investigation yields a robustness analysis framework for analyzing the performance of CNNs under different lighting conditions.The Phos dataset consists of 15 scenes captured under different illumination conditions: 9 images captured under various strengths of uniform illumination, and 6 images under different degrees of non-uniform illumination. The ExDARK dataset consists of ten scenes under different illumination conditions. A Keras-based pipeline was developed to study the softmax values output by ImageNet-trained VGG16, ResNet, and SqueezeNet for the same object under the 15 different lighting conditions of the Phos dataset. A ResNet architecture was trained end-to-end on the PASCAL VOC dataset. Large variations observed in the softmax values provide empirical evidence of unstable performance and the need to augment training to account for lighting variations.","PeriodicalId":230582,"journal":{"name":"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Object Recognition under Lighting Variations using Pre-Trained Networks\",\"authors\":\"Kalpathy Sivaraman, A. Murthy\",\"doi\":\"10.1109/AIPR.2018.8707399\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We report the object-recognition performance of VGG16, ResNet, and SqueezeNet, three state-of-the-art Convolutional Neural Networks (CNNs) trained on ImageNet, across 15 different lighting conditions using the Phos dataset and a ResNet-like network trained on Pascal VOC on the ExDark dataset. The instabilities in the normalized softmax values are used to highlight that pre-trained networks are not robust to lighting variations. Our investigation yields a robustness analysis framework for analyzing the performance of CNNs under different lighting conditions.The Phos dataset consists of 15 scenes captured under different illumination conditions: 9 images captured under various strengths of uniform illumination, and 6 images under different degrees of non-uniform illumination. The ExDARK dataset consists of ten scenes under different illumination conditions. A Keras-based pipeline was developed to study the softmax values output by ImageNet-trained VGG16, ResNet, and SqueezeNet for the same object under the 15 different lighting conditions of the Phos dataset. A ResNet architecture was trained end-to-end on the PASCAL VOC dataset. Large variations observed in the softmax values provide empirical evidence of unstable performance and the need to augment training to account for lighting variations.\",\"PeriodicalId\":230582,\"journal\":{\"name\":\"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)\",\"volume\":\"58 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIPR.2018.8707399\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIPR.2018.8707399","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Object Recognition under Lighting Variations using Pre-Trained Networks
We report the object-recognition performance of VGG16, ResNet, and SqueezeNet, three state-of-the-art Convolutional Neural Networks (CNNs) trained on ImageNet, across 15 different lighting conditions using the Phos dataset and a ResNet-like network trained on Pascal VOC on the ExDark dataset. The instabilities in the normalized softmax values are used to highlight that pre-trained networks are not robust to lighting variations. Our investigation yields a robustness analysis framework for analyzing the performance of CNNs under different lighting conditions.The Phos dataset consists of 15 scenes captured under different illumination conditions: 9 images captured under various strengths of uniform illumination, and 6 images under different degrees of non-uniform illumination. The ExDARK dataset consists of ten scenes under different illumination conditions. A Keras-based pipeline was developed to study the softmax values output by ImageNet-trained VGG16, ResNet, and SqueezeNet for the same object under the 15 different lighting conditions of the Phos dataset. A ResNet architecture was trained end-to-end on the PASCAL VOC dataset. Large variations observed in the softmax values provide empirical evidence of unstable performance and the need to augment training to account for lighting variations.