Mahassine Defaoui, L. Koutti, Mohamed El Ansari, Redouan Lahmyed, L. Masmoudi
{"title":"PedVis-VGG-16: A Fine-tuned deep convolutional neural network for pedestrian image classifications","authors":"Mahassine Defaoui, L. Koutti, Mohamed El Ansari, Redouan Lahmyed, L. Masmoudi","doi":"10.1109/WINCOM55661.2022.9966465","DOIUrl":null,"url":null,"abstract":"Recently, pedestrian detection has attracted a lot of attention in recent years. It is known as a computer vision research hotspot, widely used in different fields. Despite the impressive progress of its approaches, their performance remains unsatisfactory. This paper proposes PedVis-Vgg-16a deep learning network for automatically detecting pedestrians presence in visible images. The suggested architecture is based on the fine-tuned VGG-16 architecture with modifications to the last block of the model. Different improvement components including data augmentation, parameter optimization, and parameter adaption, were taken to enhance the architecture performance. The newly designed architecture is validated on the publicly available dataset INRIA, which contains 4001 images and the results provided are satisfactory.","PeriodicalId":128342,"journal":{"name":"2022 9th International Conference on Wireless Networks and Mobile Communications (WINCOM)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 9th International Conference on Wireless Networks and Mobile Communications (WINCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WINCOM55661.2022.9966465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Recently, pedestrian detection has attracted a lot of attention in recent years. It is known as a computer vision research hotspot, widely used in different fields. Despite the impressive progress of its approaches, their performance remains unsatisfactory. This paper proposes PedVis-Vgg-16a deep learning network for automatically detecting pedestrians presence in visible images. The suggested architecture is based on the fine-tuned VGG-16 architecture with modifications to the last block of the model. Different improvement components including data augmentation, parameter optimization, and parameter adaption, were taken to enhance the architecture performance. The newly designed architecture is validated on the publicly available dataset INRIA, which contains 4001 images and the results provided are satisfactory.