M. Hasan, Md. Ali Hossain, Azmain Yakin Srizon, Abu Sayeed, Mohiuddin Ahmed, Md Rakibul Haquek
{"title":"Improving Performance of a Pre-trained ResNet-50 Based VGGFace Recognition System by Utilizing Retraining as a Heuristic Step","authors":"M. Hasan, Md. Ali Hossain, Azmain Yakin Srizon, Abu Sayeed, Mohiuddin Ahmed, Md Rakibul Haquek","doi":"10.1109/ICCIT54785.2021.9689918","DOIUrl":null,"url":null,"abstract":"Deep learning has remodeled the research aspect of facial recognition throughout the last decade by utilizing multiple processing layers to extract significant facial features. Although this emerging technology has achieved high performance for the face recognition problems, the dilemma of achieving low performance while training with a few samples per class has not been resolved yet. In this study, it has been shown that by utilizing retraining as a heuristic step, ResNet-50 based VGGFace architecture can enhance the performance of the face recognition scheme significantly. Multi-task Cascaded Convolutional Neural Networks have been utilized to crop faces first. The first training phase was completed by considering train samples from a combined dataset of 5-celebrity dataset, Georgia tech database, and three variants of KomNet datasets. The retraining of individual datasets further produced 94.41% test accuracy for the KomNet social media dataset and 100% test accuracy for the other four datasets.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 24th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT54785.2021.9689918","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Deep learning has remodeled the research aspect of facial recognition throughout the last decade by utilizing multiple processing layers to extract significant facial features. Although this emerging technology has achieved high performance for the face recognition problems, the dilemma of achieving low performance while training with a few samples per class has not been resolved yet. In this study, it has been shown that by utilizing retraining as a heuristic step, ResNet-50 based VGGFace architecture can enhance the performance of the face recognition scheme significantly. Multi-task Cascaded Convolutional Neural Networks have been utilized to crop faces first. The first training phase was completed by considering train samples from a combined dataset of 5-celebrity dataset, Georgia tech database, and three variants of KomNet datasets. The retraining of individual datasets further produced 94.41% test accuracy for the KomNet social media dataset and 100% test accuracy for the other four datasets.