M. Hasan, Md. Ali Hossain, Azmain Yakin Srizon, Abu Sayeed, Mohiuddin Ahmed, Md Rakibul Haquek
{"title":"利用再训练作为启发式步骤提高预训练的基于ResNet-50的vgg人脸识别系统的性能","authors":"M. Hasan, Md. Ali Hossain, Azmain Yakin Srizon, Abu Sayeed, Mohiuddin Ahmed, Md Rakibul Haquek","doi":"10.1109/ICCIT54785.2021.9689918","DOIUrl":null,"url":null,"abstract":"Deep learning has remodeled the research aspect of facial recognition throughout the last decade by utilizing multiple processing layers to extract significant facial features. Although this emerging technology has achieved high performance for the face recognition problems, the dilemma of achieving low performance while training with a few samples per class has not been resolved yet. In this study, it has been shown that by utilizing retraining as a heuristic step, ResNet-50 based VGGFace architecture can enhance the performance of the face recognition scheme significantly. Multi-task Cascaded Convolutional Neural Networks have been utilized to crop faces first. The first training phase was completed by considering train samples from a combined dataset of 5-celebrity dataset, Georgia tech database, and three variants of KomNet datasets. The retraining of individual datasets further produced 94.41% test accuracy for the KomNet social media dataset and 100% test accuracy for the other four datasets.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improving Performance of a Pre-trained ResNet-50 Based VGGFace Recognition System by Utilizing Retraining as a Heuristic Step\",\"authors\":\"M. Hasan, Md. Ali Hossain, Azmain Yakin Srizon, Abu Sayeed, Mohiuddin Ahmed, Md Rakibul Haquek\",\"doi\":\"10.1109/ICCIT54785.2021.9689918\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning has remodeled the research aspect of facial recognition throughout the last decade by utilizing multiple processing layers to extract significant facial features. Although this emerging technology has achieved high performance for the face recognition problems, the dilemma of achieving low performance while training with a few samples per class has not been resolved yet. In this study, it has been shown that by utilizing retraining as a heuristic step, ResNet-50 based VGGFace architecture can enhance the performance of the face recognition scheme significantly. Multi-task Cascaded Convolutional Neural Networks have been utilized to crop faces first. The first training phase was completed by considering train samples from a combined dataset of 5-celebrity dataset, Georgia tech database, and three variants of KomNet datasets. The retraining of individual datasets further produced 94.41% test accuracy for the KomNet social media dataset and 100% test accuracy for the other four datasets.\",\"PeriodicalId\":166450,\"journal\":{\"name\":\"2021 24th International Conference on Computer and Information Technology (ICCIT)\",\"volume\":\"49 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 24th International Conference on Computer and Information Technology (ICCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIT54785.2021.9689918\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 24th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT54785.2021.9689918","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving Performance of a Pre-trained ResNet-50 Based VGGFace Recognition System by Utilizing Retraining as a Heuristic Step
Deep learning has remodeled the research aspect of facial recognition throughout the last decade by utilizing multiple processing layers to extract significant facial features. Although this emerging technology has achieved high performance for the face recognition problems, the dilemma of achieving low performance while training with a few samples per class has not been resolved yet. In this study, it has been shown that by utilizing retraining as a heuristic step, ResNet-50 based VGGFace architecture can enhance the performance of the face recognition scheme significantly. Multi-task Cascaded Convolutional Neural Networks have been utilized to crop faces first. The first training phase was completed by considering train samples from a combined dataset of 5-celebrity dataset, Georgia tech database, and three variants of KomNet datasets. The retraining of individual datasets further produced 94.41% test accuracy for the KomNet social media dataset and 100% test accuracy for the other four datasets.