Rupali J. Dhabarde, D. V. Kodavade, Aditya Konnur, Vijay Manwatkar
{"title":"基于HAPNet分割和混合VGG16-SVM分类器的红外图像面部表情识别","authors":"Rupali J. Dhabarde, D. V. Kodavade, Aditya Konnur, Vijay Manwatkar","doi":"10.3103/S1060992X24600599","DOIUrl":null,"url":null,"abstract":"<p>Recognition of Human Face expression is the most significant and challenging societal interaction tasks. Humans often convey their feelings and intentions through their facial expressions in a natural and honest manner, nonverbal communication is mostly characterized by facial expressions. Various approaches for classifying emotions and facial recognition have been established to enhance the accuracy of face recognition in the infrared images. Significant issues of recent deep FER systems include overfitting due to insufficient training data as well as expression-unrelated variables such as identification bias, head posture, and illumination. To address these challenges, the proposed model implemented a method for detecting facial expression using HAPNet segmentation and hybrid VGG16 with SVM classifier. At first, pre-processed the images using an optimized Difference of Gaussians (DOG) filter for enhancing the edges of the image and the Artificial Gorilla Troops Optimization Algorithm (GTO) is used to select the kernel size based on the maximum PSNR. Segmentation is the next step for segmenting the face using the Hybrid, Asymmetric, and Progressive Network (HAPNet) method. Landmark is detected based on Multi-Task Cascaded Convolutional Networks (MTCNN) for identifying the location of the mouth eyes, and nose. The last step is to categorize the seven emotions which are happy, sad, disgusted, surprised, angry, fearful, and neutral on faces using the hybrid VGG16 with Support Vector Machine (SVM) algorithm. The effectiveness of the proposed methodology is evaluated using the metrics of accuracy is 96.6%, positive predictive value is 93.08%, hit rate of 95.2%, selectivity of 92.5%, negative predictive value of 95.8%, and f1-score of 94.49%. Experiments on the database illustrates that the proposed approach performs better than conventional techniques for accurately identifies the expressions on the face in the thermal images.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 2","pages":"146 - 163"},"PeriodicalIF":0.8000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Facial Expression Recognition in Infrared Imaging Using HAPNet Segmentation and Hybrid VGG16-SVM Classifier\",\"authors\":\"Rupali J. Dhabarde, D. V. Kodavade, Aditya Konnur, Vijay Manwatkar\",\"doi\":\"10.3103/S1060992X24600599\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Recognition of Human Face expression is the most significant and challenging societal interaction tasks. Humans often convey their feelings and intentions through their facial expressions in a natural and honest manner, nonverbal communication is mostly characterized by facial expressions. Various approaches for classifying emotions and facial recognition have been established to enhance the accuracy of face recognition in the infrared images. Significant issues of recent deep FER systems include overfitting due to insufficient training data as well as expression-unrelated variables such as identification bias, head posture, and illumination. To address these challenges, the proposed model implemented a method for detecting facial expression using HAPNet segmentation and hybrid VGG16 with SVM classifier. At first, pre-processed the images using an optimized Difference of Gaussians (DOG) filter for enhancing the edges of the image and the Artificial Gorilla Troops Optimization Algorithm (GTO) is used to select the kernel size based on the maximum PSNR. Segmentation is the next step for segmenting the face using the Hybrid, Asymmetric, and Progressive Network (HAPNet) method. Landmark is detected based on Multi-Task Cascaded Convolutional Networks (MTCNN) for identifying the location of the mouth eyes, and nose. The last step is to categorize the seven emotions which are happy, sad, disgusted, surprised, angry, fearful, and neutral on faces using the hybrid VGG16 with Support Vector Machine (SVM) algorithm. The effectiveness of the proposed methodology is evaluated using the metrics of accuracy is 96.6%, positive predictive value is 93.08%, hit rate of 95.2%, selectivity of 92.5%, negative predictive value of 95.8%, and f1-score of 94.49%. Experiments on the database illustrates that the proposed approach performs better than conventional techniques for accurately identifies the expressions on the face in the thermal images.</p>\",\"PeriodicalId\":721,\"journal\":{\"name\":\"Optical Memory and Neural Networks\",\"volume\":\"34 2\",\"pages\":\"146 - 163\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Optical Memory and Neural Networks\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.3103/S1060992X24600599\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"OPTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optical Memory and Neural Networks","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.3103/S1060992X24600599","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPTICS","Score":null,"Total":0}
Facial Expression Recognition in Infrared Imaging Using HAPNet Segmentation and Hybrid VGG16-SVM Classifier
Recognition of Human Face expression is the most significant and challenging societal interaction tasks. Humans often convey their feelings and intentions through their facial expressions in a natural and honest manner, nonverbal communication is mostly characterized by facial expressions. Various approaches for classifying emotions and facial recognition have been established to enhance the accuracy of face recognition in the infrared images. Significant issues of recent deep FER systems include overfitting due to insufficient training data as well as expression-unrelated variables such as identification bias, head posture, and illumination. To address these challenges, the proposed model implemented a method for detecting facial expression using HAPNet segmentation and hybrid VGG16 with SVM classifier. At first, pre-processed the images using an optimized Difference of Gaussians (DOG) filter for enhancing the edges of the image and the Artificial Gorilla Troops Optimization Algorithm (GTO) is used to select the kernel size based on the maximum PSNR. Segmentation is the next step for segmenting the face using the Hybrid, Asymmetric, and Progressive Network (HAPNet) method. Landmark is detected based on Multi-Task Cascaded Convolutional Networks (MTCNN) for identifying the location of the mouth eyes, and nose. The last step is to categorize the seven emotions which are happy, sad, disgusted, surprised, angry, fearful, and neutral on faces using the hybrid VGG16 with Support Vector Machine (SVM) algorithm. The effectiveness of the proposed methodology is evaluated using the metrics of accuracy is 96.6%, positive predictive value is 93.08%, hit rate of 95.2%, selectivity of 92.5%, negative predictive value of 95.8%, and f1-score of 94.49%. Experiments on the database illustrates that the proposed approach performs better than conventional techniques for accurately identifies the expressions on the face in the thermal images.
期刊介绍:
The journal covers a wide range of issues in information optics such as optical memory, mechanisms for optical data recording and processing, photosensitive materials, optical, optoelectronic and holographic nanostructures, and many other related topics. Papers on memory systems using holographic and biological structures and concepts of brain operation are also included. The journal pays particular attention to research in the field of neural net systems that may lead to a new generation of computional technologies by endowing them with intelligence.