Y.S. Gan , Kun-Hong Liu , Min-Huan Wu , Gen-Bing Liong , Sze-Teng Liong
{"title":"基于双输入CNN架构的端到端真实视频微表情识别系统","authors":"Y.S. Gan , Kun-Hong Liu , Min-Huan Wu , Gen-Bing Liong , Sze-Teng Liong","doi":"10.1016/j.eswa.2025.128062","DOIUrl":null,"url":null,"abstract":"<div><div>Micro-expression (ME) recognition reveals nonverbal emotions through subtle, involuntary facial muscle movements. However, the development and commercialization of ME recognition systems have been hindered by the lack of databases that accurately reflect real-world conditions. This study addresses this challenge by proposing a robust end-to-end system designed to operate effectively in unconstrained environments. Existing methods typically rely on a single apex frame, which may be unreliable due to noise, occlusions, or lighting variations. To address these issues, a 3D facial reconstruction technique is applied as a pre-processing step to normalize pose and lighting. A novel dual-peak frame detection strategy is then introduced to extract two expressive optical flow frames, reducing the impact of noise from any single frame. Finally, a Shallow and Small-size Dual-input (SSD) CNN architecture is designed to jointly process the two frames for improved emotion classification. The proposed system achieves strong performance on the challenging in-the-wild MEVIEW dataset, with accuracy and F1-score of 75 % and 77.68 %, respectively. Comprehensive evaluations further validate the effectiveness of the pipeline, highlighting its potential for real-world ME recognition applications.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"286 ","pages":"Article 128062"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An improved end-to-end micro-expression recognition system for real-world videos via dual-input CNN architecture\",\"authors\":\"Y.S. Gan , Kun-Hong Liu , Min-Huan Wu , Gen-Bing Liong , Sze-Teng Liong\",\"doi\":\"10.1016/j.eswa.2025.128062\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Micro-expression (ME) recognition reveals nonverbal emotions through subtle, involuntary facial muscle movements. However, the development and commercialization of ME recognition systems have been hindered by the lack of databases that accurately reflect real-world conditions. This study addresses this challenge by proposing a robust end-to-end system designed to operate effectively in unconstrained environments. Existing methods typically rely on a single apex frame, which may be unreliable due to noise, occlusions, or lighting variations. To address these issues, a 3D facial reconstruction technique is applied as a pre-processing step to normalize pose and lighting. A novel dual-peak frame detection strategy is then introduced to extract two expressive optical flow frames, reducing the impact of noise from any single frame. Finally, a Shallow and Small-size Dual-input (SSD) CNN architecture is designed to jointly process the two frames for improved emotion classification. The proposed system achieves strong performance on the challenging in-the-wild MEVIEW dataset, with accuracy and F1-score of 75 % and 77.68 %, respectively. Comprehensive evaluations further validate the effectiveness of the pipeline, highlighting its potential for real-world ME recognition applications.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"286 \",\"pages\":\"Article 128062\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-05-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425016835\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425016835","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
An improved end-to-end micro-expression recognition system for real-world videos via dual-input CNN architecture
Micro-expression (ME) recognition reveals nonverbal emotions through subtle, involuntary facial muscle movements. However, the development and commercialization of ME recognition systems have been hindered by the lack of databases that accurately reflect real-world conditions. This study addresses this challenge by proposing a robust end-to-end system designed to operate effectively in unconstrained environments. Existing methods typically rely on a single apex frame, which may be unreliable due to noise, occlusions, or lighting variations. To address these issues, a 3D facial reconstruction technique is applied as a pre-processing step to normalize pose and lighting. A novel dual-peak frame detection strategy is then introduced to extract two expressive optical flow frames, reducing the impact of noise from any single frame. Finally, a Shallow and Small-size Dual-input (SSD) CNN architecture is designed to jointly process the two frames for improved emotion classification. The proposed system achieves strong performance on the challenging in-the-wild MEVIEW dataset, with accuracy and F1-score of 75 % and 77.68 %, respectively. Comprehensive evaluations further validate the effectiveness of the pipeline, highlighting its potential for real-world ME recognition applications.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.