{"title":"基于时序稳定 3DMM 的人脸表情跟踪延迟改进策略","authors":"Tri Tung Nguyen Nguyen, D. Tran, Joo-Ho Lee","doi":"10.1109/SII58957.2024.10417546","DOIUrl":null,"url":null,"abstract":"2D image-based face tracking is a core feature for multiple AR/VR applications. The latest advancements in self-supervised 3DMM face reconstruction maintained high-accuracy analysis-by-synthesis tracking but were not designed for online inference settings with low latency performance. Recently, state-of-the-art models such as MICA [1] has demonstrated significant improvement in term of accuracy for the offline face construction task but the design is ill-suited for practical use cases due to their long processing time on low and middle-end hardware. The original workflow includes two analysis-by-synthesis stages: face shape reconstruction and face tracking. The shape reconstruction aims to regress a neutral 3DMM model from the input. Then the tracking process learns relevant parameters for expressions, eyes, mouth, etc. for a differentiable render to reconstruct the original photographic input. This study aims to propose a design for an interface to apply offline 3DMM face tracking into an online inference pipeline for facial analysis-based applications.","PeriodicalId":518021,"journal":{"name":"2024 IEEE/SICE International Symposium on System Integration (SII)","volume":"19 1","pages":"327-332"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Latency Improvement Strategy for Temporally Stable Sequential 3DMM-based Face Expression Tracking\",\"authors\":\"Tri Tung Nguyen Nguyen, D. Tran, Joo-Ho Lee\",\"doi\":\"10.1109/SII58957.2024.10417546\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"2D image-based face tracking is a core feature for multiple AR/VR applications. The latest advancements in self-supervised 3DMM face reconstruction maintained high-accuracy analysis-by-synthesis tracking but were not designed for online inference settings with low latency performance. Recently, state-of-the-art models such as MICA [1] has demonstrated significant improvement in term of accuracy for the offline face construction task but the design is ill-suited for practical use cases due to their long processing time on low and middle-end hardware. The original workflow includes two analysis-by-synthesis stages: face shape reconstruction and face tracking. The shape reconstruction aims to regress a neutral 3DMM model from the input. Then the tracking process learns relevant parameters for expressions, eyes, mouth, etc. for a differentiable render to reconstruct the original photographic input. This study aims to propose a design for an interface to apply offline 3DMM face tracking into an online inference pipeline for facial analysis-based applications.\",\"PeriodicalId\":518021,\"journal\":{\"name\":\"2024 IEEE/SICE International Symposium on System Integration (SII)\",\"volume\":\"19 1\",\"pages\":\"327-332\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2024 IEEE/SICE International Symposium on System Integration (SII)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SII58957.2024.10417546\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 IEEE/SICE International Symposium on System Integration (SII)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SII58957.2024.10417546","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Latency Improvement Strategy for Temporally Stable Sequential 3DMM-based Face Expression Tracking
2D image-based face tracking is a core feature for multiple AR/VR applications. The latest advancements in self-supervised 3DMM face reconstruction maintained high-accuracy analysis-by-synthesis tracking but were not designed for online inference settings with low latency performance. Recently, state-of-the-art models such as MICA [1] has demonstrated significant improvement in term of accuracy for the offline face construction task but the design is ill-suited for practical use cases due to their long processing time on low and middle-end hardware. The original workflow includes two analysis-by-synthesis stages: face shape reconstruction and face tracking. The shape reconstruction aims to regress a neutral 3DMM model from the input. Then the tracking process learns relevant parameters for expressions, eyes, mouth, etc. for a differentiable render to reconstruct the original photographic input. This study aims to propose a design for an interface to apply offline 3DMM face tracking into an online inference pipeline for facial analysis-based applications.