{"title":"Latency Improvement Strategy for Temporally Stable Sequential 3DMM-based Face Expression Tracking","authors":"Tri Tung Nguyen Nguyen, D. Tran, Joo-Ho Lee","doi":"10.1109/SII58957.2024.10417546","DOIUrl":null,"url":null,"abstract":"2D image-based face tracking is a core feature for multiple AR/VR applications. The latest advancements in self-supervised 3DMM face reconstruction maintained high-accuracy analysis-by-synthesis tracking but were not designed for online inference settings with low latency performance. Recently, state-of-the-art models such as MICA [1] has demonstrated significant improvement in term of accuracy for the offline face construction task but the design is ill-suited for practical use cases due to their long processing time on low and middle-end hardware. The original workflow includes two analysis-by-synthesis stages: face shape reconstruction and face tracking. The shape reconstruction aims to regress a neutral 3DMM model from the input. Then the tracking process learns relevant parameters for expressions, eyes, mouth, etc. for a differentiable render to reconstruct the original photographic input. This study aims to propose a design for an interface to apply offline 3DMM face tracking into an online inference pipeline for facial analysis-based applications.","PeriodicalId":518021,"journal":{"name":"2024 IEEE/SICE International Symposium on System Integration (SII)","volume":"19 1","pages":"327-332"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 IEEE/SICE International Symposium on System Integration (SII)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SII58957.2024.10417546","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
2D image-based face tracking is a core feature for multiple AR/VR applications. The latest advancements in self-supervised 3DMM face reconstruction maintained high-accuracy analysis-by-synthesis tracking but were not designed for online inference settings with low latency performance. Recently, state-of-the-art models such as MICA [1] has demonstrated significant improvement in term of accuracy for the offline face construction task but the design is ill-suited for practical use cases due to their long processing time on low and middle-end hardware. The original workflow includes two analysis-by-synthesis stages: face shape reconstruction and face tracking. The shape reconstruction aims to regress a neutral 3DMM model from the input. Then the tracking process learns relevant parameters for expressions, eyes, mouth, etc. for a differentiable render to reconstruct the original photographic input. This study aims to propose a design for an interface to apply offline 3DMM face tracking into an online inference pipeline for facial analysis-based applications.