{"title":"Physiological navigation amplifier for remote extracting PPG signals from face video clips","authors":"Bin Li , Wei Zhang , Hong Fu","doi":"10.1016/j.neucom.2025.130228","DOIUrl":null,"url":null,"abstract":"<div><div>Remote photoplethysmography (rPPG) signal measurement based on face video clips is one of the essential methods for non-contact monitoring of human heart health. However, the subtle variations in face color associated with the rPPG signal can be easily contaminated by external noise, leading to a limited cross-domain generalization of existing models. In the context of short face video clips, we have observed that the influence of factors such as background, illumination, and other variables results in significant variation in noise between samples, while those within the sample typically exhibit similar patterns. Therefore, those distinctive features can be utilized to model noise variation. This paper presents a physiological navigation amplifier with a self-adaptive differential feature representation structure for rPPG signal measurement in challenging video clips. First, a noise representation encoder is proposed for self-adaptive differential feature representation. Second, a physiological navigation amplifier is designed to extract the subtle rPPG signal from facial feature sets by disentangling rPPG and noise features in the spatiotemporal space. Finally, to solve the problem of measured signal degradation caused by drastic external disturbances that result in network performance decline, a learnable signal rectification matrix is employed to reconstruct the measured rPPG signal. The experimental results, obtained through publicly available intra-datasets and cross-dataset validation, demonstrate that our proposed method reduces the mean absolute error (MAE) in rPPG signal and heart rate measurements by more than 17 % (OBF), 13 % (COHFACE), and 24 % (UBFC) compared to state-of-the-art methods.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"639 ","pages":"Article 130228"},"PeriodicalIF":5.5000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225009002","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Remote photoplethysmography (rPPG) signal measurement based on face video clips is one of the essential methods for non-contact monitoring of human heart health. However, the subtle variations in face color associated with the rPPG signal can be easily contaminated by external noise, leading to a limited cross-domain generalization of existing models. In the context of short face video clips, we have observed that the influence of factors such as background, illumination, and other variables results in significant variation in noise between samples, while those within the sample typically exhibit similar patterns. Therefore, those distinctive features can be utilized to model noise variation. This paper presents a physiological navigation amplifier with a self-adaptive differential feature representation structure for rPPG signal measurement in challenging video clips. First, a noise representation encoder is proposed for self-adaptive differential feature representation. Second, a physiological navigation amplifier is designed to extract the subtle rPPG signal from facial feature sets by disentangling rPPG and noise features in the spatiotemporal space. Finally, to solve the problem of measured signal degradation caused by drastic external disturbances that result in network performance decline, a learnable signal rectification matrix is employed to reconstruct the measured rPPG signal. The experimental results, obtained through publicly available intra-datasets and cross-dataset validation, demonstrate that our proposed method reduces the mean absolute error (MAE) in rPPG signal and heart rate measurements by more than 17 % (OBF), 13 % (COHFACE), and 24 % (UBFC) compared to state-of-the-art methods.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.