Physiological navigation amplifier for remote extracting PPG signals from face video clips

IF 5.5 | CAS Zone 2 (Computer Science) | JCR Q1: COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Bin Li, Wei Zhang, Hong Fu
Journal: Neurocomputing, volume 639, Article 130228
Publication date: 2025-04-11
Publication type: Journal Article
DOI: 10.1016/j.neucom.2025.130228
URL: https://www.sciencedirect.com/science/article/pii/S0925231225009002
Open access: no
Citations: 0

Abstract

Remote photoplethysmography (rPPG) signal measurement from face video clips is one of the essential methods for non-contact monitoring of human heart health. However, the subtle variations in face color associated with the rPPG signal are easily contaminated by external noise, which limits the cross-domain generalization of existing models. In short face video clips, we have observed that factors such as background and illumination cause noise to vary significantly between samples, while noise within a sample typically exhibits similar patterns. These distinctive features can therefore be used to model noise variation. This paper presents a physiological navigation amplifier with a self-adaptive differential feature representation structure for rPPG signal measurement in challenging video clips. First, a noise representation encoder is proposed for self-adaptive differential feature representation. Second, a physiological navigation amplifier is designed to extract the subtle rPPG signal from facial feature sets by disentangling rPPG and noise features in the spatiotemporal space. Finally, to address the degradation of the measured signal when drastic external disturbances reduce network performance, a learnable signal rectification matrix is employed to reconstruct the measured rPPG signal. Experimental results from intra-dataset and cross-dataset validation on publicly available datasets show that the proposed method reduces the mean absolute error (MAE) in rPPG signal and heart rate measurements by more than 17% (OBF), 13% (COHFACE), and 24% (UBFC) compared to state-of-the-art methods.
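The abstract reports results as MAE on heart rate derived from the measured rPPG signal. As a rough illustration of how such a number might be obtained, the sketch below estimates heart rate from a 1-D rPPG trace by taking the spectral peak, then computes MAE and a relative MAE reduction. This is not the authors' evaluation code: the function names, the 0.7 to 4.0 Hz search band, and the Hann window are all assumptions chosen for illustration.

```python
import numpy as np


def estimate_heart_rate_bpm(signal, fps, lo_hz=0.7, hi_hz=4.0):
    """Estimate heart rate (beats per minute) from a 1-D rPPG trace.

    Assumption: heart rate is the strongest spectral peak inside
    [lo_hz, hi_hz]; this band is a common choice, not taken from the paper.
    """
    x = np.asarray(signal, dtype=float)
    x = x - x.mean()                          # remove the DC offset
    window = np.hanning(len(x))               # taper to reduce spectral leakage
    power = np.abs(np.fft.rfft(x * window)) ** 2
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fps)
    band = (freqs >= lo_hz) & (freqs <= hi_hz)
    peak_hz = freqs[band][np.argmax(power[band])]
    return 60.0 * peak_hz


def mean_absolute_error(pred, truth):
    """Mean absolute error between predicted and reference values."""
    pred = np.asarray(pred, dtype=float)
    truth = np.asarray(truth, dtype=float)
    return float(np.mean(np.abs(pred - truth)))


def relative_reduction(baseline_mae, new_mae):
    """Fractional MAE reduction relative to a baseline (0.17 means 17%)."""
    return (baseline_mae - new_mae) / baseline_mae


if __name__ == "__main__":
    # Synthetic 10 s clip at 30 fps with a 1.2 Hz pulse plus noise (made-up data).
    rng = np.random.default_rng(0)
    fps = 30.0
    t = np.arange(300) / fps
    trace = np.sin(2 * np.pi * 1.2 * t) + 0.3 * rng.standard_normal(t.size)
    print("estimated HR (bpm):", estimate_heart_rate_bpm(trace, fps))
```

With a 10-second clip the frequency resolution is 0.1 Hz, or 6 bpm per bin, so practical pipelines often zero-pad or use longer windows before comparing against a reference heart rate.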
Source journal
Neurocomputing (Engineering and Technology: Computer Science, Artificial Intelligence)
CiteScore: 13.10
Self-citation rate: 10.00%
Articles published: 1382
Review time: 70 days
Journal description: Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.