Haibo Zhang , Xu Wang , Pan Dang , Chaohui Ma , Shuai Liu , Zhuang Xiong , Cheng Liu
{"title":"UL-Phys:基于无监督学习的面部视频中超轻量远程生理测量","authors":"Haibo Zhang , Xu Wang , Pan Dang , Chaohui Ma , Shuai Liu , Zhuang Xiong , Cheng Liu","doi":"10.1016/j.asoc.2025.113593","DOIUrl":null,"url":null,"abstract":"<div><div>Remote photoplethysmography (rPPG) enables non-contact monitoring of vital signs using facial videos, but current supervised learning methods often rely on complex architectures and large annotated datasets, limiting their practicality in real-time and resource-constrained scenarios. This paper addresses these limitations by proposing UL-Phys, an ultra-lightweight self-supervised framework for rPPG signal estimation. From a research standpoint, we reformulate the rPPG task as a linear self-supervised reconstruction problem, introducing a novel frequency-constrained objective to extract inherent periodic information without requiring ground truth labels. The framework integrates a lightweight 3D spatiotemporal encoder-decoder network, and a neuroscience-inspired hybrid attention module to enhance pulsatile signal regions while suppressing noise. Experimental evaluations on PURE and UBFC-rPPG datasets demonstrate that UL-Phys achieves superior performance compared to existing supervised and self-supervised baselines, while significantly reducing model complexity and inference latency. Our method also shows strong generalization across datasets, highlighting the value of embedding physiological priors into lightweight, self-supervised architectures. These findings offer a promising direction for scalable and deployable rPPG systems in real-world settings.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"182 ","pages":"Article 113593"},"PeriodicalIF":7.2000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"UL-Phys:Ultra-lightweight remote physiological measurement in facial videos based on unsupervised learning\",\"authors\":\"Haibo Zhang , Xu Wang , Pan Dang , Chaohui Ma , Shuai Liu , Zhuang Xiong , Cheng Liu\",\"doi\":\"10.1016/j.asoc.2025.113593\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Remote photoplethysmography (rPPG) enables non-contact monitoring of vital signs using facial videos, but current supervised learning methods often rely on complex architectures and large annotated datasets, limiting their practicality in real-time and resource-constrained scenarios. This paper addresses these limitations by proposing UL-Phys, an ultra-lightweight self-supervised framework for rPPG signal estimation. From a research standpoint, we reformulate the rPPG task as a linear self-supervised reconstruction problem, introducing a novel frequency-constrained objective to extract inherent periodic information without requiring ground truth labels. The framework integrates a lightweight 3D spatiotemporal encoder-decoder network, and a neuroscience-inspired hybrid attention module to enhance pulsatile signal regions while suppressing noise. Experimental evaluations on PURE and UBFC-rPPG datasets demonstrate that UL-Phys achieves superior performance compared to existing supervised and self-supervised baselines, while significantly reducing model complexity and inference latency. Our method also shows strong generalization across datasets, highlighting the value of embedding physiological priors into lightweight, self-supervised architectures. These findings offer a promising direction for scalable and deployable rPPG systems in real-world settings.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"182 \",\"pages\":\"Article 113593\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-07-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625009044\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625009044","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
UL-Phys:Ultra-lightweight remote physiological measurement in facial videos based on unsupervised learning
Remote photoplethysmography (rPPG) enables non-contact monitoring of vital signs using facial videos, but current supervised learning methods often rely on complex architectures and large annotated datasets, limiting their practicality in real-time and resource-constrained scenarios. This paper addresses these limitations by proposing UL-Phys, an ultra-lightweight self-supervised framework for rPPG signal estimation. From a research standpoint, we reformulate the rPPG task as a linear self-supervised reconstruction problem, introducing a novel frequency-constrained objective to extract inherent periodic information without requiring ground truth labels. The framework integrates a lightweight 3D spatiotemporal encoder-decoder network, and a neuroscience-inspired hybrid attention module to enhance pulsatile signal regions while suppressing noise. Experimental evaluations on PURE and UBFC-rPPG datasets demonstrate that UL-Phys achieves superior performance compared to existing supervised and self-supervised baselines, while significantly reducing model complexity and inference latency. Our method also shows strong generalization across datasets, highlighting the value of embedding physiological priors into lightweight, self-supervised architectures. These findings offer a promising direction for scalable and deployable rPPG systems in real-world settings.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.