Wang Liao, Chen Zhang, Belmin Alić, Alina Wildenauer, Sarah Dietz-Terjung, Jose Guillermo Ortiz Sucre, Sivagurunathan Sutharsan, Christoph Schöbel, Karsten Seidl, Gunther Notni
{"title":"Leveraging 3D convolutional neural network and 3D visible-near-infrared multimodal imaging for enhanced contactless oximetry.","authors":"Wang Liao, Chen Zhang, Belmin Alić, Alina Wildenauer, Sarah Dietz-Terjung, Jose Guillermo Ortiz Sucre, Sivagurunathan Sutharsan, Christoph Schöbel, Karsten Seidl, Gunther Notni","doi":"10.1117/1.JBO.29.S3.S33309","DOIUrl":null,"url":null,"abstract":"<p><strong>Significance: </strong>Monitoring oxygen saturation ( <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> ) is important in healthcare, especially for diagnosing and managing pulmonary diseases. Non-contact approaches broaden the potential applications of <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> measurement by better hygiene, comfort, and capability for long-term monitoring. However, existing studies often encounter challenges such as lower signal-to-noise ratios and stringent environmental conditions.</p><p><strong>Aim: </strong>We aim to develop and validate a contactless <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> measurement approach using 3D convolutional neural networks (3D CNN) and 3D visible-near-infrared (VIS-NIR) multimodal imaging, to offer a convenient, accurate, and robust alternative for <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> monitoring.</p><p><strong>Approach: </strong>We propose an approach that utilizes a 3D VIS-NIR multimodal camera system to capture facial videos, in which <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> is estimated through 3D CNN by simultaneously extracting spatial and temporal features. Our approach includes registration of multimodal images, tracking of the 3D region of interest, spatial and temporal preprocessing, and 3D CNN-based feature extraction and <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> regression.</p><p><strong>Results: </strong>In a breath-holding experiment involving 23 healthy participants, we obtained multimodal video data with reference <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> values ranging from 80% to 99% measured by pulse oximeter on the fingertip. The approach achieved a mean absolute error (MAE) of 2.31% and a Pearson correlation coefficient of 0.64 in the experiment, demonstrating good agreement with traditional pulse oximetry. The discrepancy of estimated <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> values was within 3% of the reference <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> for <math><mrow><mo>∼</mo> <mn>80</mn> <mo>%</mo></mrow> </math> of all 1-s time points. Besides, in clinical trials involving patients with sleep apnea syndrome, our approach demonstrated robust performance, with an MAE of less than 2% in <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> estimations compared to gold-standard polysomnography.</p><p><strong>Conclusions: </strong>The proposed approach offers a promising alternative for non-contact oxygen saturation measurement with good sensitivity to desaturation, showing potential for applications in clinical settings.</p>","PeriodicalId":15264,"journal":{"name":"Journal of Biomedical Optics","volume":"29 Suppl 3","pages":"S33309"},"PeriodicalIF":3.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11338290/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomedical Optics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JBO.29.S3.S33309","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/21 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Significance: Monitoring oxygen saturation ( ) is important in healthcare, especially for diagnosing and managing pulmonary diseases. Non-contact approaches broaden the potential applications of measurement by better hygiene, comfort, and capability for long-term monitoring. However, existing studies often encounter challenges such as lower signal-to-noise ratios and stringent environmental conditions.
Aim: We aim to develop and validate a contactless measurement approach using 3D convolutional neural networks (3D CNN) and 3D visible-near-infrared (VIS-NIR) multimodal imaging, to offer a convenient, accurate, and robust alternative for monitoring.
Approach: We propose an approach that utilizes a 3D VIS-NIR multimodal camera system to capture facial videos, in which is estimated through 3D CNN by simultaneously extracting spatial and temporal features. Our approach includes registration of multimodal images, tracking of the 3D region of interest, spatial and temporal preprocessing, and 3D CNN-based feature extraction and regression.
Results: In a breath-holding experiment involving 23 healthy participants, we obtained multimodal video data with reference values ranging from 80% to 99% measured by pulse oximeter on the fingertip. The approach achieved a mean absolute error (MAE) of 2.31% and a Pearson correlation coefficient of 0.64 in the experiment, demonstrating good agreement with traditional pulse oximetry. The discrepancy of estimated values was within 3% of the reference for of all 1-s time points. Besides, in clinical trials involving patients with sleep apnea syndrome, our approach demonstrated robust performance, with an MAE of less than 2% in estimations compared to gold-standard polysomnography.
Conclusions: The proposed approach offers a promising alternative for non-contact oxygen saturation measurement with good sensitivity to desaturation, showing potential for applications in clinical settings.
期刊介绍:
The Journal of Biomedical Optics publishes peer-reviewed papers on the use of modern optical technology for improved health care and biomedical research.