Wang Liao, Chen Zhang, Belmin Alić, Alina Wildenauer, Sarah Dietz-Terjung, Jose Guillermo Ortiz Sucre, Sivagurunathan Sutharsan, Christoph Schöbel, Karsten Seidl, Gunther Notni
{"title":"利用三维卷积神经网络和三维可见光-近红外多模态成像增强非接触式血氧测量。","authors":"Wang Liao, Chen Zhang, Belmin Alić, Alina Wildenauer, Sarah Dietz-Terjung, Jose Guillermo Ortiz Sucre, Sivagurunathan Sutharsan, Christoph Schöbel, Karsten Seidl, Gunther Notni","doi":"10.1117/1.JBO.29.S3.S33309","DOIUrl":null,"url":null,"abstract":"<p><strong>Significance: </strong>Monitoring oxygen saturation ( <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> ) is important in healthcare, especially for diagnosing and managing pulmonary diseases. Non-contact approaches broaden the potential applications of <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> measurement by better hygiene, comfort, and capability for long-term monitoring. However, existing studies often encounter challenges such as lower signal-to-noise ratios and stringent environmental conditions.</p><p><strong>Aim: </strong>We aim to develop and validate a contactless <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> measurement approach using 3D convolutional neural networks (3D CNN) and 3D visible-near-infrared (VIS-NIR) multimodal imaging, to offer a convenient, accurate, and robust alternative for <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> monitoring.</p><p><strong>Approach: </strong>We propose an approach that utilizes a 3D VIS-NIR multimodal camera system to capture facial videos, in which <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> is estimated through 3D CNN by simultaneously extracting spatial and temporal features. Our approach includes registration of multimodal images, tracking of the 3D region of interest, spatial and temporal preprocessing, and 3D CNN-based feature extraction and <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> regression.</p><p><strong>Results: </strong>In a breath-holding experiment involving 23 healthy participants, we obtained multimodal video data with reference <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> values ranging from 80% to 99% measured by pulse oximeter on the fingertip. The approach achieved a mean absolute error (MAE) of 2.31% and a Pearson correlation coefficient of 0.64 in the experiment, demonstrating good agreement with traditional pulse oximetry. The discrepancy of estimated <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> values was within 3% of the reference <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> for <math><mrow><mo>∼</mo> <mn>80</mn> <mo>%</mo></mrow> </math> of all 1-s time points. Besides, in clinical trials involving patients with sleep apnea syndrome, our approach demonstrated robust performance, with an MAE of less than 2% in <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> estimations compared to gold-standard polysomnography.</p><p><strong>Conclusions: </strong>The proposed approach offers a promising alternative for non-contact oxygen saturation measurement with good sensitivity to desaturation, showing potential for applications in clinical settings.</p>","PeriodicalId":15264,"journal":{"name":"Journal of Biomedical Optics","volume":"29 Suppl 3","pages":"S33309"},"PeriodicalIF":3.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11338290/pdf/","citationCount":"0","resultStr":"{\"title\":\"Leveraging 3D convolutional neural network and 3D visible-near-infrared multimodal imaging for enhanced contactless oximetry.\",\"authors\":\"Wang Liao, Chen Zhang, Belmin Alić, Alina Wildenauer, Sarah Dietz-Terjung, Jose Guillermo Ortiz Sucre, Sivagurunathan Sutharsan, Christoph Schöbel, Karsten Seidl, Gunther Notni\",\"doi\":\"10.1117/1.JBO.29.S3.S33309\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Significance: </strong>Monitoring oxygen saturation ( <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> ) is important in healthcare, especially for diagnosing and managing pulmonary diseases. Non-contact approaches broaden the potential applications of <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> measurement by better hygiene, comfort, and capability for long-term monitoring. However, existing studies often encounter challenges such as lower signal-to-noise ratios and stringent environmental conditions.</p><p><strong>Aim: </strong>We aim to develop and validate a contactless <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> measurement approach using 3D convolutional neural networks (3D CNN) and 3D visible-near-infrared (VIS-NIR) multimodal imaging, to offer a convenient, accurate, and robust alternative for <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> monitoring.</p><p><strong>Approach: </strong>We propose an approach that utilizes a 3D VIS-NIR multimodal camera system to capture facial videos, in which <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> is estimated through 3D CNN by simultaneously extracting spatial and temporal features. Our approach includes registration of multimodal images, tracking of the 3D region of interest, spatial and temporal preprocessing, and 3D CNN-based feature extraction and <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> regression.</p><p><strong>Results: </strong>In a breath-holding experiment involving 23 healthy participants, we obtained multimodal video data with reference <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> values ranging from 80% to 99% measured by pulse oximeter on the fingertip. The approach achieved a mean absolute error (MAE) of 2.31% and a Pearson correlation coefficient of 0.64 in the experiment, demonstrating good agreement with traditional pulse oximetry. The discrepancy of estimated <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> values was within 3% of the reference <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> for <math><mrow><mo>∼</mo> <mn>80</mn> <mo>%</mo></mrow> </math> of all 1-s time points. Besides, in clinical trials involving patients with sleep apnea syndrome, our approach demonstrated robust performance, with an MAE of less than 2% in <math> <mrow><msub><mi>SpO</mi> <mn>2</mn></msub> </mrow> </math> estimations compared to gold-standard polysomnography.</p><p><strong>Conclusions: </strong>The proposed approach offers a promising alternative for non-contact oxygen saturation measurement with good sensitivity to desaturation, showing potential for applications in clinical settings.</p>\",\"PeriodicalId\":15264,\"journal\":{\"name\":\"Journal of Biomedical Optics\",\"volume\":\"29 Suppl 3\",\"pages\":\"S33309\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11338290/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Biomedical Optics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1117/1.JBO.29.S3.S33309\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/8/21 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomedical Optics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JBO.29.S3.S33309","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/21 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
Leveraging 3D convolutional neural network and 3D visible-near-infrared multimodal imaging for enhanced contactless oximetry.
Significance: Monitoring oxygen saturation ( ) is important in healthcare, especially for diagnosing and managing pulmonary diseases. Non-contact approaches broaden the potential applications of measurement by better hygiene, comfort, and capability for long-term monitoring. However, existing studies often encounter challenges such as lower signal-to-noise ratios and stringent environmental conditions.
Aim: We aim to develop and validate a contactless measurement approach using 3D convolutional neural networks (3D CNN) and 3D visible-near-infrared (VIS-NIR) multimodal imaging, to offer a convenient, accurate, and robust alternative for monitoring.
Approach: We propose an approach that utilizes a 3D VIS-NIR multimodal camera system to capture facial videos, in which is estimated through 3D CNN by simultaneously extracting spatial and temporal features. Our approach includes registration of multimodal images, tracking of the 3D region of interest, spatial and temporal preprocessing, and 3D CNN-based feature extraction and regression.
Results: In a breath-holding experiment involving 23 healthy participants, we obtained multimodal video data with reference values ranging from 80% to 99% measured by pulse oximeter on the fingertip. The approach achieved a mean absolute error (MAE) of 2.31% and a Pearson correlation coefficient of 0.64 in the experiment, demonstrating good agreement with traditional pulse oximetry. The discrepancy of estimated values was within 3% of the reference for of all 1-s time points. Besides, in clinical trials involving patients with sleep apnea syndrome, our approach demonstrated robust performance, with an MAE of less than 2% in estimations compared to gold-standard polysomnography.
Conclusions: The proposed approach offers a promising alternative for non-contact oxygen saturation measurement with good sensitivity to desaturation, showing potential for applications in clinical settings.
期刊介绍:
The Journal of Biomedical Optics publishes peer-reviewed papers on the use of modern optical technology for improved health care and biomedical research.