Andreas Weber, Kathrin Enderle-Ammour, Karl-Moritz Schröder, Marc C Metzger, Leonard Simon Brandenburg, Jürgen Beck, Jakob Straehle, David Steybe, Mohamed Hassan, Severin Schmid, Birte Ohm, Martin Werner, Bernward Passlick, Rainer Schmelzeisen, Uyen-Thao Le, Peter Bronsert
{"title":"受激拉曼散射图像预处理对深度神经网络检测肿瘤组织性能的影响。","authors":"Andreas Weber, Kathrin Enderle-Ammour, Karl-Moritz Schröder, Marc C Metzger, Leonard Simon Brandenburg, Jürgen Beck, Jakob Straehle, David Steybe, Mohamed Hassan, Severin Schmid, Birte Ohm, Martin Werner, Bernward Passlick, Rainer Schmelzeisen, Uyen-Thao Le, Peter Bronsert","doi":"10.21037/qims-2024-2608","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Images obtained from stimulated Raman scattering can be used to identify histomorphologically relevant information intraoperatively. In order to leverage deep learning algorithms for distinguishing tumoral and non-tumoral tissue, data preprocessing remains a crucial task and may affect the classification performance. To date, the effect of different preprocessing techniques on deep learning algorithm performance is unclear. This study aims to make a contribution to closing this knowledge gap.</p><p><strong>Methods: </strong>To investigate the influence of different preprocessing techniques of images obtained from stimulated Raman scattering, six deep learning architectures (VGG19, ResNet50, InceptionResNetV2, Xception, ConvNeXt and Vision Transformer) and five different preprocessing procedures were compared. For this, annotated datasets comprising 542 images of tissue samples obtained from patients with oral squamous cell carcinoma and non-small cell lung carcinoma were used for network training. Each network was trained five times for 40 epochs. Performance metrics balanced accuracy, precision, recall and F1-score were recorded. Class activation and attention maps were used to highlight on which input pixels a prediction is based.</p><p><strong>Results: </strong>A scaling of the original pixel values of stimulated Raman scattering images to the range [0, 1] yielded a higher and more stable overall classification performance across the neural networks when compared to more sophisticated and computationally expensive methods [ <math> <mrow> <mover><mrow><mi>F</mi> <mn>1</mn></mrow> <mo>¯</mo></mover> </mrow> </math> =0.8327; standard deviation (SD) =0.0622 on scaled dataset and <math> <mrow> <mover><mrow><mi>F</mi> <mn>1</mn></mrow> <mo>¯</mo></mover> </mrow> </math> =0.7213 (SD =0.2315) on complex preprocessed dataset; P≤0.05]. Absolute performance was best on stimulated Raman histology images ( <math> <mrow> <mover><mrow><mi>F</mi> <mn>1</mn></mrow> <mo>¯</mo></mover> </mrow> </math> =0.8478; SD =0.1487).</p><p><strong>Conclusions: </strong>This study shows that preprocessing of pixel values of stimulated Raman scattering images can have a great impact on the performance and the stability of deep learning algorithms when applied for classification of cancer tissue.</p>","PeriodicalId":54267,"journal":{"name":"Quantitative Imaging in Medicine and Surgery","volume":"15 9","pages":"7711-7726"},"PeriodicalIF":2.3000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12397652/pdf/","citationCount":"0","resultStr":"{\"title\":\"Influence of preprocessing of stimulated Raman scattering images on the performance of deep neural networks for detecting cancer tissue.\",\"authors\":\"Andreas Weber, Kathrin Enderle-Ammour, Karl-Moritz Schröder, Marc C Metzger, Leonard Simon Brandenburg, Jürgen Beck, Jakob Straehle, David Steybe, Mohamed Hassan, Severin Schmid, Birte Ohm, Martin Werner, Bernward Passlick, Rainer Schmelzeisen, Uyen-Thao Le, Peter Bronsert\",\"doi\":\"10.21037/qims-2024-2608\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Images obtained from stimulated Raman scattering can be used to identify histomorphologically relevant information intraoperatively. In order to leverage deep learning algorithms for distinguishing tumoral and non-tumoral tissue, data preprocessing remains a crucial task and may affect the classification performance. To date, the effect of different preprocessing techniques on deep learning algorithm performance is unclear. This study aims to make a contribution to closing this knowledge gap.</p><p><strong>Methods: </strong>To investigate the influence of different preprocessing techniques of images obtained from stimulated Raman scattering, six deep learning architectures (VGG19, ResNet50, InceptionResNetV2, Xception, ConvNeXt and Vision Transformer) and five different preprocessing procedures were compared. For this, annotated datasets comprising 542 images of tissue samples obtained from patients with oral squamous cell carcinoma and non-small cell lung carcinoma were used for network training. Each network was trained five times for 40 epochs. Performance metrics balanced accuracy, precision, recall and F1-score were recorded. Class activation and attention maps were used to highlight on which input pixels a prediction is based.</p><p><strong>Results: </strong>A scaling of the original pixel values of stimulated Raman scattering images to the range [0, 1] yielded a higher and more stable overall classification performance across the neural networks when compared to more sophisticated and computationally expensive methods [ <math> <mrow> <mover><mrow><mi>F</mi> <mn>1</mn></mrow> <mo>¯</mo></mover> </mrow> </math> =0.8327; standard deviation (SD) =0.0622 on scaled dataset and <math> <mrow> <mover><mrow><mi>F</mi> <mn>1</mn></mrow> <mo>¯</mo></mover> </mrow> </math> =0.7213 (SD =0.2315) on complex preprocessed dataset; P≤0.05]. Absolute performance was best on stimulated Raman histology images ( <math> <mrow> <mover><mrow><mi>F</mi> <mn>1</mn></mrow> <mo>¯</mo></mover> </mrow> </math> =0.8478; SD =0.1487).</p><p><strong>Conclusions: </strong>This study shows that preprocessing of pixel values of stimulated Raman scattering images can have a great impact on the performance and the stability of deep learning algorithms when applied for classification of cancer tissue.</p>\",\"PeriodicalId\":54267,\"journal\":{\"name\":\"Quantitative Imaging in Medicine and Surgery\",\"volume\":\"15 9\",\"pages\":\"7711-7726\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12397652/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Quantitative Imaging in Medicine and Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/qims-2024-2608\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/8/12 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Imaging in Medicine and Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/qims-2024-2608","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/12 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Influence of preprocessing of stimulated Raman scattering images on the performance of deep neural networks for detecting cancer tissue.
Background: Images obtained from stimulated Raman scattering can be used to identify histomorphologically relevant information intraoperatively. In order to leverage deep learning algorithms for distinguishing tumoral and non-tumoral tissue, data preprocessing remains a crucial task and may affect the classification performance. To date, the effect of different preprocessing techniques on deep learning algorithm performance is unclear. This study aims to make a contribution to closing this knowledge gap.
Methods: To investigate the influence of different preprocessing techniques of images obtained from stimulated Raman scattering, six deep learning architectures (VGG19, ResNet50, InceptionResNetV2, Xception, ConvNeXt and Vision Transformer) and five different preprocessing procedures were compared. For this, annotated datasets comprising 542 images of tissue samples obtained from patients with oral squamous cell carcinoma and non-small cell lung carcinoma were used for network training. Each network was trained five times for 40 epochs. Performance metrics balanced accuracy, precision, recall and F1-score were recorded. Class activation and attention maps were used to highlight on which input pixels a prediction is based.
Results: A scaling of the original pixel values of stimulated Raman scattering images to the range [0, 1] yielded a higher and more stable overall classification performance across the neural networks when compared to more sophisticated and computationally expensive methods [ =0.8327; standard deviation (SD) =0.0622 on scaled dataset and =0.7213 (SD =0.2315) on complex preprocessed dataset; P≤0.05]. Absolute performance was best on stimulated Raman histology images ( =0.8478; SD =0.1487).
Conclusions: This study shows that preprocessing of pixel values of stimulated Raman scattering images can have a great impact on the performance and the stability of deep learning algorithms when applied for classification of cancer tissue.