Jacob N Gloe, Eric A Borisch, Adam T Froemming, Akira Kawashima, Jordan D LeGout, Hirotsugu Nakai, Naoki Takahashi, Stephen J Riederer
{"title":"用于轴向t2加权前列腺MRI质量评估的深度学习:减少不必要重新扫描的工具。","authors":"Jacob N Gloe, Eric A Borisch, Adam T Froemming, Akira Kawashima, Jordan D LeGout, Hirotsugu Nakai, Naoki Takahashi, Stephen J Riederer","doi":"10.1186/s41747-025-00584-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>T2-weighted images are a critical component of prostate magnetic resonance imaging (MRI), and it would be useful to automatically assess image quality (IQ) on a patient-specific basis without radiologist oversight.</p><p><strong>Methods: </strong>This retrospective study comprised 1,412 axial T2-weighted prostate scans. Four experienced uroradiologists graded IQ using a 0-to-3 scale (0 = uninterpretable; 1 = marginally interpretable; 2 = adequately diagnostic; 3 = more than adequately diagnostic), binarized into nondiagnostic (IQ0 or IQ1), requiring rescanning, and diagnostic (IQ2 or IQ3), not requiring rescanning. The deep learning (DL) model was trained on 1,006 scans; 203 other scans were used for validation of multiple convolutional neural networks; the remaining 203 exams were used as a test set. 3D-DenseNet_169 was chosen among 11 models based on multiple evaluation criteria. The rescan predictions were compared to the number of rescans performed on a subset of 174 exams.</p><p><strong>Results: </strong>The model accurately predicts radiologist IQ scores (Cohen κ = 0.658), similar to the human inter-rater reliability (κ = 0.688-0.791). The model also predicts rescanning necessity similarly to radiologists: model κ = 0.537; reviewer κ = 0.577-0.703. The rescan model prediction area under the curve was 0.867.</p><p><strong>Conclusion: </strong>The DL model showed a strong ability to differentiate diagnostic from nondiagnostic axial T2-weighted prostate images, accurately mimicking expert radiologists' IQ scores. Using the model, the clinical unnecessary rescan rate could be reduced from over 50% to less than 30%.</p><p><strong>Relevance statement: </strong>DL assessment of T2-weighted prostate MRI scans can accurately assess IQ, determining the need to repeat inadequate scans as well as avoiding repeat scans of those with adequate diagnostic quality, resulting in reduced unnecessary rescanning.</p><p><strong>Key points: </strong>Artificial intelligence assessment of prostate MRI T2-weighted image quality can improve exam time management. The model showed over 75% accuracy in assessing prostate MRI T2-weighted image quality. Expert radiologists have a substantial agreement in evaluating prostate MRI T2-weighted image quality.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"44"},"PeriodicalIF":3.7000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12040773/pdf/","citationCount":"0","resultStr":"{\"title\":\"Deep learning for quality assessment of axial T2-weighted prostate MRI: a tool to reduce unnecessary rescanning.\",\"authors\":\"Jacob N Gloe, Eric A Borisch, Adam T Froemming, Akira Kawashima, Jordan D LeGout, Hirotsugu Nakai, Naoki Takahashi, Stephen J Riederer\",\"doi\":\"10.1186/s41747-025-00584-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>T2-weighted images are a critical component of prostate magnetic resonance imaging (MRI), and it would be useful to automatically assess image quality (IQ) on a patient-specific basis without radiologist oversight.</p><p><strong>Methods: </strong>This retrospective study comprised 1,412 axial T2-weighted prostate scans. Four experienced uroradiologists graded IQ using a 0-to-3 scale (0 = uninterpretable; 1 = marginally interpretable; 2 = adequately diagnostic; 3 = more than adequately diagnostic), binarized into nondiagnostic (IQ0 or IQ1), requiring rescanning, and diagnostic (IQ2 or IQ3), not requiring rescanning. The deep learning (DL) model was trained on 1,006 scans; 203 other scans were used for validation of multiple convolutional neural networks; the remaining 203 exams were used as a test set. 3D-DenseNet_169 was chosen among 11 models based on multiple evaluation criteria. The rescan predictions were compared to the number of rescans performed on a subset of 174 exams.</p><p><strong>Results: </strong>The model accurately predicts radiologist IQ scores (Cohen κ = 0.658), similar to the human inter-rater reliability (κ = 0.688-0.791). The model also predicts rescanning necessity similarly to radiologists: model κ = 0.537; reviewer κ = 0.577-0.703. The rescan model prediction area under the curve was 0.867.</p><p><strong>Conclusion: </strong>The DL model showed a strong ability to differentiate diagnostic from nondiagnostic axial T2-weighted prostate images, accurately mimicking expert radiologists' IQ scores. Using the model, the clinical unnecessary rescan rate could be reduced from over 50% to less than 30%.</p><p><strong>Relevance statement: </strong>DL assessment of T2-weighted prostate MRI scans can accurately assess IQ, determining the need to repeat inadequate scans as well as avoiding repeat scans of those with adequate diagnostic quality, resulting in reduced unnecessary rescanning.</p><p><strong>Key points: </strong>Artificial intelligence assessment of prostate MRI T2-weighted image quality can improve exam time management. The model showed over 75% accuracy in assessing prostate MRI T2-weighted image quality. Expert radiologists have a substantial agreement in evaluating prostate MRI T2-weighted image quality.</p>\",\"PeriodicalId\":36926,\"journal\":{\"name\":\"European Radiology Experimental\",\"volume\":\"9 1\",\"pages\":\"44\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-04-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12040773/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Radiology Experimental\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s41747-025-00584-z\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Radiology Experimental","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s41747-025-00584-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Deep learning for quality assessment of axial T2-weighted prostate MRI: a tool to reduce unnecessary rescanning.
Background: T2-weighted images are a critical component of prostate magnetic resonance imaging (MRI), and it would be useful to automatically assess image quality (IQ) on a patient-specific basis without radiologist oversight.
Methods: This retrospective study comprised 1,412 axial T2-weighted prostate scans. Four experienced uroradiologists graded IQ using a 0-to-3 scale (0 = uninterpretable; 1 = marginally interpretable; 2 = adequately diagnostic; 3 = more than adequately diagnostic), binarized into nondiagnostic (IQ0 or IQ1), requiring rescanning, and diagnostic (IQ2 or IQ3), not requiring rescanning. The deep learning (DL) model was trained on 1,006 scans; 203 other scans were used for validation of multiple convolutional neural networks; the remaining 203 exams were used as a test set. 3D-DenseNet_169 was chosen among 11 models based on multiple evaluation criteria. The rescan predictions were compared to the number of rescans performed on a subset of 174 exams.
Results: The model accurately predicts radiologist IQ scores (Cohen κ = 0.658), similar to the human inter-rater reliability (κ = 0.688-0.791). The model also predicts rescanning necessity similarly to radiologists: model κ = 0.537; reviewer κ = 0.577-0.703. The rescan model prediction area under the curve was 0.867.
Conclusion: The DL model showed a strong ability to differentiate diagnostic from nondiagnostic axial T2-weighted prostate images, accurately mimicking expert radiologists' IQ scores. Using the model, the clinical unnecessary rescan rate could be reduced from over 50% to less than 30%.
Relevance statement: DL assessment of T2-weighted prostate MRI scans can accurately assess IQ, determining the need to repeat inadequate scans as well as avoiding repeat scans of those with adequate diagnostic quality, resulting in reduced unnecessary rescanning.
Key points: Artificial intelligence assessment of prostate MRI T2-weighted image quality can improve exam time management. The model showed over 75% accuracy in assessing prostate MRI T2-weighted image quality. Expert radiologists have a substantial agreement in evaluating prostate MRI T2-weighted image quality.