Deep learning-based 3D classification of head and neck cancer PET/MRI: Radiologist comparison and Grad-CAM interpretability

IF 1.3 4区 医学 Q4 PHYSIOLOGY
Joonas Liedes, Jussi Hirvonen, Oona Rainio, Sarita Murtojärvi, Simona Malaspina, Riku Klén, Jukka Kemppainen
{"title":"Deep learning-based 3D classification of head and neck cancer PET/MRI: Radiologist comparison and Grad-CAM interpretability","authors":"Joonas Liedes,&nbsp;Jussi Hirvonen,&nbsp;Oona Rainio,&nbsp;Sarita Murtojärvi,&nbsp;Simona Malaspina,&nbsp;Riku Klén,&nbsp;Jukka Kemppainen","doi":"10.1111/cpf.70030","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Purpose</h3>\n \n <p>To develop and evaluate a three-dimensional convolutional neural network for automated classification of PET/MRI images in head and neck cancer (HNC) patients, assessing its performance against radiologist interpretation and its potential as a diagnostic aid.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Data from 202 patients with HNC who underwent <sup>18</sup>F-FDG PET/MRI were used to train and validate PET-, MRI-, and PET/MRI-based models. Of these data, 101 patients were labelled as positive in terms of having HNC, and 101 patients as negative. An additional test set of 20 patients was also evaluated, where 10 patients were labelled as positive and 10 as negative. The model performance was assessed using sensitivity, specificity, accuracy, and AUC. Grad-CAM was utilised to improve interpretability and classification results on the test set were compared with a radiologist.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The PET-based model achieved an AUC of 0.92 on the test set, with an accuracy of 90%, a sensitivity of 100% and a specificity of 80%. PET/MRI and MRI-based models underperformed relative to the PET-based model. The radiologist achieved perfect classification accuracy. Analysis of Grad-CAM showed that the model classifications are based on real areas of interest. In addition, it gave valuable insight into using similar systems in identifying false positive findings.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>The PET-based model demonstrated high sensitivity, indicating its potential as a pre-screening tool for HNC. However, specificity requires improvement to reduce false-positive rates. Enhanced datasets and refinement of model architecture will be crucial before clinical adoption. Grad-CAM provides valuable insights into model decisions, aiding clinical integration.</p>\n </section>\n </div>","PeriodicalId":10504,"journal":{"name":"Clinical Physiology and Functional Imaging","volume":"45 5","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/cpf.70030","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Physiology and Functional Imaging","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/cpf.70030","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHYSIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose

To develop and evaluate a three-dimensional convolutional neural network for automated classification of PET/MRI images in head and neck cancer (HNC) patients, assessing its performance against radiologist interpretation and its potential as a diagnostic aid.

Methods

Data from 202 patients with HNC who underwent 18F-FDG PET/MRI were used to train and validate PET-, MRI-, and PET/MRI-based models. Of these data, 101 patients were labelled as positive in terms of having HNC, and 101 patients as negative. An additional test set of 20 patients was also evaluated, where 10 patients were labelled as positive and 10 as negative. The model performance was assessed using sensitivity, specificity, accuracy, and AUC. Grad-CAM was utilised to improve interpretability and classification results on the test set were compared with a radiologist.

Results

The PET-based model achieved an AUC of 0.92 on the test set, with an accuracy of 90%, a sensitivity of 100% and a specificity of 80%. PET/MRI and MRI-based models underperformed relative to the PET-based model. The radiologist achieved perfect classification accuracy. Analysis of Grad-CAM showed that the model classifications are based on real areas of interest. In addition, it gave valuable insight into using similar systems in identifying false positive findings.

Conclusion

The PET-based model demonstrated high sensitivity, indicating its potential as a pre-screening tool for HNC. However, specificity requires improvement to reduce false-positive rates. Enhanced datasets and refinement of model architecture will be crucial before clinical adoption. Grad-CAM provides valuable insights into model decisions, aiding clinical integration.

Abstract Image

基于深度学习的头颈癌PET/MRI三维分类:放射科医师比较和Grad-CAM可解释性。
目的:开发和评估用于头颈癌(HNC)患者PET/MRI图像自动分类的三维卷积神经网络,评估其对放射科医生解释的表现及其作为诊断辅助的潜力。方法:使用202例接受18F-FDG PET/MRI检查的HNC患者的数据来训练和验证基于PET、MRI和PET/MRI的模型。在这些数据中,101例患者被标记为HNC阳性,101例患者被标记为阴性。另外还对20名患者进行了评估,其中10名患者被标记为阳性,10名患者被标记为阴性。通过敏感性、特异性、准确性和AUC评估模型的性能。使用Grad-CAM来提高可解释性,并将测试集上的分类结果与放射科医生进行比较。结果:基于pet的模型在测试集上的AUC为0.92,准确率为90%,灵敏度为100%,特异性为80%。PET/MRI和基于MRI的模型相对于基于PET的模型表现不佳。放射科医生达到了完美的分类准确度。对Grad-CAM的分析表明,模型分类是基于真实感兴趣的领域。此外,它还为使用类似系统识别假阳性结果提供了宝贵的见解。结论:基于pet的模型具有较高的敏感性,表明其具有作为HNC预筛选工具的潜力。然而,特异性需要改进以减少假阳性率。在临床应用之前,增强数据集和改进模型架构将是至关重要的。Grad-CAM为模型决策提供了有价值的见解,有助于临床整合。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.40
自引率
5.60%
发文量
62
审稿时长
6-12 weeks
期刊介绍: Clinical Physiology and Functional Imaging publishes reports on clinical and experimental research pertinent to human physiology in health and disease. The scope of the Journal is very broad, covering all aspects of the regulatory system in the cardiovascular, renal and pulmonary systems with special emphasis on methodological aspects. The focus for the journal is, however, work that has potential clinical relevance. The Journal also features review articles on recent front-line research within these fields of interest. Covered by the major abstracting services including Current Contents and Science Citation Index, Clinical Physiology and Functional Imaging plays an important role in providing effective and productive communication among clinical physiologists world-wide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信