Leveraging prior knowledge in machine intelligence to improve lesion diagnosis for early cancer detection.

Medical physics Pub Date : 2025-04-23 DOI:10.1002/mp.17841
Zhengrong J Liang, Shaojie Chang, Yongfeng Gao, Weiguo Cao, Licheng R Kuo, Marc J Pomeroy, Lihong C Li, Almas F Abbasi, Jela Bandovic, Michael J Reiter, Perry J Pickhardt
{"title":"Leveraging prior knowledge in machine intelligence to improve lesion diagnosis for early cancer detection.","authors":"Zhengrong J Liang, Shaojie Chang, Yongfeng Gao, Weiguo Cao, Licheng R Kuo, Marc J Pomeroy, Lihong C Li, Almas F Abbasi, Jela Bandovic, Michael J Reiter, Perry J Pickhardt","doi":"10.1002/mp.17841","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Experts' interpretations of medical images for lesion diagnosis may not always align with the underlying in vivo tissue pathology and, therefore, cannot be considered the definitive truth regarding malignancy or benignity. While current machine learning (ML) models in medical imaging can replicate expert interpretations, their results may also diverge from the actual ground truth.</p><p><strong>Purpose: </strong>This study investigates various factors contributing to these discrepancies and proposes solutions.</p><p><strong>Methods: </strong>The central idea of the proposed solution is to integrate prior knowledge into ML models to enhance the characterization of in vivo tissues. The incorporation of prior knowledge into decision-making is task-specific, tailored to the data acquired for that task. This central idea was tested on the diagnosis of lesions using low dose computed tomography (LdCT) for early cancer detection, particularly focusing on more challenging, ambiguous or indeterminate lesions (IDLs) as classified by experts. One key piece of prior knowledge involves CT x-ray energy spectrum, where different energies interact with in vivo tissues within a lesion, producing variable but reproducible image contrasts that encapsulate biological information. Typically, CT imaging devices use only the high-energy portion of this spectrum for data acquisition; however, this study considers the full spectrum for lesion diagnostics. Another critical aspect of prior knowledge includes the functional or dynamic properties of in vivo tissues, such as elasticity, which can indicate pathological conditions. Instead of relying solely on abstract image features as current ML models do, this study extracts these tissue pathological characteristics from the image contrast variations.</p><p><strong>Results: </strong>The method was tested on LdCT images of four sets of IDLs, including pulmonary nodules and colorectal polyps, with pathological reports serving as the ground truth for malignancy or benignity. The method achieved an area under the receiver operating characteristic curve (AUC) of 0.98 ± 0.03, demonstrating a significant improvement over existing state-of-the-art ML models, which typically have AUCs in the 0.70 range.</p><p><strong>Conclusion: </strong>Leveraging prior knowledge in machine intelligence can enhance lesion diagnosis, resolve the ambiguity of IDLs interpreted by experts, and improve the effectiveness of LdCT screening for early-stage cancers.</p>","PeriodicalId":94136,"journal":{"name":"Medical physics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/mp.17841","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Experts' interpretations of medical images for lesion diagnosis may not always align with the underlying in vivo tissue pathology and, therefore, cannot be considered the definitive truth regarding malignancy or benignity. While current machine learning (ML) models in medical imaging can replicate expert interpretations, their results may also diverge from the actual ground truth.

Purpose: This study investigates various factors contributing to these discrepancies and proposes solutions.

Methods: The central idea of the proposed solution is to integrate prior knowledge into ML models to enhance the characterization of in vivo tissues. The incorporation of prior knowledge into decision-making is task-specific, tailored to the data acquired for that task. This central idea was tested on the diagnosis of lesions using low dose computed tomography (LdCT) for early cancer detection, particularly focusing on more challenging, ambiguous or indeterminate lesions (IDLs) as classified by experts. One key piece of prior knowledge involves CT x-ray energy spectrum, where different energies interact with in vivo tissues within a lesion, producing variable but reproducible image contrasts that encapsulate biological information. Typically, CT imaging devices use only the high-energy portion of this spectrum for data acquisition; however, this study considers the full spectrum for lesion diagnostics. Another critical aspect of prior knowledge includes the functional or dynamic properties of in vivo tissues, such as elasticity, which can indicate pathological conditions. Instead of relying solely on abstract image features as current ML models do, this study extracts these tissue pathological characteristics from the image contrast variations.

Results: The method was tested on LdCT images of four sets of IDLs, including pulmonary nodules and colorectal polyps, with pathological reports serving as the ground truth for malignancy or benignity. The method achieved an area under the receiver operating characteristic curve (AUC) of 0.98 ± 0.03, demonstrating a significant improvement over existing state-of-the-art ML models, which typically have AUCs in the 0.70 range.

Conclusion: Leveraging prior knowledge in machine intelligence can enhance lesion diagnosis, resolve the ambiguity of IDLs interpreted by experts, and improve the effectiveness of LdCT screening for early-stage cancers.

利用机器智能中的先验知识来提高早期癌症检测的病变诊断。
背景:专家对病变诊断的医学图像的解释可能并不总是与潜在的体内组织病理一致,因此,不能被认为是恶性或良性的最终真相。虽然目前医学成像中的机器学习(ML)模型可以复制专家的解释,但它们的结果也可能偏离实际的基础事实。目的:本研究探讨造成这些差异的各种因素,并提出解决方法。方法:提出的解决方案的中心思想是将先验知识整合到ML模型中,以增强体内组织的表征。将先验知识纳入决策是针对特定任务的,是针对为该任务获得的数据量身定制的。这一中心思想在使用低剂量计算机断层扫描(LdCT)进行早期癌症检测的病变诊断中得到了验证,特别是针对专家分类的更具挑战性、模棱两可或不确定的病变(idl)。一个关键的先验知识涉及CT x射线能谱,其中不同的能量与病变内的体内组织相互作用,产生可变但可重复的图像对比,封装生物信息。通常,CT成像设备仅使用该光谱的高能部分进行数据采集;然而,本研究考虑了病变诊断的全谱。先验知识的另一个关键方面包括体内组织的功能或动态特性,例如弹性,这可以指示病理状况。与目前的ML模型完全依赖抽象的图像特征不同,本研究从图像对比度变化中提取这些组织病理特征。结果:该方法在肺结节、结直肠息肉等4组idl的LdCT图像上进行了验证,病理报告可作为判断其恶性或良性的基本依据。该方法实现了受试者工作特征曲线(AUC)下的面积为0.98±0.03,与现有最先进的ML模型相比有了显着改进,这些模型的AUC通常在0.70范围内。结论:利用机器智能中的先验知识可以增强病变诊断,解决专家对idl解释的模糊性,提高LdCT筛查早期癌症的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信