Interpretable and Open AI Models: A Mandate for the Future of HCC Diagnostics

IF 6 2区 医学 Q1 GASTROENTEROLOGY & HEPATOLOGY
Simone Famularo, Luca Boldrini, Matteo Donadon, Zenichi Morise
{"title":"Interpretable and Open AI Models: A Mandate for the Future of HCC Diagnostics","authors":"Simone Famularo,&nbsp;Luca Boldrini,&nbsp;Matteo Donadon,&nbsp;Zenichi Morise","doi":"10.1111/liv.16229","DOIUrl":null,"url":null,"abstract":"<p>We read with interest the recent study by Zheng et al. [<span>1</span>] that introduces an innovative MRI-based topology deep learning (DL) model achieving high diagnostic accuracy in the prediction microvascular invasion (MVI), highlighting a key development in the integration of artificial intelligence (AI) into oncologic imaging. Microvascular invasion is among the critical determinants of prognosis in HCC and is typically associated with early recurrence and poor survival outcomes. Accurate prediction of MVI is therefore paramount, not only to guide therapeutic decisions [<span>2</span>] but also to stratify patients for clinical trials. HCC begins often as a tumour with less-invasiveness in the early stages of its development. In the stage, it grows not invasively, forming a non-cancerous tissue capsule of compressed surrounding tissue. In liver resection for HCC, resection at the capsule level is sometimes considered R0 resection. However, during the developmental process, cancer cells invade outside the capsule and into vessels, such as the portal vein. After going through this step, the aggressiveness of HCC increases rapidly, and surgical intervention based on the aforementioned recognition cannot obtain a sufficient therapeutic effect. Therefore, the macroscopic type is strongly related to its prognosis after intervention [<span>3, 4</span>]. It has long been pointed out the difficulty connecting these findings to surgical outcomes. Present research by Zheng et al. suggests that new insights may be obtained by connecting detailed findings (not limited to morphological changes) of MRI to topology and putting them in AI analysis. Although minute extracapsular and/or vascular invasions can be confirmed in postoperative pathology, reliable preoperative imaging modality for their early detection has not yet been established.</p><p>Although the study represents a remarkable technical achievement, it also invites broader reflection on the role and responsibilities of AI in clinical decision-making: first, the development of interpretable AI models that clinicians can trust and understand; and second, the democratisation of these tools through open-source frameworks to ensure their widespread validation and application.</p><p>The growing field of AI in medicine has transformed how we approach complex problems, particularly in diagnostic imaging. Advances in convolutional neural networks (CNNs) and, more recently, topological data analysis (TDA), have enabled models to extract nuanced patterns from imaging data, surpassing the diagnostic capabilities of traditional radiological methods [<span>5</span>]. In the context of HCC, the concept of a ‘virtual biopsy’ is particularly compelling. By inferring histopathological features, such as MVI from imaging data alone, these models could obviate the need for invasive procedures, reduce patient morbidity and accelerate clinical decision-making. This field is rapidly developing, and the use of a combination of clinical and quantitative imaging data has already demonstrated the great potential that a virtual biopsy can have in managing these patients [<span>6</span>].</p><p>However, as the complexity of AI tools increases, so can their opacity [<span>7</span>] for the end user. The majority of deep learning models function as ‘black boxes’, producing predictions without offering insights into their reasoning. This lack of transparency is a significant barrier to their adoption in clinical practice, where medical professionals must understand and trust the rationale behind algorithm-driven recommendations. In high-stakes decisions, such as whether to recommend anatomic hepatic resection over less invasive treatments, or to allocate patients to transplant rather than other therapies, blind reliance on machine-generated outputs is not only undesirable but also ethically irresponsible. The main question is: Are we ready to make clinical choices for patients derived from algorithms whose reasoning and conclusions we do not fully understand?</p><p>The application of TDA proposed by Zheng et al. represents a commendable step towards improving the interpretability of AI models. By encoding spatial and structural relationships within MRI data, TDA enhances the model's ability to capture biologically relevant features, bridging the gap between computational output and clinical reasoning. The incorporation of saliency maps to highlight regions contributing to MVI predictions further demonstrates a commitment to transparency. However, the achieved interpretability still remains insufficient unless paired with broader accessibility and clinical integration.</p><p>Moreover, the utility of these models is often constrained by their proprietary nature. Closed-source algorithms hinder independent validation and limit their applicability to broader patient populations.</p><p>In the case of the model presented in this publication, its training and validation were restricted to patients within BCLC stages A and B, a subgroup that does not encompass the full spectrum of HCC presentations.</p><p>This is an important limit, as the presence of MVI can be detected in any stage. According to the Italian national register (HE.RC.O.LE.S.), it has been detected in 58.2% among BCLC 0-A, 47.9% among BCLC B and 41.9% among BCLC-C cases (Table 1). Furthermore, the BCLC therapeutic decision algorithm has recently been totally superseded by the ‘therapeutic hierarchy’ approach [<span>8</span>], which clearly indicates that surgery can be the first-line treatment regardless of the oncological stage. In this setting, limiting the possibility of applying the virtual biopsy proposed by the authors only to cases with less advanced disease, severely limits the applicability of the algorithm itself in everyday practice. The limitation of this model to BCLC stages A and B patients exemplifies a recurring limitation in AI-driven research: the disconnection between experimental design and real-world data diversity. Clinicians treating HCC encounter a spectrum of presentations, from early-stage lesions amenable to curative resection to advanced, multifocal disease requiring palliative interventions. Models confined to narrow patient subsets risk perpetuating inequities in access to care, particularly for underserved populations disproportionately affected by advanced-stage HCC.</p><p>Another significant limitation lies in the study's proprietary nature. To fully realise the promise of AI in medicine, the development of open, interpretable and universally accessible models is non-negotiable. Although the authors report promising external validation results, independent replication by other institutions is essential to establish robustness and generalisability and should be encouraged at best. This result cannot be achieved simply with data from two centres, but it is necessary to freely allow the use of the model. Although open-source frameworks foster collaboration and enhance the robustness of AI models, the development and deployment of such tools must be guided by clear regulatory frameworks. Without structured oversight, the integration of diverse and potentially biased datasets into open models risks introducing variability that may confuse real-world clinical applications. Establishing guidelines to manage data heterogeneity and ensure model reliability across populations is essential for mitigating these challenges.</p><p>In conclusion, the study by Zheng et al. underscores the transformative potential of AI in HCC management but also highlights the ethical and practical challenges that accompany these advancements. Interpretability and openness are not ancillary concerns; they are essential to the responsible integration of AI into medicine. As we navigate the rapidly evolving landscape of AI-driven diagnostics, we must remain steadfast in our commitment to these principles, ensuring that technological progress reliably serves the human elements of care.</p><p>The authors declare no conflicts of interest.</p>","PeriodicalId":18101,"journal":{"name":"Liver International","volume":"45 4","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/liv.16229","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Liver International","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/liv.16229","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

We read with interest the recent study by Zheng et al. [1] that introduces an innovative MRI-based topology deep learning (DL) model achieving high diagnostic accuracy in the prediction microvascular invasion (MVI), highlighting a key development in the integration of artificial intelligence (AI) into oncologic imaging. Microvascular invasion is among the critical determinants of prognosis in HCC and is typically associated with early recurrence and poor survival outcomes. Accurate prediction of MVI is therefore paramount, not only to guide therapeutic decisions [2] but also to stratify patients for clinical trials. HCC begins often as a tumour with less-invasiveness in the early stages of its development. In the stage, it grows not invasively, forming a non-cancerous tissue capsule of compressed surrounding tissue. In liver resection for HCC, resection at the capsule level is sometimes considered R0 resection. However, during the developmental process, cancer cells invade outside the capsule and into vessels, such as the portal vein. After going through this step, the aggressiveness of HCC increases rapidly, and surgical intervention based on the aforementioned recognition cannot obtain a sufficient therapeutic effect. Therefore, the macroscopic type is strongly related to its prognosis after intervention [3, 4]. It has long been pointed out the difficulty connecting these findings to surgical outcomes. Present research by Zheng et al. suggests that new insights may be obtained by connecting detailed findings (not limited to morphological changes) of MRI to topology and putting them in AI analysis. Although minute extracapsular and/or vascular invasions can be confirmed in postoperative pathology, reliable preoperative imaging modality for their early detection has not yet been established.

Although the study represents a remarkable technical achievement, it also invites broader reflection on the role and responsibilities of AI in clinical decision-making: first, the development of interpretable AI models that clinicians can trust and understand; and second, the democratisation of these tools through open-source frameworks to ensure their widespread validation and application.

The growing field of AI in medicine has transformed how we approach complex problems, particularly in diagnostic imaging. Advances in convolutional neural networks (CNNs) and, more recently, topological data analysis (TDA), have enabled models to extract nuanced patterns from imaging data, surpassing the diagnostic capabilities of traditional radiological methods [5]. In the context of HCC, the concept of a ‘virtual biopsy’ is particularly compelling. By inferring histopathological features, such as MVI from imaging data alone, these models could obviate the need for invasive procedures, reduce patient morbidity and accelerate clinical decision-making. This field is rapidly developing, and the use of a combination of clinical and quantitative imaging data has already demonstrated the great potential that a virtual biopsy can have in managing these patients [6].

However, as the complexity of AI tools increases, so can their opacity [7] for the end user. The majority of deep learning models function as ‘black boxes’, producing predictions without offering insights into their reasoning. This lack of transparency is a significant barrier to their adoption in clinical practice, where medical professionals must understand and trust the rationale behind algorithm-driven recommendations. In high-stakes decisions, such as whether to recommend anatomic hepatic resection over less invasive treatments, or to allocate patients to transplant rather than other therapies, blind reliance on machine-generated outputs is not only undesirable but also ethically irresponsible. The main question is: Are we ready to make clinical choices for patients derived from algorithms whose reasoning and conclusions we do not fully understand?

The application of TDA proposed by Zheng et al. represents a commendable step towards improving the interpretability of AI models. By encoding spatial and structural relationships within MRI data, TDA enhances the model's ability to capture biologically relevant features, bridging the gap between computational output and clinical reasoning. The incorporation of saliency maps to highlight regions contributing to MVI predictions further demonstrates a commitment to transparency. However, the achieved interpretability still remains insufficient unless paired with broader accessibility and clinical integration.

Moreover, the utility of these models is often constrained by their proprietary nature. Closed-source algorithms hinder independent validation and limit their applicability to broader patient populations.

In the case of the model presented in this publication, its training and validation were restricted to patients within BCLC stages A and B, a subgroup that does not encompass the full spectrum of HCC presentations.

This is an important limit, as the presence of MVI can be detected in any stage. According to the Italian national register (HE.RC.O.LE.S.), it has been detected in 58.2% among BCLC 0-A, 47.9% among BCLC B and 41.9% among BCLC-C cases (Table 1). Furthermore, the BCLC therapeutic decision algorithm has recently been totally superseded by the ‘therapeutic hierarchy’ approach [8], which clearly indicates that surgery can be the first-line treatment regardless of the oncological stage. In this setting, limiting the possibility of applying the virtual biopsy proposed by the authors only to cases with less advanced disease, severely limits the applicability of the algorithm itself in everyday practice. The limitation of this model to BCLC stages A and B patients exemplifies a recurring limitation in AI-driven research: the disconnection between experimental design and real-world data diversity. Clinicians treating HCC encounter a spectrum of presentations, from early-stage lesions amenable to curative resection to advanced, multifocal disease requiring palliative interventions. Models confined to narrow patient subsets risk perpetuating inequities in access to care, particularly for underserved populations disproportionately affected by advanced-stage HCC.

Another significant limitation lies in the study's proprietary nature. To fully realise the promise of AI in medicine, the development of open, interpretable and universally accessible models is non-negotiable. Although the authors report promising external validation results, independent replication by other institutions is essential to establish robustness and generalisability and should be encouraged at best. This result cannot be achieved simply with data from two centres, but it is necessary to freely allow the use of the model. Although open-source frameworks foster collaboration and enhance the robustness of AI models, the development and deployment of such tools must be guided by clear regulatory frameworks. Without structured oversight, the integration of diverse and potentially biased datasets into open models risks introducing variability that may confuse real-world clinical applications. Establishing guidelines to manage data heterogeneity and ensure model reliability across populations is essential for mitigating these challenges.

In conclusion, the study by Zheng et al. underscores the transformative potential of AI in HCC management but also highlights the ethical and practical challenges that accompany these advancements. Interpretability and openness are not ancillary concerns; they are essential to the responsible integration of AI into medicine. As we navigate the rapidly evolving landscape of AI-driven diagnostics, we must remain steadfast in our commitment to these principles, ensuring that technological progress reliably serves the human elements of care.

The authors declare no conflicts of interest.

可解释和开放的人工智能模型:HCC诊断的未来
我们饶有兴趣地阅读了Zheng等人最近的一项研究,该研究引入了一种创新的基于mri的拓扑深度学习(DL)模型,在预测微血管侵袭(MVI)方面实现了高诊断准确性,突出了人工智能(AI)与肿瘤成像集成的关键发展。微血管侵袭是HCC预后的关键决定因素之一,通常与早期复发和较差的生存结果相关。因此,准确预测MVI是至关重要的,不仅可以指导治疗决策,而且可以对临床试验的患者进行分层。HCC在其发展的早期阶段通常是一种侵袭性较低的肿瘤。在这个阶段,它的生长没有侵入性,形成一个压缩周围组织的非癌性组织囊。在肝癌肝切除术中,包膜水平的切除有时被认为是R0切除。然而,在发育过程中,癌细胞侵入囊外的血管,如门静脉。经过这一步骤后,HCC的侵袭性迅速增加,基于上述认识的手术干预不能获得足够的治疗效果。因此,宏观类型与其干预后的预后密切相关[3,4]。人们早就指出,很难将这些发现与手术结果联系起来。Zheng等人目前的研究表明,通过将MRI的详细发现(不限于形态学变化)与拓扑联系起来,并将其用于AI分析,可能会获得新的见解。尽管微小的囊外和/或血管侵犯可以在术后病理中得到证实,但可靠的术前成像方式尚未建立。尽管这项研究代表了一项非凡的技术成就,但它也引发了对人工智能在临床决策中的作用和责任的更广泛反思:首先,开发临床医生可以信任和理解的可解释人工智能模型;第二,通过开源框架使这些工具民主化,以确保它们得到广泛的验证和应用。人工智能在医学领域的不断发展改变了我们处理复杂问题的方式,尤其是在诊断成像方面。卷积神经网络(cnn)和最近的拓扑数据分析(TDA)的进步,使模型能够从成像数据中提取细微的模式,超越了传统放射学方法的诊断能力。在HCC的背景下,“虚拟活检”的概念尤其引人注目。通过仅从成像数据推断组织病理学特征,例如MVI,这些模型可以避免侵入性手术的需要,降低患者发病率并加快临床决策。这一领域正在迅速发展,结合临床和定量成像数据的使用已经证明了虚拟活检在管理这些患者方面的巨大潜力。然而,随着人工智能工具的复杂性增加,它们对最终用户的不透明度也会增加。大多数深度学习模型的功能就像“黑盒子”,产生预测,而不提供对其推理的见解。缺乏透明度是在临床实践中采用它们的一个重大障碍,在临床实践中,医疗专业人员必须理解和信任算法驱动的建议背后的基本原理。在高风险的决策中,比如是否推荐解剖性肝切除术而不是侵入性较小的治疗,或者是否将患者分配给移植而不是其他治疗,盲目依赖机器生成的输出不仅是不可取的,而且在伦理上是不负责任的。主要的问题是:我们是否准备好为那些我们不能完全理解其推理和结论的算法得出的患者做出临床选择?Zheng等人提出的TDA的应用是朝着提高人工智能模型的可解释性迈出的值得称赞的一步。通过编码MRI数据中的空间和结构关系,TDA增强了模型捕捉生物学相关特征的能力,弥合了计算输出和临床推理之间的差距。纳入显著性地图以突出对MVI预测有贡献的区域,进一步表明了对透明度的承诺。然而,除非与更广泛的可及性和临床整合相结合,否则实现的可解释性仍然不足。此外,这些模型的效用经常受到其专有性质的限制。闭源算法阻碍了独立验证,并限制了它们对更广泛患者群体的适用性。就本出版物中提出的模型而言,其训练和验证仅限于BCLC A期和B期患者,这一亚组不包括所有HCC表现。 这是一个重要的限制,因为在任何阶段都可以检测到MVI的存在。根据意大利国家登记(he . rc . o.l es), BCLC 0-A病例中有58.2%,BCLC B病例中有47.9%,BCLC- c病例中有41.9%(表1)。此外,BCLC治疗决策算法最近已完全被“治疗层次”方法[8]所取代,这清楚地表明,无论肿瘤分期如何,手术都可以成为一线治疗方法。在这种情况下,将作者提出的虚拟活检只应用于病情不太严重的病例的可能性限制了该算法本身在日常实践中的适用性。该模型对BCLC A期和B期患者的限制体现了人工智能驱动研究中反复出现的局限性:实验设计与现实世界数据多样性之间的脱节。临床医生治疗HCC会遇到各种各样的症状,从早期病变可以根治性切除到晚期多灶性疾病需要姑息性干预。局限于狭窄患者亚群的模型有可能使获得医疗服务的不公平现象长期存在,特别是对于受晚期HCC影响不成比例的服务不足人群。另一个重要的限制在于这项研究的专有性质。为了充分实现人工智能在医学中的前景,开发开放、可解释和普遍可访问的模型是不容置疑的。尽管作者报告了有希望的外部验证结果,但其他机构的独立复制对于建立稳健性和普遍性至关重要,最好是鼓励。这一结果不能简单地通过两个中心的数据来实现,但有必要自由地允许使用该模型。尽管开源框架促进了协作并增强了人工智能模型的稳健性,但此类工具的开发和部署必须以明确的监管框架为指导。如果没有结构化的监督,将不同的、可能有偏见的数据集集成到开放模型中,可能会引入可变性,从而混淆现实世界的临床应用。建立指导方针来管理数据异构性并确保跨人群的模型可靠性,对于缓解这些挑战至关重要。总之,Zheng等人的研究强调了人工智能在HCC管理中的变革潜力,但也强调了伴随这些进步的伦理和实践挑战。可解释性和开放性不是附属问题;它们对于负责任地将人工智能融入医学至关重要。当我们在人工智能驱动的诊断领域快速发展时,我们必须坚定不移地信守这些原则,确保技术进步可靠地服务于人类护理要素。作者声明无利益冲突。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Liver International
Liver International 医学-胃肠肝病学
CiteScore
13.90
自引率
4.50%
发文量
348
审稿时长
2 months
期刊介绍: Liver International promotes all aspects of the science of hepatology from basic research to applied clinical studies. Providing an international forum for the publication of high-quality original research in hepatology, it is an essential resource for everyone working on normal and abnormal structure and function in the liver and its constituent cells, including clinicians and basic scientists involved in the multi-disciplinary field of hepatology. The journal welcomes articles from all fields of hepatology, which may be published as original articles, brief definitive reports, reviews, mini-reviews, images in hepatology and letters to the Editor.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信