Reconstruction error based implicit regularization method and its engineering application to lung cancer diagnosis

IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Qinghe Zheng , Xinyu Tian , Mingqiang Yang , Shuang Han , Abdussalam Elhanashi , Sergio Saponara , Kidiyo Kpalma
{"title":"Reconstruction error based implicit regularization method and its engineering application to lung cancer diagnosis","authors":"Qinghe Zheng ,&nbsp;Xinyu Tian ,&nbsp;Mingqiang Yang ,&nbsp;Shuang Han ,&nbsp;Abdussalam Elhanashi ,&nbsp;Sergio Saponara ,&nbsp;Kidiyo Kpalma","doi":"10.1016/j.engappai.2024.109439","DOIUrl":null,"url":null,"abstract":"<div><div>The automatic diagnosis of lung cancer via artificial intelligence faces two hotspot issues: (1) insufficient data and (2) excessive redundant information, which make it difficult for convolutional neural networks (CNNs) to learn discriminative information of lung cancer. In this paper, we present the reconstruction error based implicit regularization method (REbIRM) that regularizes CNNs at the loss layer. During each training iteration, the reconstruction errors introduced by the two-stage discriminative auto-encoder are used to sharpen the generalization ability of deep CNNs by improving the decision boundary. In the application process, the trained deep CNN is used for completing computed tomography (CT) diagnostics. The main clinical benefit of our approach is that it is domain independent, requiring no specialized knowledge, and can therefore be applied to different types of datasets. To the best of our knowledge, this is the first attempt to implicitly regularize CNNs based on the reconstruction errors. Finally, experimental results on three CT image classification datasets show that REbIRM can achieve impressive results and that, in conjunction with Dropout, it obtains the state-of-the-art performance. REbIRM is also robust to the selection of hyper-parameters and only has the sublinear influence on the convergence of deep CNNs. Besides, empirical and theoretical evidence are provided to indicate that REbIRM prefers to converges in a constrained parameter space with flatter minima, which explains why it can generalize to new data. Finally, the nature of REbIRM is further explored through visualization techniques to analyze how it works in training deep CNNs.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624015975","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

The automatic diagnosis of lung cancer via artificial intelligence faces two hotspot issues: (1) insufficient data and (2) excessive redundant information, which make it difficult for convolutional neural networks (CNNs) to learn discriminative information of lung cancer. In this paper, we present the reconstruction error based implicit regularization method (REbIRM) that regularizes CNNs at the loss layer. During each training iteration, the reconstruction errors introduced by the two-stage discriminative auto-encoder are used to sharpen the generalization ability of deep CNNs by improving the decision boundary. In the application process, the trained deep CNN is used for completing computed tomography (CT) diagnostics. The main clinical benefit of our approach is that it is domain independent, requiring no specialized knowledge, and can therefore be applied to different types of datasets. To the best of our knowledge, this is the first attempt to implicitly regularize CNNs based on the reconstruction errors. Finally, experimental results on three CT image classification datasets show that REbIRM can achieve impressive results and that, in conjunction with Dropout, it obtains the state-of-the-art performance. REbIRM is also robust to the selection of hyper-parameters and only has the sublinear influence on the convergence of deep CNNs. Besides, empirical and theoretical evidence are provided to indicate that REbIRM prefers to converges in a constrained parameter space with flatter minima, which explains why it can generalize to new data. Finally, the nature of REbIRM is further explored through visualization techniques to analyze how it works in training deep CNNs.
基于重建误差的隐式正则化方法及其在肺癌诊断中的工程应用
人工智能对肺癌的自动诊断面临两个热点问题:(1)数据不足;(2)冗余信息过多,这使得卷积神经网络(CNN)难以学习肺癌的鉴别信息。本文提出了基于重构误差的隐式正则化方法(REbIRM),该方法可在损失层对卷积神经网络进行正则化。在每次训练迭代中,两级判别自动编码器引入的重构误差被用于通过改善决策边界来增强深度 CNN 的泛化能力。在应用过程中,训练好的深度 CNN 被用于完成计算机断层扫描(CT)诊断。我们的方法的主要临床优势在于它不受领域限制,不需要专业知识,因此可以应用于不同类型的数据集。据我们所知,这是首次尝试根据重建误差对 CNN 进行隐式正则化。最后,在三个 CT 图像分类数据集上的实验结果表明,REbIRM 可以获得令人印象深刻的结果,与 Dropout 结合使用,它可以获得最先进的性能。REbIRM 对超参数的选择也很稳健,对深度 CNN 的收敛性只有亚线性影响。此外,经验和理论证据还表明,REbIRM 更倾向于在具有更平坦最小值的受限参数空间中收敛,这也解释了为什么它可以泛化到新数据。最后,通过可视化技术进一步探索了 REbIRM 的本质,分析了它在训练深度 CNN 时的工作原理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信