ESC-DRKD：用于医学图像异常检测的基于跳跃连接的增强直接反向知识蒸馏

IF 6.5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Neurocomputing Pub Date : 2025-07-14 DOI:10.1016/j.neucom.2025.130994

Chenkun Ge , Xiaojun Yu , Hao Zheng , Zeming Fan , Umair Muhammad , Jinna Chen , Perry Ping Shum

{"title":"ESC-DRKD：用于医学图像异常检测的基于跳跃连接的增强直接反向知识蒸馏","authors":"Chenkun Ge , Xiaojun Yu , Hao Zheng , Zeming Fan , Umair Muhammad , Jinna Chen , Perry Ping Shum","doi":"10.1016/j.neucom.2025.130994","DOIUrl":null,"url":null,"abstract":"<div><div>Image anomaly detection has emerged as a prevalent research area in medical diagnosis, focusing on using models trained exclusively on normal samples to identify and locate anomalous images during testing. While many image anomaly detection (AD) models exhibit remarkable performance on industrial datasets, they frequently struggle to effectively detect anomalies in medical datasets characterized by complex distributions. This is because most methods are prone to overgeneralization, resulting in ineffective restoration of normal image features. To tackle this challenge, we introduce a novel approach called Enhanced Skip Connection-Based Direct Reverse Knowledge Distillation (ESC-DRKD), specifically designed to facilitate anomaly detection and localization in medical images. ESC-DRKD consists of pre-trained teacher encoders, trainable projection layers, and student decoders. Firstly, pseudo abnormal images are generated using the CutPaste method. By extracting multi-scale features from both normal and pseudo abnormal images through the pretrained teacher encoders, projection layers are employed to project the features of pseudo abnormal images onto important information of normal features. These projected features are then added to the outputs of corresponding student decoders at each level through skip connections to enhance the restoration of normal features. Furthermore, the output of the final layer of the teacher encoders serves as the input to the student decoders, thereby preventing the loss of normal information. Extensive experiments on various public medical datasets are tested the effectiveness of our proposed method. The results demonstrate that ESC-DRKD outperforms the state-of-the-art AD models on the five medical datasets, achieving an average improvement of over 3.0 % AUROC for anomaly detection. Code is available at <span><span>https://github.com/GE-123-cpu/ESC_DRKD</span><svg><path></path></svg></span></div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"651 ","pages":"Article 130994"},"PeriodicalIF":6.5000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ESC-DRKD: Enhanced skip connection-based direct reverse knowledge distillation for medical image anomaly detection\",\"authors\":\"Chenkun Ge , Xiaojun Yu , Hao Zheng , Zeming Fan , Umair Muhammad , Jinna Chen , Perry Ping Shum\",\"doi\":\"10.1016/j.neucom.2025.130994\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Image anomaly detection has emerged as a prevalent research area in medical diagnosis, focusing on using models trained exclusively on normal samples to identify and locate anomalous images during testing. While many image anomaly detection (AD) models exhibit remarkable performance on industrial datasets, they frequently struggle to effectively detect anomalies in medical datasets characterized by complex distributions. This is because most methods are prone to overgeneralization, resulting in ineffective restoration of normal image features. To tackle this challenge, we introduce a novel approach called Enhanced Skip Connection-Based Direct Reverse Knowledge Distillation (ESC-DRKD), specifically designed to facilitate anomaly detection and localization in medical images. ESC-DRKD consists of pre-trained teacher encoders, trainable projection layers, and student decoders. Firstly, pseudo abnormal images are generated using the CutPaste method. By extracting multi-scale features from both normal and pseudo abnormal images through the pretrained teacher encoders, projection layers are employed to project the features of pseudo abnormal images onto important information of normal features. These projected features are then added to the outputs of corresponding student decoders at each level through skip connections to enhance the restoration of normal features. Furthermore, the output of the final layer of the teacher encoders serves as the input to the student decoders, thereby preventing the loss of normal information. Extensive experiments on various public medical datasets are tested the effectiveness of our proposed method. The results demonstrate that ESC-DRKD outperforms the state-of-the-art AD models on the five medical datasets, achieving an average improvement of over 3.0 % AUROC for anomaly detection. Code is available at <span><span>https://github.com/GE-123-cpu/ESC_DRKD</span><svg><path></path></svg></span></div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"651 \",\"pages\":\"Article 130994\"},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2025-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231225016662\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225016662","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

图像异常检测已成为医学诊断中的一个热门研究领域，其重点是在检测过程中使用仅对正常样本进行训练的模型来识别和定位异常图像。虽然许多图像异常检测（AD）模型在工业数据集上表现出卓越的性能，但它们往往难以有效地检测到分布复杂的医疗数据集中的异常。这是因为大多数方法容易过度泛化，导致无法恢复正常的图像特征。为了应对这一挑战，我们引入了一种新的方法，称为基于增强跳跃连接的直接反向知识蒸馏（ESC-DRKD），专门用于促进医学图像中的异常检测和定位。ESC-DRKD由预训练的教师编码器、可训练的投影层和学生解码器组成。首先，利用剪切粘贴方法生成伪异常图像。通过预训练的教师编码器从正常和伪异常图像中提取多尺度特征，利用投影层将伪异常图像的特征投影到正常特征的重要信息上。然后通过跳过连接将这些投影特征添加到每个级别的相应学生解码器的输出中，以增强正常特征的恢复。此外，教师编码器的最后一层的输出作为学生解码器的输入，从而防止了正常信息的丢失。在各种公共医疗数据集上进行了大量实验，测试了我们提出的方法的有效性。结果表明，ESC-DRKD在五个医疗数据集上优于最先进的AD模型，异常检测的平均AUROC提高超过3.0 %。代码可从https://github.com/GE-123-cpu/ESC_DRKD获得

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

ESC-DRKD: Enhanced skip connection-based direct reverse knowledge distillation for medical image anomaly detection

Image anomaly detection has emerged as a prevalent research area in medical diagnosis, focusing on using models trained exclusively on normal samples to identify and locate anomalous images during testing. While many image anomaly detection (AD) models exhibit remarkable performance on industrial datasets, they frequently struggle to effectively detect anomalies in medical datasets characterized by complex distributions. This is because most methods are prone to overgeneralization, resulting in ineffective restoration of normal image features. To tackle this challenge, we introduce a novel approach called Enhanced Skip Connection-Based Direct Reverse Knowledge Distillation (ESC-DRKD), specifically designed to facilitate anomaly detection and localization in medical images. ESC-DRKD consists of pre-trained teacher encoders, trainable projection layers, and student decoders. Firstly, pseudo abnormal images are generated using the CutPaste method. By extracting multi-scale features from both normal and pseudo abnormal images through the pretrained teacher encoders, projection layers are employed to project the features of pseudo abnormal images onto important information of normal features. These projected features are then added to the outputs of corresponding student decoders at each level through skip connections to enhance the restoration of normal features. Furthermore, the output of the final layer of the teacher encoders serves as the input to the student decoders, thereby preventing the loss of normal information. Extensive experiments on various public medical datasets are tested the effectiveness of our proposed method. The results demonstrate that ESC-DRKD outperforms the state-of-the-art AD models on the five medical datasets, achieving an average improvement of over 3.0 % AUROC for anomaly detection. Code is available at https://github.com/GE-123-cpu/ESC_DRKD

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Neurocomputing 工程技术-计算机：人工智能

CiteScore

13.10

自引率

10.00%

发文量

1382

审稿时长

70 days

期刊介绍： Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.