Chenkun Ge , Xiaojun Yu , Hao Zheng , Zeming Fan , Umair Muhammad , Jinna Chen , Perry Ping Shum
{"title":"ESC-DRKD:用于医学图像异常检测的基于跳跃连接的增强直接反向知识蒸馏","authors":"Chenkun Ge , Xiaojun Yu , Hao Zheng , Zeming Fan , Umair Muhammad , Jinna Chen , Perry Ping Shum","doi":"10.1016/j.neucom.2025.130994","DOIUrl":null,"url":null,"abstract":"<div><div>Image anomaly detection has emerged as a prevalent research area in medical diagnosis, focusing on using models trained exclusively on normal samples to identify and locate anomalous images during testing. While many image anomaly detection (AD) models exhibit remarkable performance on industrial datasets, they frequently struggle to effectively detect anomalies in medical datasets characterized by complex distributions. This is because most methods are prone to overgeneralization, resulting in ineffective restoration of normal image features. To tackle this challenge, we introduce a novel approach called Enhanced Skip Connection-Based Direct Reverse Knowledge Distillation (ESC-DRKD), specifically designed to facilitate anomaly detection and localization in medical images. ESC-DRKD consists of pre-trained teacher encoders, trainable projection layers, and student decoders. Firstly, pseudo abnormal images are generated using the CutPaste method. By extracting multi-scale features from both normal and pseudo abnormal images through the pretrained teacher encoders, projection layers are employed to project the features of pseudo abnormal images onto important information of normal features. These projected features are then added to the outputs of corresponding student decoders at each level through skip connections to enhance the restoration of normal features. Furthermore, the output of the final layer of the teacher encoders serves as the input to the student decoders, thereby preventing the loss of normal information. Extensive experiments on various public medical datasets are tested the effectiveness of our proposed method. The results demonstrate that ESC-DRKD outperforms the state-of-the-art AD models on the five medical datasets, achieving an average improvement of over 3.0 % AUROC for anomaly detection. Code is available at <span><span>https://github.com/GE-123-cpu/ESC_DRKD</span><svg><path></path></svg></span></div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"651 ","pages":"Article 130994"},"PeriodicalIF":6.5000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ESC-DRKD: Enhanced skip connection-based direct reverse knowledge distillation for medical image anomaly detection\",\"authors\":\"Chenkun Ge , Xiaojun Yu , Hao Zheng , Zeming Fan , Umair Muhammad , Jinna Chen , Perry Ping Shum\",\"doi\":\"10.1016/j.neucom.2025.130994\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Image anomaly detection has emerged as a prevalent research area in medical diagnosis, focusing on using models trained exclusively on normal samples to identify and locate anomalous images during testing. While many image anomaly detection (AD) models exhibit remarkable performance on industrial datasets, they frequently struggle to effectively detect anomalies in medical datasets characterized by complex distributions. This is because most methods are prone to overgeneralization, resulting in ineffective restoration of normal image features. To tackle this challenge, we introduce a novel approach called Enhanced Skip Connection-Based Direct Reverse Knowledge Distillation (ESC-DRKD), specifically designed to facilitate anomaly detection and localization in medical images. ESC-DRKD consists of pre-trained teacher encoders, trainable projection layers, and student decoders. Firstly, pseudo abnormal images are generated using the CutPaste method. By extracting multi-scale features from both normal and pseudo abnormal images through the pretrained teacher encoders, projection layers are employed to project the features of pseudo abnormal images onto important information of normal features. These projected features are then added to the outputs of corresponding student decoders at each level through skip connections to enhance the restoration of normal features. Furthermore, the output of the final layer of the teacher encoders serves as the input to the student decoders, thereby preventing the loss of normal information. Extensive experiments on various public medical datasets are tested the effectiveness of our proposed method. The results demonstrate that ESC-DRKD outperforms the state-of-the-art AD models on the five medical datasets, achieving an average improvement of over 3.0 % AUROC for anomaly detection. Code is available at <span><span>https://github.com/GE-123-cpu/ESC_DRKD</span><svg><path></path></svg></span></div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"651 \",\"pages\":\"Article 130994\"},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2025-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231225016662\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225016662","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
ESC-DRKD: Enhanced skip connection-based direct reverse knowledge distillation for medical image anomaly detection
Image anomaly detection has emerged as a prevalent research area in medical diagnosis, focusing on using models trained exclusively on normal samples to identify and locate anomalous images during testing. While many image anomaly detection (AD) models exhibit remarkable performance on industrial datasets, they frequently struggle to effectively detect anomalies in medical datasets characterized by complex distributions. This is because most methods are prone to overgeneralization, resulting in ineffective restoration of normal image features. To tackle this challenge, we introduce a novel approach called Enhanced Skip Connection-Based Direct Reverse Knowledge Distillation (ESC-DRKD), specifically designed to facilitate anomaly detection and localization in medical images. ESC-DRKD consists of pre-trained teacher encoders, trainable projection layers, and student decoders. Firstly, pseudo abnormal images are generated using the CutPaste method. By extracting multi-scale features from both normal and pseudo abnormal images through the pretrained teacher encoders, projection layers are employed to project the features of pseudo abnormal images onto important information of normal features. These projected features are then added to the outputs of corresponding student decoders at each level through skip connections to enhance the restoration of normal features. Furthermore, the output of the final layer of the teacher encoders serves as the input to the student decoders, thereby preventing the loss of normal information. Extensive experiments on various public medical datasets are tested the effectiveness of our proposed method. The results demonstrate that ESC-DRKD outperforms the state-of-the-art AD models on the five medical datasets, achieving an average improvement of over 3.0 % AUROC for anomaly detection. Code is available at https://github.com/GE-123-cpu/ESC_DRKD
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.