{"title":"利用不确定性指导和边界知识精馏增强三维多器官分割","authors":"Xiangchun Yu, Longjun Ding, Tianqi Wu, Dingwen Zhang","doi":"10.1016/j.jvcir.2025.104574","DOIUrl":null,"url":null,"abstract":"<div><div>We propose the Uncertainty Guidance and Boundary Knowledge Distillation (UGBKD) framework for enhancing 3D multi-organ segmentation performance of student networks. UGBKD integrates three strategies: uncertainty-guided knowledge distillation, learning difficulty mining mechanism, and boundary knowledge distillation. The teacher-student distillation is adeptly guided by leveraging estimated uncertainty and the learning difficulty mining mechanism. Boundary knowledge distillation further alleviates blurred boundary challenges. Initially, a pre-trained denoising autoencoder DAE with anatomical perception priors is employed to estimate prediction uncertainty, and the uncertainty guided strategy promotes consistent knowledge transfer from the teacher. Subsequently, the learning difficulty mining mechanism focuses on difficult areas for the student. Lastly, boundary knowledge distillation extracts and transfers crucial boundary information to enhance the student’s boundary perception. Extensive experiments on WORD and BTCV datasets validate our proposed method’s effectiveness in improving segmentation accuracy and robustness. Code is available at <span><span>https://github.com/wutianqi-Learning/UGBKD</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"112 ","pages":"Article 104574"},"PeriodicalIF":3.1000,"publicationDate":"2025-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing 3D multi-organ segmentation via uncertainty guidance and boundary knowledge distillation\",\"authors\":\"Xiangchun Yu, Longjun Ding, Tianqi Wu, Dingwen Zhang\",\"doi\":\"10.1016/j.jvcir.2025.104574\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>We propose the Uncertainty Guidance and Boundary Knowledge Distillation (UGBKD) framework for enhancing 3D multi-organ segmentation performance of student networks. UGBKD integrates three strategies: uncertainty-guided knowledge distillation, learning difficulty mining mechanism, and boundary knowledge distillation. The teacher-student distillation is adeptly guided by leveraging estimated uncertainty and the learning difficulty mining mechanism. Boundary knowledge distillation further alleviates blurred boundary challenges. Initially, a pre-trained denoising autoencoder DAE with anatomical perception priors is employed to estimate prediction uncertainty, and the uncertainty guided strategy promotes consistent knowledge transfer from the teacher. Subsequently, the learning difficulty mining mechanism focuses on difficult areas for the student. Lastly, boundary knowledge distillation extracts and transfers crucial boundary information to enhance the student’s boundary perception. Extensive experiments on WORD and BTCV datasets validate our proposed method’s effectiveness in improving segmentation accuracy and robustness. Code is available at <span><span>https://github.com/wutianqi-Learning/UGBKD</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":54755,\"journal\":{\"name\":\"Journal of Visual Communication and Image Representation\",\"volume\":\"112 \",\"pages\":\"Article 104574\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-08-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Visual Communication and Image Representation\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1047320325001889\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325001889","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Enhancing 3D multi-organ segmentation via uncertainty guidance and boundary knowledge distillation
We propose the Uncertainty Guidance and Boundary Knowledge Distillation (UGBKD) framework for enhancing 3D multi-organ segmentation performance of student networks. UGBKD integrates three strategies: uncertainty-guided knowledge distillation, learning difficulty mining mechanism, and boundary knowledge distillation. The teacher-student distillation is adeptly guided by leveraging estimated uncertainty and the learning difficulty mining mechanism. Boundary knowledge distillation further alleviates blurred boundary challenges. Initially, a pre-trained denoising autoencoder DAE with anatomical perception priors is employed to estimate prediction uncertainty, and the uncertainty guided strategy promotes consistent knowledge transfer from the teacher. Subsequently, the learning difficulty mining mechanism focuses on difficult areas for the student. Lastly, boundary knowledge distillation extracts and transfers crucial boundary information to enhance the student’s boundary perception. Extensive experiments on WORD and BTCV datasets validate our proposed method’s effectiveness in improving segmentation accuracy and robustness. Code is available at https://github.com/wutianqi-Learning/UGBKD.
期刊介绍:
The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.