Context-Aware Optimal Transport Learning for Retinal Fundus Image Enhancement

Vamsi Krishna Vasa, Yujian Xiong, Peijie Qiu, Oana Dumitrascu, Wenhui Zhu, Yalin Wang
IEEE Winter Conference on Applications of Computer Vision (WACV), 2025, pp. 4016-4025.
DOI: 10.1109/wacv61041.2025.00395
Published: 2025-02-01 (EPub: 2025-04-08)
Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12337797/pdf/

Abstract

Retinal fundus photography offers a non-invasive way to diagnose and monitor a variety of retinal diseases, but it is prone to inherent quality glitches arising from systemic imperfections or operator/patient-related factors. High-quality retinal images, however, are crucial for accurate diagnoses and automated analyses. Fundus image enhancement is typically formulated as a distribution alignment problem, solved by finding a one-to-one mapping between a low-quality image and its high-quality counterpart. This paper proposes a context-informed optimal transport (OT) learning framework for tackling unpaired fundus image enhancement. In contrast to standard generative image enhancement methods, which struggle to handle contextual information (e.g., over-tampered local structures and unwanted artifacts), the proposed context-aware OT learning paradigm better preserves local structures and minimizes unwanted artifacts. Leveraging deep contextual features, we derive the proposed context-aware OT using the earth mover's distance and show that it has a solid theoretical guarantee. Experimental results on a large-scale dataset demonstrate the superiority of the proposed method over several state-of-the-art supervised and unsupervised methods in terms of signal-to-noise ratio, structural similarity index, and two downstream tasks. The code is available at https://github.com/Retinal-Research/Contextual-OT.
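The abstract's context-aware OT objective builds on the earth mover's distance between distributions of deep contextual features. As an illustrative sketch only (not the paper's implementation), the discrete earth mover's distance between two small sets of feature vectors can be computed by solving the classic optimal transport linear program; the toy feature vectors and uniform masses below are assumptions for demonstration:

```python
import numpy as np
from scipy.optimize import linprog

def emd(cost, a, b):
    """Earth mover's distance via the discrete OT linear program.

    cost[i, j] : cost of moving one unit of mass from source i to target j.
    a, b       : source/target probability masses (each sums to 1).
    """
    n, m = cost.shape
    # Decision variables: the transport plan T, flattened row-major (n*m entries).
    c = cost.ravel()
    rows = []
    # Row-marginal constraints: sum_j T[i, j] = a[i]
    for i in range(n):
        r = np.zeros(n * m)
        r[i * m:(i + 1) * m] = 1.0
        rows.append(r)
    # Column-marginal constraints: sum_i T[i, j] = b[j]
    for j in range(m):
        r = np.zeros(n * m)
        r[j::m] = 1.0
        rows.append(r)
    res = linprog(c, A_eq=np.array(rows), b_eq=np.concatenate([a, b]),
                  bounds=(0, None))
    return res.fun

# Toy "contextual features": two feature vectors per image patch (illustrative).
feat_lo = np.array([[0.0, 0.0], [1.0, 0.0]])   # low-quality patch features
feat_hi = np.array([[0.0, 1.0], [1.0, 1.0]])   # high-quality patch features
cost = np.linalg.norm(feat_lo[:, None, :] - feat_hi[None, :, :], axis=-1)
a = np.full(2, 0.5)
b = np.full(2, 0.5)
print(emd(cost, a, b))  # optimal plan matches each point to its neighbor: EMD = 1.0
```

In the paper's setting, the cost matrix would instead come from distances between deep network features of image regions, so that transport respects local context rather than raw pixel values.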
