Limai Jiang , Ruitao Xie , Bokai Yang , Juan He , Huazhen Huang , Yi Pan , Yunpeng Cai
{"title":"Weakly supervised lesion localization and attribution for OCT images with a guided counterfactual explainer model","authors":"Limai Jiang , Ruitao Xie , Bokai Yang , Juan He , Huazhen Huang , Yi Pan , Yunpeng Cai","doi":"10.1016/j.eswa.2025.128129","DOIUrl":null,"url":null,"abstract":"<div><div>Lesion localization plays an important role in computer aided diagnosis. Due to the lacking of lesion annotations, weakly supervised methods using only image-level annotations are demanded for a wide variety of applications, especially optical coherence tomography diagnosis. Most weakly-supervised methods rely on attribution analysis. However, current methods suffer from imprecise attributions and lead to poor localization quality. To address this problem, we introduce a lesion localization method based on a new explainable AI approach, termed Optical Coherence Tomography Class Association Embedding (OCT-CAE), leverages image-level annotations and a cycle generative adversarial network with subspace decomposition to fuse global knowledge and enable counterfactual generation. Each sample is encoded into a pair of subspaces where a low-dimensional common subspace is created to embed manifold of classification-related information and an individual subspace to embed individual-specific information. With the trained OCT-CAE, the codes in the two subspaces can be freely recombined to generate realistic images. These generated images retain the class-related features defined by the common subspace while preserving individual-specific characteristics from the individual subspace. Lesion localization is achieved by altering the common code to induce a class-flip in the generated images. By comparing the modified and original images, we can identify lesion regions without requiring regional annotations. Extensive experiments on two publicly available datasets demonstrate that OCT-CAE effectively disentangles latent information space in images, achieving state-of-the-art performance. Our code is available at <span><span>https://github.com/lcmmai/OCT-CAE</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"287 ","pages":"Article 128129"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425017403","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Lesion localization plays an important role in computer aided diagnosis. Due to the lacking of lesion annotations, weakly supervised methods using only image-level annotations are demanded for a wide variety of applications, especially optical coherence tomography diagnosis. Most weakly-supervised methods rely on attribution analysis. However, current methods suffer from imprecise attributions and lead to poor localization quality. To address this problem, we introduce a lesion localization method based on a new explainable AI approach, termed Optical Coherence Tomography Class Association Embedding (OCT-CAE), leverages image-level annotations and a cycle generative adversarial network with subspace decomposition to fuse global knowledge and enable counterfactual generation. Each sample is encoded into a pair of subspaces where a low-dimensional common subspace is created to embed manifold of classification-related information and an individual subspace to embed individual-specific information. With the trained OCT-CAE, the codes in the two subspaces can be freely recombined to generate realistic images. These generated images retain the class-related features defined by the common subspace while preserving individual-specific characteristics from the individual subspace. Lesion localization is achieved by altering the common code to induce a class-flip in the generated images. By comparing the modified and original images, we can identify lesion regions without requiring regional annotations. Extensive experiments on two publicly available datasets demonstrate that OCT-CAE effectively disentangles latent information space in images, achieving state-of-the-art performance. Our code is available at https://github.com/lcmmai/OCT-CAE.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.