Saba Heidari Gheshlaghi , Milan Aryal , Nasim Yahya Soltani , Masoud Ganji
{"title":"数字病理学中分布外鲁棒性的可解释性图增强","authors":"Saba Heidari Gheshlaghi , Milan Aryal , Nasim Yahya Soltani , Masoud Ganji","doi":"10.1016/j.knosys.2025.113640","DOIUrl":null,"url":null,"abstract":"<div><div>Whole slide images (WSIs), which are high-resolution digital representations of tissue samples, present significant challenges for processing because of their gigapixel scale. Recent studies show that graph neural networks (GNNs), which leverage neighborhood information, can enhance cancer classification accuracy in WSIs. However, GNN performance is affected by out-of-distribution (OOD) data, which occurs when the training and testing data are from different sources. Detecting OOD samples in graph data is especially challenging due to its complexity, which makes GNNs vulnerable to performance degradation. To address this issue, we propose a novel data augmentation framework to improve GNN robustness against OOD samples. Our approach augments node features by sampling important subgraphs, simulating potential OOD scenarios during training. Experiments on three public WSI datasets demonstrate significant improvements in graph classification tasks on OOD samples. In this work, one dataset serves as the in-distribution benchmark, while the others represent OOD scenarios. These results highlight the potential of data augmentation to enhance GNN robustness against OOD samples, improving cancer classification in WSIs.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"320 ","pages":"Article 113640"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Explainability-based graph augmentation for out-of-distribution robustness in digital pathology\",\"authors\":\"Saba Heidari Gheshlaghi , Milan Aryal , Nasim Yahya Soltani , Masoud Ganji\",\"doi\":\"10.1016/j.knosys.2025.113640\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Whole slide images (WSIs), which are high-resolution digital representations of tissue samples, present significant challenges for processing because of their gigapixel scale. Recent studies show that graph neural networks (GNNs), which leverage neighborhood information, can enhance cancer classification accuracy in WSIs. However, GNN performance is affected by out-of-distribution (OOD) data, which occurs when the training and testing data are from different sources. Detecting OOD samples in graph data is especially challenging due to its complexity, which makes GNNs vulnerable to performance degradation. To address this issue, we propose a novel data augmentation framework to improve GNN robustness against OOD samples. Our approach augments node features by sampling important subgraphs, simulating potential OOD scenarios during training. Experiments on three public WSI datasets demonstrate significant improvements in graph classification tasks on OOD samples. In this work, one dataset serves as the in-distribution benchmark, while the others represent OOD scenarios. These results highlight the potential of data augmentation to enhance GNN robustness against OOD samples, improving cancer classification in WSIs.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"320 \",\"pages\":\"Article 113640\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705125006860\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125006860","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Explainability-based graph augmentation for out-of-distribution robustness in digital pathology
Whole slide images (WSIs), which are high-resolution digital representations of tissue samples, present significant challenges for processing because of their gigapixel scale. Recent studies show that graph neural networks (GNNs), which leverage neighborhood information, can enhance cancer classification accuracy in WSIs. However, GNN performance is affected by out-of-distribution (OOD) data, which occurs when the training and testing data are from different sources. Detecting OOD samples in graph data is especially challenging due to its complexity, which makes GNNs vulnerable to performance degradation. To address this issue, we propose a novel data augmentation framework to improve GNN robustness against OOD samples. Our approach augments node features by sampling important subgraphs, simulating potential OOD scenarios during training. Experiments on three public WSI datasets demonstrate significant improvements in graph classification tasks on OOD samples. In this work, one dataset serves as the in-distribution benchmark, while the others represent OOD scenarios. These results highlight the potential of data augmentation to enhance GNN robustness against OOD samples, improving cancer classification in WSIs.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.