Toward efficient testing of graph neural networks via test input prioritization

IF 3.1 2区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Automated Software Engineering Pub Date : 2025-10-07 DOI:10.1007/s10515-025-00554-0

Lichen Yang, Qiang Wang, Zhonghao Yang, Daojing He, Yu Li

{"title":"Toward efficient testing of graph neural networks via test input prioritization","authors":"Lichen Yang, Qiang Wang, Zhonghao Yang, Daojing He, Yu Li","doi":"10.1007/s10515-025-00554-0","DOIUrl":null,"url":null,"abstract":"<div><p>Graph Neural Networks (GNNs) have demonstrated remarkable efficacy in handling graph-structured data; however, they exhibit failures after deployment, which can cause severe consequences. Hence, conducting thorough testing before deployment becomes imperative to ensure the reliability of GNNs. However, thorough testing requires numerous manually annotated test data. To mitigate the annotation cost, strategically prioritizing and labeling high-quality unlabeled inputs for testing becomes crucial, which facilitates uncovering more model failures with a limited labeling budget. Unfortunately, existing test input prioritization techniques either overlook the valuable information contained in graph structures or are overly reliant on attributes extracted from the target model, <i>i.e., model-aware attributes</i>, whose quality can vary significantly. To address these issues, we propose a novel test input prioritization framework, named <i>GraphRank</i>, for GNNs. GraphRank introduces model-agnostic attributes to compensate for the limitations of the model-aware ones. It also leverages the graph structure information to aggregate attributes from neighboring nodes, thereby enhancing the model-aware and model-agnostic attributes. Furthermore, GraphRank combines the above attributes with a binary classifier, using it as a ranking model to prioritize inputs. This classifier undergoes iterative training, which enables it to learn from each round’s feedback and improve its performance accordingly. Extensive experiments demonstrate GraphRank’s superiority over existing techniques.</p></div>","PeriodicalId":55414,"journal":{"name":"Automated Software Engineering","volume":"33 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automated Software Engineering","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10515-025-00554-0","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Graph Neural Networks (GNNs) have demonstrated remarkable efficacy in handling graph-structured data; however, they exhibit failures after deployment, which can cause severe consequences. Hence, conducting thorough testing before deployment becomes imperative to ensure the reliability of GNNs. However, thorough testing requires numerous manually annotated test data. To mitigate the annotation cost, strategically prioritizing and labeling high-quality unlabeled inputs for testing becomes crucial, which facilitates uncovering more model failures with a limited labeling budget. Unfortunately, existing test input prioritization techniques either overlook the valuable information contained in graph structures or are overly reliant on attributes extracted from the target model, i.e., model-aware attributes, whose quality can vary significantly. To address these issues, we propose a novel test input prioritization framework, named GraphRank, for GNNs. GraphRank introduces model-agnostic attributes to compensate for the limitations of the model-aware ones. It also leverages the graph structure information to aggregate attributes from neighboring nodes, thereby enhancing the model-aware and model-agnostic attributes. Furthermore, GraphRank combines the above attributes with a binary classifier, using it as a ranking model to prioritize inputs. This classifier undergoes iterative training, which enables it to learn from each round’s feedback and improve its performance accordingly. Extensive experiments demonstrate GraphRank’s superiority over existing techniques.

Abstract Image

查看原文本刊更多论文

基于测试输入优先级的图神经网络高效测试研究

图神经网络（gnn）在处理图结构数据方面表现出显著的有效性；然而，它们在部署后会出现故障，这可能会导致严重的后果。因此，在部署前进行彻底的测试是确保gnn可靠性的必要条件。然而，彻底的测试需要大量手工注释的测试数据。为了降低标注成本，策略性地对高质量的未标注输入进行优先级排序和标注变得至关重要，这有助于在有限的标注预算下发现更多的模型故障。不幸的是，现有的测试输入优先级技术要么忽略了图结构中包含的有价值的信息，要么过度依赖于从目标模型中提取的属性，即模型感知属性，其质量可能变化很大。为了解决这些问题，我们提出了一个新的测试输入优先级框架，名为GraphRank，用于gnn。GraphRank引入了与模型无关的属性，以弥补模型感知属性的局限性。它还利用图结构信息来聚合来自相邻节点的属性，从而增强了模型感知和模型不可知的属性。此外，GraphRank将上述属性与一个二元分类器结合起来，使用它作为排序模型对输入进行优先排序。该分类器经过迭代训练，使其能够从每一轮的反馈中学习，并相应地提高其性能。大量的实验证明了GraphRank优于现有技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Automated Software Engineering 工程技术-计算机：软件工程

CiteScore

4.80

自引率

11.80%

发文量

审稿时长

>12 weeks

期刊介绍： This journal details research, tutorial papers, survey and accounts of significant industrial experience in the foundations, techniques, tools and applications of automated software engineering technology. This includes the study of techniques for constructing, understanding, adapting, and modeling software artifacts and processes. Coverage in Automated Software Engineering examines both automatic systems and collaborative systems as well as computational models of human software engineering activities. In addition, it presents knowledge representations and artificial intelligence techniques applicable to automated software engineering, and formal techniques that support or provide theoretical foundations. The journal also includes reviews of books, software, conferences and workshops.