Efficient perturbation-aware distinguishing score for zero-shot neural architecture search

IF 7.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Soft Computing Pub Date : 2025-07-17 DOI:10.1016/j.asoc.2025.113447

Junhao Huang , Bing Xue , Yanan Sun , Mengjie Zhang , Gary G. Yen

{"title":"Efficient perturbation-aware distinguishing score for zero-shot neural architecture search","authors":"Junhao Huang , Bing Xue , Yanan Sun , Mengjie Zhang , Gary G. Yen","doi":"10.1016/j.asoc.2025.113447","DOIUrl":null,"url":null,"abstract":"<div><div>Zero-cost proxies are under the spotlight of neural architecture search (NAS) lately, thanks to their low computational cost in predicting architecture performance in a training-free manner. The NASWOT score is one of the representative proxies that measures the architecture’s ability to distinguish inputs at the activation layers. However, obtaining such a score still requires considerable calculations on a large kernel matrix about input similarity. Moreover, the NASWOT score is relatively coarse-grained and provides a rough estimation of the architecture’s ability to distinguish general inputs. In this paper, to further reduce the computational complexity, we first propose a simplified NASWOT scoring term by relaxing its original matrix-based calculation into a vector-based one. More importantly, we develop a fine-grained perturbation-aware term to measure how well the architecture can distinguish between inputs and their perturbed counterparts. We propose a layer-wise score multiplication approach to combine these two scoring terms, deriving a new proxy, named efficient perturbation-aware distinguishing score (ePADS). Experiments on various NAS spaces and datasets show that ePADS consistently outperforms other zero-cost proxies in terms of both predictive reliability and efficiency. Particularly, ePADS achieves the highest ranking correlation among the advanced competitors (e.g., Kendall’s coefficient of 0.620 on NAS-Bench-201 with ImageNet-16-120 and 0.485 on NDS-ENAS), and ePADS-based random architecture search spends only 0.018 GPU days on DARTS-CIFAR to find networks with an average error rate of 2.64%.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"182 ","pages":"Article 113447"},"PeriodicalIF":7.2000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625007586","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Zero-cost proxies are under the spotlight of neural architecture search (NAS) lately, thanks to their low computational cost in predicting architecture performance in a training-free manner. The NASWOT score is one of the representative proxies that measures the architecture’s ability to distinguish inputs at the activation layers. However, obtaining such a score still requires considerable calculations on a large kernel matrix about input similarity. Moreover, the NASWOT score is relatively coarse-grained and provides a rough estimation of the architecture’s ability to distinguish general inputs. In this paper, to further reduce the computational complexity, we first propose a simplified NASWOT scoring term by relaxing its original matrix-based calculation into a vector-based one. More importantly, we develop a fine-grained perturbation-aware term to measure how well the architecture can distinguish between inputs and their perturbed counterparts. We propose a layer-wise score multiplication approach to combine these two scoring terms, deriving a new proxy, named efficient perturbation-aware distinguishing score (ePADS). Experiments on various NAS spaces and datasets show that ePADS consistently outperforms other zero-cost proxies in terms of both predictive reliability and efficiency. Particularly, ePADS achieves the highest ranking correlation among the advanced competitors (e.g., Kendall’s coefficient of 0.620 on NAS-Bench-201 with ImageNet-16-120 and 0.485 on NDS-ENAS), and ePADS-based random architecture search spends only 0.018 GPU days on DARTS-CIFAR to find networks with an average error rate of 2.64%.

Abstract Image

查看原文本刊更多论文

零点神经结构搜索的有效摄动感知区分分数

近年来，零成本代理以其无需训练的低计算成本预测体系结构性能，成为神经结构搜索（NAS）的研究热点。NASWOT分数是衡量体系结构区分激活层输入的能力的代表性代理之一。然而，获得这样的分数仍然需要对输入相似度的大型核矩阵进行大量计算。此外，NASWOT分数是相对粗粒度的，并提供了对体系结构区分一般输入的能力的粗略估计。在本文中，为了进一步降低计算复杂度，我们首先提出了一个简化的NASWOT评分项，将其原始的基于矩阵的计算简化为基于向量的计算。更重要的是，我们开发了一个细粒度的扰动感知项来衡量架构如何区分输入和它们的扰动对应项。我们提出了一种分层分数乘法方法，将这两个评分项结合起来，得到一个新的代理，称为有效扰动感知区分分数（ePADS）。在各种NAS空间和数据集上的实验表明，ePADS在预测可靠性和效率方面始终优于其他零成本代理。特别是，ePADS在先进的竞争对手中实现了最高的排名相关性（例如，使用ImageNet-16-120的NAS-Bench-201上的Kendall系数为0.620，在NDS-ENAS上的Kendall系数为0.485），基于ePADS的随机架构搜索在DARTS-CIFAR上仅花费0.018 GPU天就能找到平均错误率为2.64%的网络。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.