On the effectiveness of hybrid pooling in mixup-based graph learning for language processing

IF 3.7 2区计算机科学 Q1 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Journal of Systems and Software Pub Date : 2024-06-17 DOI:10.1016/j.jss.2024.112139

Zeming Dong , Qiang Hu , Zhenya Zhang , Yuejun Guo , Maxime Cordy , Mike Papadakis , Yves Le Traon , Jianjun Zhao

{"title":"On the effectiveness of hybrid pooling in mixup-based graph learning for language processing","authors":"Zeming Dong , Qiang Hu , Zhenya Zhang , Yuejun Guo , Maxime Cordy , Mike Papadakis , Yves Le Traon , Jianjun Zhao","doi":"10.1016/j.jss.2024.112139","DOIUrl":null,"url":null,"abstract":"<div>Graph neural network (GNN)-based graph learning has been popular in natural language and programming language processing, particularly in text and source code classification. Typically, GNNs are constructed by incorporating alternating layers which learn transformations of graph node features, along with graph pooling layers that use graph pooling operators (e.g., Max-pooling) to effectively reduce the number of nodes while preserving the semantic information of the graph. Recently, to enhance GNNs in graph learning tasks, Manifold-Mixup, a data augmentation technique that produces synthetic graph data by linearly mixing a pair of graph data and their labels, has been widely adopted. However, the performance of Manifold-Mixup can be highly affected by graph pooling operators, and there have not been many studies that are dedicated to uncovering such affection. To bridge this gap, we take an early step to explore how graph pooling operators affect the performance of Mixup-based graph learning. To that end, we conduct a comprehensive empirical study by applying Manifold-Mixup to a formal characterization of graph pooling based on 11 graph pooling operations (9 hybrid pooling operators, 2 non-hybrid pooling operators). The experimental results on both natural language datasets (Gossipcop, Politifact) and programming language datasets (JAVA250, Python800) demonstrate that hybrid pooling operators are more effective for Manifold-Mixup than the standard Max-pooling and the state-of-the-art graph multiset transformer (GMT) pooling, in terms of producing more accurate and robust GNN models.Editor’s note: Open Science material was validated by the Journal of Systems and Software Open Science Board.</div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":null,"pages":null},"PeriodicalIF":3.7000,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0164121224001845","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Graph neural network (GNN)-based graph learning has been popular in natural language and programming language processing, particularly in text and source code classification. Typically, GNNs are constructed by incorporating alternating layers which learn transformations of graph node features, along with graph pooling layers that use graph pooling operators (e.g., Max-pooling) to effectively reduce the number of nodes while preserving the semantic information of the graph. Recently, to enhance GNNs in graph learning tasks, Manifold-Mixup, a data augmentation technique that produces synthetic graph data by linearly mixing a pair of graph data and their labels, has been widely adopted. However, the performance of Manifold-Mixup can be highly affected by graph pooling operators, and there have not been many studies that are dedicated to uncovering such affection. To bridge this gap, we take an early step to explore how graph pooling operators affect the performance of Mixup-based graph learning. To that end, we conduct a comprehensive empirical study by applying Manifold-Mixup to a formal characterization of graph pooling based on 11 graph pooling operations (9 hybrid pooling operators, 2 non-hybrid pooling operators). The experimental results on both natural language datasets (Gossipcop, Politifact) and programming language datasets (JAVA250, Python800) demonstrate that hybrid pooling operators are more effective for Manifold-Mixup than the standard Max-pooling and the state-of-the-art graph multiset transformer (GMT) pooling, in terms of producing more accurate and robust GNN models.

Editor’s note: Open Science material was validated by the Journal of Systems and Software Open Science Board.

查看原文本刊更多论文

论混合池在基于混合的图学习语言处理中的有效性

基于图神经网络（GNN）的图学习在自然语言和编程语言处理领域，尤其是文本和源代码分类领域很受欢迎。通常情况下，图神经网络是通过交替层（学习图节点特征的变换）和图池层（使用图池运算符（如最大池化）有效减少节点数量，同时保留图的语义信息）来构建的。最近，为了增强 GNN 在图学习任务中的作用，Manifold-Mixup（一种通过线性混合一对图数据及其标签来生成合成图数据的数据增强技术）被广泛采用。然而，Manifold-Mixup 的性能可能会受到图池算子的严重影响，而专门揭示这种影响的研究并不多。为了弥补这一空白，我们率先探索了图池算子如何影响基于 Mixup 的图学习性能。为此，我们进行了一项全面的实证研究，将 Manifold-Mixup 应用于基于 11 个图池操作（9 个混合池操作，2 个非混合池操作）的图池正式表征。在自然语言数据集（Gossipcop、Politifact）和编程语言数据集（JAVA250、Python800）上的实验结果表明，与标准的最大池化（Max-pooling）和最先进的图多集变换器（GMT）池化相比，混合池化算子对 Manifold-Mixup 更为有效，能生成更准确、更健壮的 GNN 模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Systems and Software 工程技术-计算机：理论方法

CiteScore

8.60

自引率

5.70%

发文量

193

审稿时长

16 weeks

期刊介绍： The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to: • Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution • Agile, model-driven, service-oriented, open source and global software development • Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems • Human factors and management concerns of software development • Data management and big data issues of software systems • Metrics and evaluation, data mining of software development resources • Business and economic aspects of software development processes The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.