{"title":"用于无监督图匹配的对比学习网络","authors":"Yu Xie;Lianhang Luo;Tianpei Cao;Bin Yu;A. K. Qin","doi":"10.1109/TCSVT.2024.3457575","DOIUrl":null,"url":null,"abstract":"Graph matching aims to establish node correspondences between graphs, which is a classic combinatorial optimization problem. In recent years, (deep) learning-based methods have emerged as a superior alternative to traditional graph matching solvers. However, these methods typically rely on node-level correspondence labels, which can be prohibitively expensive or unrealistic. Inspired by contrastive learning that is a prevalent paradigm for self-supervised representation learning, we develop a Contrastive Learning Network for Unsupervised Graph Matching (CUGM), which is an end-to-end differentiable pipeline to learn node permutations. Specifically, we propose three-level augmentation including raw image augmentation, graph augmentation and model augmentation for generating diverse enough contrastive views to enrich training instances. Then a contrastive learning network is constructed to capture the higher-order structural information in graphs and learn the final node representations for yielding the affinity matrix to directly solve a linear assignment problem. More importantly, we propose a node-level contrastive loss with false negative cancellation for optimizing the whole network to extract the tailored node feature representations to improve graph matching accuracy. Experimental results on standard graph matching benchmarks demonstrate that our end-to-end unsupervised method achieves the competitive performance compared with state-of-the-art supervised and unsupervised graph matching methods.","PeriodicalId":13082,"journal":{"name":"IEEE Transactions on Circuits and Systems for Video Technology","volume":"35 1","pages":"643-656"},"PeriodicalIF":8.3000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Contrastive Learning Network for Unsupervised Graph Matching\",\"authors\":\"Yu Xie;Lianhang Luo;Tianpei Cao;Bin Yu;A. K. Qin\",\"doi\":\"10.1109/TCSVT.2024.3457575\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Graph matching aims to establish node correspondences between graphs, which is a classic combinatorial optimization problem. In recent years, (deep) learning-based methods have emerged as a superior alternative to traditional graph matching solvers. However, these methods typically rely on node-level correspondence labels, which can be prohibitively expensive or unrealistic. Inspired by contrastive learning that is a prevalent paradigm for self-supervised representation learning, we develop a Contrastive Learning Network for Unsupervised Graph Matching (CUGM), which is an end-to-end differentiable pipeline to learn node permutations. Specifically, we propose three-level augmentation including raw image augmentation, graph augmentation and model augmentation for generating diverse enough contrastive views to enrich training instances. Then a contrastive learning network is constructed to capture the higher-order structural information in graphs and learn the final node representations for yielding the affinity matrix to directly solve a linear assignment problem. More importantly, we propose a node-level contrastive loss with false negative cancellation for optimizing the whole network to extract the tailored node feature representations to improve graph matching accuracy. Experimental results on standard graph matching benchmarks demonstrate that our end-to-end unsupervised method achieves the competitive performance compared with state-of-the-art supervised and unsupervised graph matching methods.\",\"PeriodicalId\":13082,\"journal\":{\"name\":\"IEEE Transactions on Circuits and Systems for Video Technology\",\"volume\":\"35 1\",\"pages\":\"643-656\"},\"PeriodicalIF\":8.3000,\"publicationDate\":\"2024-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Circuits and Systems for Video Technology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10671578/\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Circuits and Systems for Video Technology","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10671578/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Contrastive Learning Network for Unsupervised Graph Matching
Graph matching aims to establish node correspondences between graphs, which is a classic combinatorial optimization problem. In recent years, (deep) learning-based methods have emerged as a superior alternative to traditional graph matching solvers. However, these methods typically rely on node-level correspondence labels, which can be prohibitively expensive or unrealistic. Inspired by contrastive learning that is a prevalent paradigm for self-supervised representation learning, we develop a Contrastive Learning Network for Unsupervised Graph Matching (CUGM), which is an end-to-end differentiable pipeline to learn node permutations. Specifically, we propose three-level augmentation including raw image augmentation, graph augmentation and model augmentation for generating diverse enough contrastive views to enrich training instances. Then a contrastive learning network is constructed to capture the higher-order structural information in graphs and learn the final node representations for yielding the affinity matrix to directly solve a linear assignment problem. More importantly, we propose a node-level contrastive loss with false negative cancellation for optimizing the whole network to extract the tailored node feature representations to improve graph matching accuracy. Experimental results on standard graph matching benchmarks demonstrate that our end-to-end unsupervised method achieves the competitive performance compared with state-of-the-art supervised and unsupervised graph matching methods.
期刊介绍:
The IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) is dedicated to covering all aspects of video technologies from a circuits and systems perspective. We encourage submissions of general, theoretical, and application-oriented papers related to image and video acquisition, representation, presentation, and display. Additionally, we welcome contributions in areas such as processing, filtering, and transforms; analysis and synthesis; learning and understanding; compression, transmission, communication, and networking; as well as storage, retrieval, indexing, and search. Furthermore, papers focusing on hardware and software design and implementation are highly valued. Join us in advancing the field of video technology through innovative research and insights.