An Unbalanced Optimal Transport-Based Approach for Robust Dictionary Learning

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE transactions on neural networks and learning systems Pub Date : 2025-01-15 DOI:10.1109/TNNLS.2025.3526254

Shengjia Wang;Zhiguo Wang;Xi-Le Zhao;Xiaojing Shen

{"title":"An Unbalanced Optimal Transport-Based Approach for Robust Dictionary Learning","authors":"Shengjia Wang;Zhiguo Wang;Xi-Le Zhao;Xiaojing Shen","doi":"10.1109/TNNLS.2025.3526254","DOIUrl":null,"url":null,"abstract":"Dictionary learning (DL) is a pivotal task in machine learning and signal processing, involving extracting representative features from a given dataset. However, conventional DL models are known to be highly sensitive to outliers. To circumvent this issue, we introduce a new and robust DL model based on unbalanced optimal transport (UOT). Compared to DL models based on conventional robust distances and the Wasserstein distance, our model not only captures and leverages the structural information within the data but also demonstrates strong resilience to outliers. By employing the structure of the proposed robust DL model, we develop a novel hybrid block coordinate descent (BCD) algorithm. The proposed algorithm maintains computational tractability by exploiting special block structures of the subproblems. In addition, we establish the convergence of our algorithm without the Lipschitz smooth condition. Through extensive experimentation, we validate our theoretical results and demonstrate the effectiveness of the proposed method on synthetic data, MNIST data, Olivetti faces dataset, and hyperspectral images (HSIs) datasets.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"36 6","pages":"11149-11163"},"PeriodicalIF":8.9000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on neural networks and learning systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10843127/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Dictionary learning (DL) is a pivotal task in machine learning and signal processing, involving extracting representative features from a given dataset. However, conventional DL models are known to be highly sensitive to outliers. To circumvent this issue, we introduce a new and robust DL model based on unbalanced optimal transport (UOT). Compared to DL models based on conventional robust distances and the Wasserstein distance, our model not only captures and leverages the structural information within the data but also demonstrates strong resilience to outliers. By employing the structure of the proposed robust DL model, we develop a novel hybrid block coordinate descent (BCD) algorithm. The proposed algorithm maintains computational tractability by exploiting special block structures of the subproblems. In addition, we establish the convergence of our algorithm without the Lipschitz smooth condition. Through extensive experimentation, we validate our theoretical results and demonstrate the effectiveness of the proposed method on synthetic data, MNIST data, Olivetti faces dataset, and hyperspectral images (HSIs) datasets.

查看原文本刊更多论文

基于非平衡最优传输的鲁棒字典学习方法

字典学习（DL）是机器学习和信号处理中的一项关键任务，涉及从给定数据集中提取代表性特征。然而，传统的深度学习模型对异常值高度敏感。为了规避这一问题，我们引入了一种新的基于不平衡最优运输（UOT）的鲁棒深度学习模型。与基于传统鲁棒距离和Wasserstein距离的深度学习模型相比，我们的模型不仅捕获和利用了数据中的结构信息，而且对异常值表现出了很强的弹性。利用所提出的鲁棒深度学习模型的结构，我们开发了一种新的混合块坐标下降（BCD）算法。该算法通过利用子问题的特殊块结构来保持计算的可跟踪性。此外，我们还证明了该算法在没有Lipschitz光滑条件下的收敛性。通过大量的实验，我们验证了我们的理论结果，并证明了所提出的方法在合成数据、MNIST数据、Olivetti人脸数据集和高光谱图像（hsi）数据集上的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on neural networks and learning systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CiteScore

23.80

自引率

9.60%

发文量

2102

审稿时长

3-8 weeks

期刊介绍： The focus of IEEE Transactions on Neural Networks and Learning Systems is to present scholarly articles discussing the theory, design, and applications of neural networks as well as other learning systems. The journal primarily highlights technical and scientific research in this domain.