{"title":"Dynamically Scaled Temperature in Self-Supervised Contrastive Learning","authors":"Siladittya Manna;Soumitri Chattopadhyay;Rakesh Dey;Umapada Pal;Saumik Bhattacharya","doi":"10.1109/TAI.2024.3524979","DOIUrl":null,"url":null,"abstract":"In contemporary self-supervised contrastive algorithms such as SimCLR and MoCo, the task of balancing attraction between two semantically similar samples and repulsion between two samples of different classes is primarily affected by the presence of hard negative samples. While the InfoNCE loss has been shown to impose penalties based on hardness, the temperature hyperparameter is the key to regulate the penalties and the tradeoff between uniformity and tolerance. In this work, we focus our attention on improving the performance of InfoNCE loss in self-supervised learning by proposing a novel cosine similarity dependent temperature scaling function to effectively optimize the distribution of the samples in the feature space. We also provide mathematical analyzes to support the construction of such a dynamically scaled temperature function. Experimental evidence shows that the proposed framework outperforms the contrastive loss-based SSL algorithms.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 6","pages":"1502-1512"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10820841/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
In contemporary self-supervised contrastive algorithms such as SimCLR and MoCo, the task of balancing attraction between two semantically similar samples and repulsion between two samples of different classes is primarily affected by the presence of hard negative samples. While the InfoNCE loss has been shown to impose hardness-based penalties, the temperature hyperparameter is key to regulating these penalties and the tradeoff between uniformity and tolerance. In this work, we focus on improving the performance of the InfoNCE loss in self-supervised learning by proposing a novel cosine-similarity-dependent temperature scaling function that effectively optimizes the distribution of samples in the feature space. We also provide mathematical analyses to support the construction of such a dynamically scaled temperature function. Experimental evidence shows that the proposed framework outperforms contrastive-loss-based SSL algorithms.
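To make the idea concrete, below is a minimal PyTorch sketch of an InfoNCE loss whose temperature is scaled per pair by cosine similarity. The specific scaling form tau = tau_base * (1 + alpha * sim) and the hyperparameters tau_base and alpha are illustrative assumptions for this sketch; the paper's actual scaling function may differ.

```python
import torch
import torch.nn.functional as F

def dynamic_temperature_infonce(z1, z2, tau_base=0.1, alpha=0.5):
    """InfoNCE loss with a per-pair, cosine-similarity-dependent temperature.

    z1, z2: (N, D) embeddings of two augmented views of the same N samples.
    The scaling tau = tau_base * (1 + alpha * sim) is an assumed form, kept
    strictly positive here because sim lies in [-1, 1] and alpha < 1.
    """
    z1 = F.normalize(z1, dim=1)
    z2 = F.normalize(z2, dim=1)
    z = torch.cat([z1, z2], dim=0)          # (2N, D) stacked views
    sim = z @ z.t()                          # pairwise cosine similarities
    tau = tau_base * (1.0 + alpha * sim)     # dynamically scaled temperature (assumed form)
    logits = sim / tau.detach()              # detach: no gradient through the temperature itself
    N = z1.size(0)
    # Mask self-similarities so a sample is never its own negative.
    mask = torch.eye(2 * N, dtype=torch.bool, device=z.device)
    logits = logits.masked_fill(mask, float('-inf'))
    # Positives: row i in view 1 pairs with row i in view 2, and vice versa.
    targets = torch.cat([torch.arange(N, 2 * N), torch.arange(0, N)]).to(z.device)
    return F.cross_entropy(logits, targets)
```

Detaching tau in this sketch keeps gradients flowing only through the similarity logits, so the dynamic temperature acts as a hardness-dependent reweighting of penalties rather than a learned parameter, matching the role the abstract assigns to temperature scaling.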