Positive Sample Mining: Fuzzy Threshold-Based Contrastive Learning for Enhanced Unsupervised Skeleton-Based Action Recognition

Hengsheng Xu;Jianqi Zhong;Deliang Lian;Hanxu Hou;Wenming Cao
{"title":"Positive Sample Mining: Fuzzy Threshold-Based Contrastive Learning for Enhanced Unsupervised Skeleton-Based Action Recognition","authors":"Hengsheng Xu;Jianqi Zhong;Deliang Lian;Hanxu Hou;Wenming Cao","doi":"10.1109/TAI.2025.3531831","DOIUrl":null,"url":null,"abstract":"Contrastive learning is one of the fundamental paradigms for unsupervised 3-D skeleton-based action recognition. Existing contrastive learning paradigms typically enhance model discrimination by increasing the distance between different action samples in the feature space. However, this approach can inadvertently lead to an increase in the intraclass distance for the same action category, thereby affecting the effectiveness of action recognition. To address this issue, we introduce an innovative unsupervised framework named fuzzy threshold-based contrastive learning (FTCL). This novel approach leverages the concept of fuzzy thresholds to handle sample partitioning within the feature space. In essence, given a dataset of human actions, we distinguish different action samples as “negative samples” and identical action samples as “positive samples.” By analyzing the similarity distribution between these positive and negative samples, we apply the principles of fuzzy thresholds to evaluate the attributes of the negative samples. This refined evaluation facilitates a judicious reassignment of positive and negative sample classifications, thus circumventing the challenges associated with increased intraclass distances. Furthermore, to obtain better action representations from skeleton data, we model and contrast skeleton data from different spatiotemporal perspectives, capturing rich spatiotemporal information in the feature representation of actions. Extensive experiments on the NTU-60, NTU-120, and PKU-MMD datasets were conducted to validate our proposed FTCL. The experimental results demonstrate that our approach achieves significant improvements.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 7","pages":"1918-1931"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10848264/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Contrastive learning is one of the fundamental paradigms for unsupervised 3-D skeleton-based action recognition. Existing contrastive learning paradigms typically enhance model discrimination by increasing the distance between different action samples in the feature space. However, this approach can inadvertently lead to an increase in the intraclass distance for the same action category, thereby affecting the effectiveness of action recognition. To address this issue, we introduce an innovative unsupervised framework named fuzzy threshold-based contrastive learning (FTCL). This novel approach leverages the concept of fuzzy thresholds to handle sample partitioning within the feature space. In essence, given a dataset of human actions, we distinguish different action samples as “negative samples” and identical action samples as “positive samples.” By analyzing the similarity distribution between these positive and negative samples, we apply the principles of fuzzy thresholds to evaluate the attributes of the negative samples. This refined evaluation facilitates a judicious reassignment of positive and negative sample classifications, thus circumventing the challenges associated with increased intraclass distances. Furthermore, to obtain better action representations from skeleton data, we model and contrast skeleton data from different spatiotemporal perspectives, capturing rich spatiotemporal information in the feature representation of actions. Extensive experiments on the NTU-60, NTU-120, and PKU-MMD datasets were conducted to validate our proposed FTCL. The experimental results demonstrate that our approach achieves significant improvements.
正样本挖掘:基于模糊阈值的增强无监督骨架动作识别的对比学习
对比学习是无监督三维骨架动作识别的基本范式之一。现有的对比学习范式通常通过增加特征空间中不同动作样本之间的距离来增强模型辨别能力。然而,这种方法可能会无意中导致同一动作类别的类内距离增加,从而影响动作识别的有效性。为了解决这个问题,我们引入了一种创新的无监督框架,称为模糊阈值对比学习(FTCL)。这种新方法利用模糊阈值的概念来处理特征空间内的样本划分。本质上,给定一个人类行为的数据集,我们将不同的行为样本区分为“负样本”,将相同的行为样本区分为“正样本”。通过分析正负样本之间的相似度分布,应用模糊阈值原理对负样本的属性进行评价。这种精细的评估有助于对阳性和阴性样本分类进行明智的重新分配,从而规避与类内距离增加相关的挑战。此外,为了从骨骼数据中获得更好的动作表示,我们从不同的时空角度对骨骼数据进行建模和对比,在动作的特征表示中捕获丰富的时空信息。在NTU-60、NTU-120和PKU-MMD数据集上进行了广泛的实验来验证我们提出的FTCL。实验结果表明,我们的方法取得了显著的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.70
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信