A few-shot learning-based dual-input neural network for complex spectrogram recognition system with millimeter-wave radar

IF 4.6 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Complex & Intelligent Systems Pub Date : 2025-04-23 DOI:10.1007/s40747-025-01848-2

Kaiyu Chen, Shaoxi Wang, Wei Li, Yucheng Wang, Cunqian Feng, Yannian Zhou, Jian Cao, Binfeng Zong, Minming Gu

{"title":"A few-shot learning-based dual-input neural network for complex spectrogram recognition system with millimeter-wave radar","authors":"Kaiyu Chen, Shaoxi Wang, Wei Li, Yucheng Wang, Cunqian Feng, Yannian Zhou, Jian Cao, Binfeng Zong, Minming Gu","doi":"10.1007/s40747-025-01848-2","DOIUrl":null,"url":null,"abstract":"<p>Graph data-driven machine learning methods for human activity recognition (HAR) have achieved success recently using sufficient data. In the realm of everyday life, we encounter a notable challenge: the scarcity of labeled radar samples. This limitation is compounded by the stark disparities in data distribution between simulated and measured activity domains. In this article, a generalized graph contrastive learning framework (DSMFT-Net) incorporated with Boulic-Thalmann simulation model for few-shot HAR is proposed. DSMFT-Net combines a clustering strategy with contrastive learning to develop a robust, domain-invariant feature representation. Particularly, the method divided into two phases: single radar range Doppler spectrogram prototypical contrast, enhancing the classification discriminative features by improving the compaction of prototypes and instances within a domain. Then, cross prototypical contrast of simulated and measured radar range Doppler spectrogram domain, focuses on discovering domain-invariant features through prototype-instance matching and proximity exploration. Moreover, mutual information maximization ensures the reliability of predictions, while pseudo-label information aids in self-supervised contrastive pre-training by comparing positive and negative sample pairs. The effectiveness of the model is empirically validated through testing conducted in both open and complex office environments. The experimental results indicate that the proposed method achieves an average accuracy of 93.3% under 5-shot setting and 96.5% under 10-shot setting across six human activity recognition tasks. These findings highlight the effectiveness of the proposed method in achieving high performance even with limited labeled data.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"128 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01848-2","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Graph data-driven machine learning methods for human activity recognition (HAR) have achieved success recently using sufficient data. In the realm of everyday life, we encounter a notable challenge: the scarcity of labeled radar samples. This limitation is compounded by the stark disparities in data distribution between simulated and measured activity domains. In this article, a generalized graph contrastive learning framework (DSMFT-Net) incorporated with Boulic-Thalmann simulation model for few-shot HAR is proposed. DSMFT-Net combines a clustering strategy with contrastive learning to develop a robust, domain-invariant feature representation. Particularly, the method divided into two phases: single radar range Doppler spectrogram prototypical contrast, enhancing the classification discriminative features by improving the compaction of prototypes and instances within a domain. Then, cross prototypical contrast of simulated and measured radar range Doppler spectrogram domain, focuses on discovering domain-invariant features through prototype-instance matching and proximity exploration. Moreover, mutual information maximization ensures the reliability of predictions, while pseudo-label information aids in self-supervised contrastive pre-training by comparing positive and negative sample pairs. The effectiveness of the model is empirically validated through testing conducted in both open and complex office environments. The experimental results indicate that the proposed method achieves an average accuracy of 93.3% under 5-shot setting and 96.5% under 10-shot setting across six human activity recognition tasks. These findings highlight the effectiveness of the proposed method in achieving high performance even with limited labeled data.

查看原文本刊更多论文

基于少量学习的双输入神经网络在毫米波雷达复杂频谱图识别系统中的应用

最近，利用足够的数据，图形数据驱动的人类活动识别（HAR）机器学习方法取得了成功。在日常生活中，我们遇到了一个显著的挑战：标记雷达样本的稀缺性。这种限制由于模拟活动域和测量活动域之间数据分布的明显差异而更加复杂。本文提出了一种结合Boulic-Thalmann仿真模型的广义图对比学习框架（DSMFT-Net）。DSMFT-Net将聚类策略与对比学习相结合，开发出鲁棒的、域不变的特征表示。具体而言，该方法分为单雷达距离多普勒谱图原型对比两个阶段，通过改进原型和实例在一个域内的压缩程度来增强分类判别特征。然后，对模拟和实测雷达距离多普勒谱图域进行交叉原型对比，重点通过原型-实例匹配和邻近探测发现域不变特征。此外，互信息最大化保证了预测的可靠性，而伪标签信息通过比较正负样本对有助于自监督对比预训练。通过在开放和复杂办公环境中进行的测试，实证验证了模型的有效性。实验结果表明，在6个人体活动识别任务中，该方法在5次射击设置下的平均准确率为93.3%，在10次射击设置下的平均准确率为96.5%。这些发现强调了所提出的方法即使在有限的标记数据下也能实现高性能的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Complex & Intelligent Systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

9.60

自引率

10.30%

发文量

297

期刊介绍： Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.