Zhuang Zhang;Lijun Zhang;Dejian Meng;Wei Tian;Jun Yan
{"title":"Spectral Scaling-Based Augmentation for Corruption-Robust Image Classification","authors":"Zhuang Zhang;Lijun Zhang;Dejian Meng;Wei Tian;Jun Yan","doi":"10.1109/LSP.2025.3549014","DOIUrl":"https://doi.org/10.1109/LSP.2025.3549014","url":null,"abstract":"Image classifiers often degrade in performance when test images differ significantly from the training distribution due to real-world image corruptions. Frequency-based augmentations can be used to address this issue, but existing methods excel against corruptions caused by noise and blur while struggling with those caused by contrast and fog. To tackle these challenges, we propose a novel image augmentation method grounded in a new perspective of relative spectral differences. This perspective characterizes spectral variations introduced by common corruptions as changes in non-zero frequencies, providing a unified understanding of their effects on image spectra. Building on this insight, the proposed method incorporates two key modules: a random spectral scaling module that captures statistical properties of image spectra and a deep spectral scaling module that adaptively learns spectral adjustments through a neural network. Experiments demonstrate that the proposed method improves overall robustness across various corruptions, with notable gains of 6.3% and 6.4% on contrast and fog, respectively, where existing methods often fall short.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"1206-1210"},"PeriodicalIF":3.2,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143667475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jialong Zhong;Tingwei Liu;Yongri Piao;Weibing Sun;Huchuan Lu
{"title":"ProSegDiff: Prostate Segmentation Diffusion Network Based on Adaptive Adjustment of Injection Features","authors":"Jialong Zhong;Tingwei Liu;Yongri Piao;Weibing Sun;Huchuan Lu","doi":"10.1109/LSP.2025.3548422","DOIUrl":"https://doi.org/10.1109/LSP.2025.3548422","url":null,"abstract":"Recently, methods based on Diffusion Probability Models (DPM) have achieved notable success in the field of medical image segmentation. However, most of these methods do not perform well in segmenting ambiguous areas when dealing with prostate segmentation tasks due to the low distinguishability of prostate images and the high overlap of its boundary with adjacent organs. To address this issue, this paper introduces a diffusion-based framework named ProSegDiff, ProSegDiff employs an Adapter to dynamically adjust features from the conditional network to align with the denoising process of the denoising network. Furthermore, the denoising process is conducted in the latent space to minimize the consumption of computational resources, and a proposed selection strategy is employed to identify the better results from multiple inferences. Extensive comparative experiments on four benchmark datasets demonstrate the effectiveness of this method, which achieves superior performance across four evaluation metrics.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"1236-1240"},"PeriodicalIF":3.2,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143688160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mingjing Cui;Yunxiang Jiang;Dongyuan Lin;Shiyuan Wang;Fuliang He
{"title":"Enhanced Batch Adaptive Filter Based on Fractional-Order Generalized Cauchy Kernel Loss","authors":"Mingjing Cui;Yunxiang Jiang;Dongyuan Lin;Shiyuan Wang;Fuliang He","doi":"10.1109/LSP.2025.3548432","DOIUrl":"https://doi.org/10.1109/LSP.2025.3548432","url":null,"abstract":"Adaptive filters utilizing the low-order moments hidden in robust loss functions have achieved desirable performance under Gaussian input and impulsive noises. However, when the input cannot be modeled by Gaussian process and is simultaneously contaminated by outliers, these filters may suffer from misalignment. To this end, applying fractional-order calculus in stochastic gradient descent method, this letter proposes a fractional-order generalized Cauchy kernel loss (FoGCKL) algorithm to model complex <inline-formula> <tex-math>$alpha$</tex-math></inline-formula>-stable process input. The mean square deviation (MSD) is calculated to evaluate the steady-state performance of FoGCKL. To further avoid steady-state jitters and improve filtering accuracy, an enhanced batch method is constructed in FoGCKL using optimized weighted term, generating another enhanced batch FoGCKL (EB-FoGCKL) algorithm. Simulations on system identification verify the correctness of theoretical analysis and demonstrate the superiorities of FoGCKL and EB-FoGCKL.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"1201-1205"},"PeriodicalIF":3.2,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143667392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-Domain Time-Frequency Fusion Feature Contrastive Learning for Machinery Fault Diagnosis","authors":"Yang Wei;Kai Wang","doi":"10.1109/LSP.2025.3548466","DOIUrl":"https://doi.org/10.1109/LSP.2025.3548466","url":null,"abstract":"The scarcity of a large amount of labeled data for adequately training of deep learning models, along with their restricted generalization capabilities, persistently hinders the real-world practical application of data-driven deep learning in few-shot fault diagnosis and transfer task fault diagnosis. This paper proposes a self-supervised Wide Kernel Time-Frequency Fusion (WTFF) contrastive learning method that leverages extensive unlabeled signals to extract discriminative time-frequency fusion features, thereby enhancing fault diagnosis performance even with a limited number of labeled samples. Moreover, the WTFF integrates a multi-layer time-frequency wide convolutional neural network (TFCNN) encoder with a novel local and global time-frequency contrastive loss (LGTFCL) to capture time frequency consistency by facilitating the alignment of time-domain and frequency-domain feature embeddings across the shallow and deep network layers. In the fine-tuning phase, time frequency features across various levels learned from transferred pretrained model are fused to extract signal characteristics that exhibit both time and frequency discrimination. The proposed method demonstrates superior diagnostic accuracy and robustness in experiments involving few-shot and transfer learning-based fault diagnosis.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"1116-1120"},"PeriodicalIF":3.2,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143675976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"From Body Parts to Holistic Action: A Fine-Grained Teacher-Student CLIP for Action Recognition","authors":"Yangjun Ou;Xiao Shi;Jia Chen;Ruhan He;Chi Liu","doi":"10.1109/LSP.2025.3548448","DOIUrl":"https://doi.org/10.1109/LSP.2025.3548448","url":null,"abstract":"Action recognition in dynamic video remains challenging, particularly when distinguishing between visually similar actions. While existing methods often rely on holistic representations, they overlook the fine-grained details that are significant for accurate classification. We propose a novel Fine-grained Teacher-student CLIP (FT-CLIP) that integrates body part analysis with holistic action recognition through a teacher-student architecture, bridging the gap between fine-grained action parsing and overall action understanding. The teacher model processes individual body parts alongside specialized description to generate part-specific features, which are then aggregated and distilled into the student model. Through knowledge distillation with learnable prompts, the student model effectively learns to capture subtle action distinctions while maintaining efficient inference. FT-CLIP achieves a more nuanced understanding of complex actions by progressing from detailed body part analysis to comprehensive action recognition. Experiments on Kinetics-TPS under a fully-supervised setting and on HMDB51 and UCF101 under a zero-shot setting demonstrate the effectiveness of our method.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"1336-1340"},"PeriodicalIF":3.2,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143726488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Centroid-Free K-Means With Balanced Clustering","authors":"Bin Meng;Fangfang Li;Fan Yang;Quanxue Gao","doi":"10.1109/LSP.2025.3547665","DOIUrl":"https://doi.org/10.1109/LSP.2025.3547665","url":null,"abstract":"Currently, a wide array of clustering algorithms have emerged, yet many approaches rely on K-means to detect clusters. However, K-means is highly sensitive to the selection of the initial cluster centers, which poses a significant obstacle to achieving optimal clustering results. Moreover, its capability to handle nonlinearly separable data is less than satisfactory. To overcome the limitations of traditional K-means, we draw inspiration from manifold learning to reformulate the K-means algorithm into a new clustering method based on manifold structures. This method not only eliminates the need to calculate centroids in traditional approaches, but also preserves the consistency between manifold structures and clustering labels. Furthermore, we introduce the <inline-formula> <tex-math>$ell _{2,1}$</tex-math></inline-formula>-norm to naturally maintain class balance during the clustering process. Additionally, we develop a versatile K-means variant framework that can accommodate various types of distance functions, thereby facilitating the efficient processing of nonlinearly separable data. The experimental results of several databases confirm the superiority of our proposed model.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"1191-1195"},"PeriodicalIF":3.2,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143667393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Millimeter-Wave MIMO Transmission for FBMC Systems With Lens Antenna Arrays","authors":"Ying Wang;Qiang Guo;Jianhong Xiang;Yu Zhong","doi":"10.1109/LSP.2025.3547268","DOIUrl":"https://doi.org/10.1109/LSP.2025.3547268","url":null,"abstract":"Millimeterwave (mmWave) techniques will be a key enabler for wireless communications to achieve high data rates. Additionally, Filter Bank Multi-Carrier (FBMC) with good spectral properties has also been regarded as an important transmission technique for future wireless communications. In this letter, we design and analyze an FBMC-based mmWave Multiple-input Multiple-output (MIMO) system. Specifically, we first pre-code quadrature amplitude modulation symbols in time to ensure that the MIMO technique becomes simple in FBMC. Secondly, we determine the optimal subcarrier spacing by maximizing the signal-to-interference ratio. Finally, using a lens antenna array combined with a simple channel estimator, we transmit data to the receiver. Simulation results show that FBMC can effectively support multi-antenna and mmWave techniques, providing favorable efficiency and reliability. Furthermore, we also verify that Alamouti's space time block code can provide considerable diversity gain.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"1141-1145"},"PeriodicalIF":3.2,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143654861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Outlier-Robust Multistatic Target Localization","authors":"Piyush Varshney;Prabhu Babu;Petre Stoica","doi":"10.1109/LSP.2025.3547859","DOIUrl":"https://doi.org/10.1109/LSP.2025.3547859","url":null,"abstract":"Multistatic localization techniques employ noisy range measurements collected via multiple transmitters and receivers to localize a target. However, in many realistic scenarios the data are corrupted by outliers which may be due to the failure of or malicious attack on one or more sensors. The presence of outliers leads to performance degradation in terms of target localization accuracy. In this letter, we address the problem of multistatic target localization when the measurements contain outliers. We employ a multi-hypothesis testing method based on the false discovery rate (FDR) to detect the outliers. More specifically, we consider a penalized maximum likelihood problem for joint estimation of the number and positions of the outliers as well as the target position, and the noise variance. To solve this problem, an iterative algorithm employing the majorization-minimization technique that minimizes the objective in a monotonic manner is developed. Through numerical simulations, we compare the proposed algorithm with other robust state-of-the-art algorithms and show that the proposed algorithm has superior performance.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"1161-1165"},"PeriodicalIF":3.2,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143667674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"XLSR-Mamba: A Dual-Column Bidirectional State Space Model for Spoofing Attack Detection","authors":"Yang Xiao;Rohan Kumar Das","doi":"10.1109/LSP.2025.3547861","DOIUrl":"https://doi.org/10.1109/LSP.2025.3547861","url":null,"abstract":"Transformers and their variants have achieved great success in speech processing. However, their multi-head self-attention mechanism is computationally expensive. Therefore, one novel selective state space model, Mamba, has been proposed as an alternative. Building on its success in automatic speech recognition, we apply Mamba for spoofing attack detection. Mamba is well-suited for this task as it can capture the artifacts in spoofed speech signals by handling long-length sequences. However, Mamba's performance may suffer when it is trained with limited labeled data. To mitigate this, we propose combining a new structure of Mamba based on a dual-column architecture with self-supervised learning, using the pre-trained wav2vec 2.0 model. The experiments show that our proposed approach achieves competitive results and faster inference on the ASVspoof 2021 LA and DF datasets, and on the more challenging In-the-Wild dataset, it emerges as the strongest candidate for spoofing attack detection.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"1276-1280"},"PeriodicalIF":3.2,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive Feature Selection Modulation Network for Efficient Image Super-Resolution","authors":"Chen Wu;Ling Wang;Xin Su;Zhuoran Zheng","doi":"10.1109/LSP.2025.3547669","DOIUrl":"https://doi.org/10.1109/LSP.2025.3547669","url":null,"abstract":"In the realm of image super-resolution, learning-based methods have made significant progress. However, limited computational resources still restrict their application. This prompts us to develop an efficient method for achieving effective image super-resolution. In this letter, we propose a novel adaptive feature selection modulation network (AFSMNet) tailored for efficient image super-resolution. Specifically, we design feature modulation blocks, which include the adaptive feature selection modulation (AFSM) module and the self-gating feed-forward network (SFN). The AFSM module dynamically computes the importance of each feature channel. For channels with differing levels of importance, we employ distinct processing strategies, thereby concentrating the computational resources of the network on the more critical features as much as possible. This approach facilitates the maintenance of a low computational cost without compromising performance. The SFN restricts the flow of irrelevant feature information within the network through a simple gating mechanism. In this way, our method achieves efficient and effective image super-resolution. Extensive experiment results show that the proposed method achieves a better trade-off between reconstruction performance and computational efficiency compared to the current state-of-the-art lightweight super-resolution methods.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"1231-1235"},"PeriodicalIF":3.2,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143688116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}