{"title":"Human gait recognition using dense residual network and hybrid attention technique with back-flow mechanism","authors":"Mohammad Iman Junaid, Sandeep Madarapu, Samit Ari","doi":"10.1016/j.dsp.2025.105401","DOIUrl":"10.1016/j.dsp.2025.105401","url":null,"abstract":"<div><div>Gait recognition is a promising biometric technique for person identification, either as a standalone method or in combination with other modalities. A major challenge lies in extracting robust gait features from silhouettes that remain invariant to variations in clothing, carried objects, and camera viewpoints. Recent advances using attention-based convolutional neural networks (CNNs) have improved gait recognition performance; however, many existing methods struggle to preserve semantic information across network layers due to information loss during the downsampling stages. To address this issue, a novel residual dense back-flow attention network (RDBA-Net) is proposed, which integrates dual-branch hybrid self-attention network (DHSAN) modules with densely connected residual dense blocks (RDBs), and the output features are concatenated in a back-flow direction. This design enables effective learning of discriminative gait features by leveraging attention cues at both the spatial and temporal levels from silhouette sequences. Furthermore, the back-flow mechanism enhances feature learning in earlier layers by reusing refined semantic information from deeper layers. 
Experimental evaluations on two benchmark datasets, CASIA B and OU-MVLP, demonstrate that RDBA-Net achieves notable improvements in accuracy compared to existing state-of-the-art methods, with gains up to 91.6% on CASIA B and 89.2% on OU-MVLP under challenging conditions.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"166 ","pages":"Article 105401"},"PeriodicalIF":2.9,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144307331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"PMDFN3D: Pre-mid dual fusion network for 3D object detection","authors":"Haishun Du , Sen Wang , Wenzhe Zhang , Linbing Cao","doi":"10.1016/j.dsp.2025.105399","DOIUrl":"10.1016/j.dsp.2025.105399","url":null,"abstract":"<div><div>In recent years, multi-modality 3D object detection has gradually become the mainstream approach to 3D object detection. In multi-modality 3D object detection, effectively fusing information from point cloud data and image data remains a significant challenge. Existing multi-modality 3D object detection models mainly use one of the pre-, mid- or post-fusion strategies to fuse image data and point cloud data, and each of these fusion strategies has some shortcomings. Currently, integrating multiple fusion strategies into a single framework is still a research gap in the field of multi-modality 3D object detection. To fill this gap, we propose a pre-mid dual fusion network for 3D object detection (PMDFN3D), which integrates pre-fusion and mid-fusion into a unified framework. Specifically, we first design a depth-guided cross-modality feature fusion module that enables the effective integration of image and point features without requiring complex feature alignment operations. Then, we design a neighboring feature interaction attention module to mitigate the impact of down-sampling operations in the point cloud backbone network on the precision of point features. Finally, we design a simple object-level feature selector and an object-level feature-guided cross-modality feature fusion module, which adaptively integrate image features relevant to the objects with object-level point features. 
Experimental results on the SUN RGB-D dataset demonstrate that our network has achieved state-of-the-art performance in 3D object detection.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"166 ","pages":"Article 105399"},"PeriodicalIF":2.9,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144263122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A high-reliability dual-mode M-ary differential chaos shift keying system with index modulation","authors":"Gang Zhang , Yongqi Guo , Xinyu Xiong","doi":"10.1016/j.dsp.2025.105388","DOIUrl":"10.1016/j.dsp.2025.105388","url":null,"abstract":"<div><div>This paper introduces a novel high-efficiency dual-mode multi-carrier differential chaos shift keying (HRDM-MDCSK-IM) system that integrates index modulation and noise suppression techniques. The system utilizes distinct modulation schemes for active and inactive time slots on each subcarrier and encodes them using reference selection indexing and combined Walsh code indexing, thereby enhancing the system's efficiency. At the receiver's end, the system employs the sliding average of reference signals to reduce noise and performs secondary noise reduction on information signals during active time slots to recover information bits. This approach optimizes the bit error rate (BER) performance and improves the overall system performance. Simulation results demonstrate that the HRDM-MDCSK-IM system significantly outperforms existing benchmark systems in terms of data rates and spectral efficiency. The paper also derives the theoretical expressions for the BER of the system under both additive white Gaussian noise (AWGN) and multipath Rayleigh fading channels, and verifies their accuracy through simulation. 
The theoretical analysis, coupled with simulation results, confirms the superior performance of the HRDM-MDCSK-IM system and highlights its potential for practical applications in high-efficiency wireless communication systems.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"166 ","pages":"Article 105388"},"PeriodicalIF":2.9,"publicationDate":"2025-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144297767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"VOLEMIA: Non-invasive blood pressure estimation using temporal-spectral convolutional network","authors":"Trishna Saikia , Satwik Vankayalapati , Puneet Gupta , Pasi Liljeberg","doi":"10.1016/j.dsp.2025.105393","DOIUrl":"10.1016/j.dsp.2025.105393","url":null,"abstract":"<div><div>This paper introduces a novel method, <em>VOLEMIA</em>, to improve blood pressure (BP) estimation from the photoplethysmography (PPG) signal. Existing literature has often relied on long-duration PPG signals, which can be noise-prone, thereby compromising the performance of BP estimation. As a solution, <em>VOLEMIA</em> presents the PulseBlend Deconstructor (PBD), which partitions the lengthy PPG signal into shorter segments and consolidates the segments to extract the noise-resilient PPG signal. Furthermore, <em>VOLEMIA</em> presents the Pulse Spectra Extractor (PSA) mechanism to extract pulsatile spectral features from the PPG signal because spectral features provide relevant cues for systolic BP (SBP) and diastolic BP (DBP). Unlike existing methods, <em>VOLEMIA</em> incorporates these features into an advanced sequential deep learning framework while also considering the correlation between SBP and DBP. A new composite loss function is proposed to enable the network to learn both individual and correlated BP features, enhancing performance. Experimental results on our newly designed DILPPG dataset and the publicly available MIMIC-II dataset demonstrate that <em>VOLEMIA</em> outperforms existing methods on both datasets. The results also indicate that key components of <em>VOLEMIA</em>, such as PBD, PSA, and the composite loss function, play a crucial role in the performance improvement. 
Dataset link: https://github.com/TrishnaSaikia/-DILPPG-Dataset.git</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"166 ","pages":"Article 105393"},"PeriodicalIF":2.9,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144254332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Online spatial alignment and fusion for networked radars on moving platforms only using target position information","authors":"Chenyu Zhu, Xiaoyu Cong, Yubing Han, Weixing Sheng","doi":"10.1016/j.dsp.2025.105375","DOIUrl":"10.1016/j.dsp.2025.105375","url":null,"abstract":"<div><div>Spatial alignment is a prerequisite for cooperative detection in networked radars; even minor biases in spatial alignment can result in large errors in the converted target geolocation. Existing spatial alignment algorithms commonly rely on the Global Positioning System (GPS) and Inertial Measurement Unit (IMU) to provide positional data and attitude angles. To overcome this limitation, we formulate the spatial alignment relationships between radars as an optimization function based on a sliding window mechanism. This function is then solved recursively using a combination of Tikhonov regularization and recursive least squares (RLS) to obtain accurate spatial alignment estimates. To provide criteria for the selection of reference radars before multi-radar alignment, a dynamic preselection strategy is put forward. This strategy creates a prior advantage for parameter estimation by analyzing the correlations between target trajectories from different radars. Considering the coupling between alignment and fusion processes, we present a feedback adjustment method to further improve the accuracy of alignment and fusion. 
Simulation results show the effectiveness of the proposed algorithm and its superior performance compared with traditional algorithms under the same conditions.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"166 ","pages":"Article 105375"},"PeriodicalIF":2.9,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144241641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Statistical CSI-based dual-hop hybrid RIS-assisted wireless communication","authors":"Shuang Liang, Guangliang Ren","doi":"10.1016/j.dsp.2025.105398","DOIUrl":"10.1016/j.dsp.2025.105398","url":null,"abstract":"<div><div>Statistical channel state information (S-CSI) based ergodic achievable rate maximization is investigated for dual-hop hybrid reconfigurable intelligent surfaces (D-HRIS) assisted single-user wireless communication systems. A communication model, in which the transmitted signal is reflected through passive RIS (pRIS) to the active RIS (aRIS) and then reflected to the receiver via aRIS, is regarded as D-HRIS-assisted communication. The ergodic achievable rate is analyzed and its approximate expression is derived for this system. Based on the S-CSI, a low-complexity iterative updating scheme is proposed to design the precoding of the base station (BS) and the reflecting matrices of hybrid RISs (HRIS) to maximize the ergodic achievable rate. Specifically, the ergodic achievable rates of the proposal are 1.32 and 1.16 times larger than those of the dual-hop pRIS-aided scheme when the number of elements at the second-hop RIS is 20 and 100, respectively. When the Rician-K factor is larger than 0 dB, the ergodic achievable rate of the proposal is close to that of the instantaneous channel state information (I-CSI) based D-HRIS-assisted communication. 
Moreover, the performance of the system can still be guaranteed when the RIS employs discrete phase shifters.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"166 ","pages":"Article 105398"},"PeriodicalIF":2.9,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144241642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Localization through mitigating and compensating UWB NLOS ranging error with neural network","authors":"Muhammad Shalihan , Zhiqiang Cao , Khattiya Pongsirijinda , Benny Kai Kiat Ng , Billy Pik Lik Lau , Ran Liu , Chau Yuen , U-Xuan Tan","doi":"10.1016/j.dsp.2025.105397","DOIUrl":"10.1016/j.dsp.2025.105397","url":null,"abstract":"<div><div>Indoor localization of robots is crucial for enabling effective navigation and path planning. Ultra-wideband (UWB) technology is gaining popularity due to its low cost and high accuracy. However, environmental obstructions often lead to Non-Line-Of-Sight (NLOS) signal propagation, which significantly impacts ranging and localization accuracy. Existing NLOS mitigation approaches typically discard identified NLOS measurements or utilize Channel Impulse Response (CIR), which may not be accessible from off-the-shelf UWB devices without directly extracting data from the UWB chip. To address these challenges, we propose the Compensated-Ranging Weighted Least Square (CR-WLS) localization approach, which mitigates the effects of NLOS ranging measurements without discarding them. Our approach focuses solely on the data available from off-the-shelf UWB devices without any extraction steps to obtain CIR data. The proposed method incorporates a Neural Network (NN) model trained on ranging and Received Signal Strength (RSS) data. The NN model outputs a weight and a compensation ratio for each ranging measurement. This weight and ratio are used to improve the localization accuracy using a Weighted Least Square (WLS) strategy. To validate the effectiveness of our approach, we conducted experiments in three different indoor environments. 
Our results demonstrate that the proposed CR-WLS approach outperforms the conventional Least Square (LS) approach, which does not consider NLOS, by up to 77.44% in localization accuracy.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"166 ","pages":"Article 105397"},"PeriodicalIF":2.9,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144241643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SVD-based square-root Kalman filtering: A survey of existing implementation methods and novel techniques","authors":"M.V. Kulikova, G.Yu. Kulikov","doi":"10.1016/j.dsp.2025.105391","DOIUrl":"10.1016/j.dsp.2025.105391","url":null,"abstract":"<div><div>Singular value decomposition (SVD) is well known to be successfully utilized in the Kalman filtering realm for deriving numerically stable square-root implementation methods. It is a powerful alternative to the traditional Cholesky factorization-based square-root approach, which has been in use in the engineering literature since the early 1960s. In this paper, we explore all existing SVD factorization-based square-root methods derived for discrete-time Kalman filtering (KF). We examine time-invariant state-space models and, as a consequence, our survey includes both the Riccati and Chandrasekhar recursion-based KF methodologies. Each approach additionally contains the covariance-type algorithms, information-type methods and the mixed-type variants when they exist. We also propose two novel Riccati-based algorithms that belong to the information-type filtering. One of them is derived by using the hyperbolic SVD to create the homogeneous information-type SVD filter unlike the previously derived method. In our overview, we discuss the properties of the methods and the differences in their implementation, provide a summary of each algorithm, and discuss the problems that are still open in this realm for future research. Numerical tests are also given. 
They illustrate the numerical behavior of the implementation methods on both well- and ill-conditioned problems.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"166 ","pages":"Article 105391"},"PeriodicalIF":2.9,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144254330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dual-auxiliary modalities and multistage semantic information fusion network for visible-infrared person re-identification","authors":"Caihong Yuan, Bo Jiang, Xiaopan Chen, Xiaoke Zhu, Wenjuan Liang","doi":"10.1016/j.dsp.2025.105392","DOIUrl":"10.1016/j.dsp.2025.105392","url":null,"abstract":"<div><div>Visible-Infrared person Re-IDentification (VI-ReID) poses a formidable challenge due to the substantial discrepancies between infrared and visible light imagery. Some existing approaches attempt to address this issue by extracting modality-shared representations directly from the original images or by using a single intermediate modality as a bridge. However, these methods often fall short in effectively narrowing the wide gap between visible and infrared modalities, leading to limited performance improvements. To overcome this limitation, we propose the Dual-Auxiliary Modalities and multistage Semantic Information Fusion Network (DMMSIF). Specifically, we utilize dual-auxiliary intermediate modalities to create a smoother transition between visible and infrared images. By incorporating these auxiliary modalities, we enable a more gradual approximation between the two original modalities, thereby significantly reducing modality discrepancies and minimizing information loss during the feature extraction process. In addition, we propose the Multistage Semantic Information Fusion (MSIF) module to explore and integrate relational semantic information across different stages of the network. By doing so, we can obtain a more discriminative and informative multistage fusion feature, which captures both local and global contexts effectively. Furthermore, we propose two novel losses (Modality Distribution Alignment Loss and Hybrid-Center Triplet Loss) to effectively align the four modalities through thorough mutual learning among them and to extract more robust and discriminative feature representations that are better suited for VI-ReID tasks. 
Extensive experiments on the SYSU-MM01, RegDB, and LLCM datasets demonstrate the effectiveness of our method.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"166 ","pages":"Article 105392"},"PeriodicalIF":2.9,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144254250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enhancing mmWave performance: Multi-RIS and antenna selection under hardware impairments","authors":"Duc Thinh Vu , Ba Cao Nguyen , Danh Khoa Nguyen , Taejoon Kim , Bui Vu Minh , Anh Vu Le","doi":"10.1016/j.dsp.2025.105389","DOIUrl":"10.1016/j.dsp.2025.105389","url":null,"abstract":"<div><div>This article examines the capacity performance of millimeter-wave (mmWave) communication systems utilizing multiple reconfigurable intelligent surfaces (RISs). Specifically, transceiver hardware impairments (THI) at both source and destination are taken into account. To dramatically reduce the effects of THI, transmit antenna selection (TAS) is exploited at the source. The mathematical formulas of achievable rate (AR) and energy efficiency (EE) of the considered multi-RIS-mmWave systems with TAS are derived over Nakagami-<em>m</em> channels under the THI conditions. Numerical results clarify the large effects of THI on the AR and EE of the multi-RIS-mmWave systems: the AR and EE with THI are much lower than those with perfect transceiver hardware (PTH), i.e., without THI. In this circumstance, utilizing TAS as well as RISs can achieve many benefits. In addition to the combined advantages of TAS and RISs, their individual advantages are also verified. In particular, the AR and EE achieved with TAS are significantly greater than those obtained without TAS. Then, other solutions such as increasing the number of RISs/reflecting elements (REs), increasing the number of transmit antennas, and using suitable frequencies and bandwidths are provided to improve the AR and EE of the considered multi-RIS-mmWave systems with TAS and THI. 
Finally, the AR and EE expressions are verified by Monte-Carlo simulations.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"166 ","pages":"Article 105389"},"PeriodicalIF":2.9,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144312478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}