Xiangqun Li, Jiawen Liang, Jinyu Zhu, Shengping Shi, Fangyu Ding, Jianpeng Sun, Bo Liu
{"title":"Small Sample Fiber Full State Diagnosis Based on Fuzzy Clustering and Improved ResNet Network","authors":"Xiangqun Li, Jiawen Liang, Jinyu Zhu, Shengping Shi, Fangyu Ding, Jianpeng Sun, Bo Liu","doi":"10.1049/2024/5512014","DOIUrl":"10.1049/2024/5512014","url":null,"abstract":"<div>\u0000 <p>The optical time domain reflectometer (OTDR) curve features of communication fibers exhibit subtle differences among their normal, subhealthy, and faulty operating states, making it challenging for existing machine learning-based fault diagnosis algorithms to extract these minute features. In addition, the OTDR curve field fault data are scarce, and data-driven deep neural network that needs a lot of data training cannot meet the requirements. In response to this issue, this paper proposes a communication fiber state diagnosis model based on fuzzy clustering and an improved ResNet. First, the pretrained residual network (ResNet) is modified by removing the classification layer and retaining the feature extraction layers. A global average pooling (GAP) layer is designed as a replacement for the fully connected layer. Second, fuzzy clustering, instead of the softmax classification layer, is employed in ResNet for its characteristic of requiring no subsequent data training. The improved model requires only a small amount of sample training to optimize the parameters of the GAP layer, thereby accommodating state diagnosis in scenarios with limited data availability. During the diagnosis process, the OTDR curves are input into the network, resulting in 512 features outputted in the GAP layer. These features are used to construct a feature vector matrix, and a dynamic clustering graph is formed using fuzzy clustering to realize the fiber state diagnosis. Through on-site data detection and validation, it has been demonstrated that the improved ResNet can effectively identify the full cycle of fiber states.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/5512014","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139852307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Unsupervised Deep Learning Framework for Retrospective Gating of Catheter-Based Cardiac Imaging","authors":"Zheng Sun, Yue Yao, Ru Wang","doi":"10.1049/2024/5664618","DOIUrl":"https://doi.org/10.1049/2024/5664618","url":null,"abstract":"Motion artifacts are a major challenge in the in vivo application of catheter-based cardiac imaging modalities. Gating is a critical tool for suppressing motion artifacts. Electrocardiogram (ECG) gating requires a trigger device or synchronous ECG recordings for retrospective analysis. Existing retrospective software gating methods extract gating signals through separate steps based on changes in vessel morphology or image features, which require a high computational cost and are prone to error accumulation. In this paper, we report on an end-to-end unsupervised learning framework for retrospective image-based gating (IBG) of catheter-based intracoronary images, named IBG Network. It establishes a direct mapping from a continuously acquired image sequence to a gated subsequence. The network was trained on clinical data sets in an unsupervised manner, addressing the difficulty of obtaining the gold standard in deep learning-based motion suppression techniques. Experimental results of in vivo intravascular ultrasound and optical coherence tomography sequences show that the proposed method has better performance in terms of motion artifact suppression and processing efficiency compared with the state-of-the-art nonlearning signal-based and IBG methods.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139862886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Unsupervised Deep Learning Framework for Retrospective Gating of Catheter-Based Cardiac Imaging","authors":"Zheng Sun, Yue Yao, Ru Wang","doi":"10.1049/2024/5664618","DOIUrl":"10.1049/2024/5664618","url":null,"abstract":"<div>\u0000 <p>Motion artifacts are a major challenge in the <i>in vivo</i> application of catheter-based cardiac imaging modalities. Gating is a critical tool for suppressing motion artifacts. Electrocardiogram (ECG) gating requires a trigger device or synchronous ECG recordings for retrospective analysis. Existing retrospective software gating methods extract gating signals through separate steps based on changes in vessel morphology or image features, which require a high computational cost and are prone to error accumulation. In this paper, we report on an end-to-end unsupervised learning framework for retrospective image-based gating (IBG) of catheter-based intracoronary images, named IBG Network. It establishes a direct mapping from a continuously acquired image sequence to a gated subsequence. The network was trained on clinical data sets in an unsupervised manner, addressing the difficulty of obtaining the gold standard in deep learning-based motion suppression techniques. Experimental results of <i>in vivo</i> intravascular ultrasound and optical coherence tomography sequences show that the proposed method has better performance in terms of motion artifact suppression and processing efficiency compared with the state-of-the-art nonlearning signal-based and IBG methods.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/5664618","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139803067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Power Resource Allocation Algorithm for Dual-Function Radar–Communication System","authors":"Yue Xiao, Zhenkai Zhang, Xiaoke Shang","doi":"10.1049/2024/5072597","DOIUrl":"10.1049/2024/5072597","url":null,"abstract":"<div>\u0000 <p>In this paper, a power allocation algorithm of dual-function radar–communication system with limited power is proposed to obtain better overall system performance measured by the weighted summation of radar signal to interference plus noise ratio (SINR) and communication channel capacity. First, a power allocation model is established to maximize the radar SINR and communication channel capacity with limited transmitted power. Then, the Karush–Kuhn–Tucker (KKT) conditions are used to solve the optimal objective function under the condition that only radar SINR or communication channel capacity is considered, respectively. Finally, the optimal value is combined with the original model and transformed into a single objective optimization model, and the optimal power is obtained by solving the model through the iterative optimization algorithm. Simulation results show that, compared with other power allocation algorithms, the proposed algorithm can achieve better radar-communication integration performance under the same transmit power.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/5072597","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139870587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Power Resource Allocation Algorithm for Dual-Function Radar–Communication System","authors":"Yue Xiao, Zhenkai Zhang, Xiaoke Shang","doi":"10.1049/2024/5072597","DOIUrl":"https://doi.org/10.1049/2024/5072597","url":null,"abstract":"In this paper, a power allocation algorithm of dual-function radar–communication system with limited power is proposed to obtain better overall system performance measured by the weighted summation of radar signal to interference plus noise ratio (SINR) and communication channel capacity. First, a power allocation model is established to maximize the radar SINR and communication channel capacity with limited transmitted power. Then, the Karush–Kuhn–Tucker (KKT) conditions are used to solve the optimal objective function under the condition that only radar SINR or communication channel capacity is considered, respectively. Finally, the optimal value is combined with the original model and transformed into a single objective optimization model, and the optimal power is obtained by solving the model through the iterative optimization algorithm. Simulation results show that, compared with other power allocation algorithms, the proposed algorithm can achieve better radar-communication integration performance under the same transmit power.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139810955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Speech Enhancement Using Joint DNN-NMF Model Learned with Multi-Objective Frequency Differential Spectrum Loss Function","authors":"Matin Pashaian, Sanaz Seyedin","doi":"10.1049/2024/8881007","DOIUrl":"10.1049/2024/8881007","url":null,"abstract":"<div>\u0000 <p>We propose a multi-objective joint model of non-negative matrix factorization (NMF) and deep neural network (DNN) with a new loss function for speech enhancement. The proposed loss function (<i>L</i><sub>MOFD</sub>) is a weighted combination of a frequency differential spectrum mean squared error (MSE)-based loss function (<i>L</i><sub>FD</sub>) and a multi-objective MSE loss function (<i>L</i><sub>MO</sub>). The conventional MSE loss function computes the discrepancy between the estimated speech and clean speech across all frequencies, disregarding the process of changing amplitude in the frequency domain which contains valuable information. The differential spectrum representation retains spectral peaks that carry important information. Using this representation helps to ensure that this information in the speech signal is reserved. Also, on the other hand, noise spectra typically have a flat shape and as the differential operation makes the flat spectral partly close to zero, the differential spectrum is resistant to noises with smooth structures. Thus, we propose using a frequency-differentiated loss function that considers the magnitude spectrum differentiations between the neighboring frequency bins in each time frame. This approach maintains the spectrum variations of the objective signal in the frequency domain, which can effectively reduce the noise deterioration effects. The multi-objective MSE term (<i>L</i><sub>MO</sub>) is a combined two-loss function related to the NMF coefficients which are the intermediate output targets, and the original spectral signals as the actual output targets. The use of encoded NMF coefficients as low-dimensional structural features for DNN serves as prior knowledge and helps the learning process. <i>L</i><sub>MO</sub> is used beside <i>L</i><sub>FD</sub> to take advantage of both the properties of the original and the differential spectrum in the training loss function. Moreover, a DNN-based noise classification and fusion strategy (NCF) is proposed to exploit a discriminative model for noise reduction. The experiments reveal the improvements of the proposed approach compared to the previous methods.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/8881007","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139601400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multiscale Dilated Convolution for Image Compressive Sensing (CS)","authors":"Youhao Yu, Richard M. Dansereau","doi":"10.1049/2024/6666549","DOIUrl":"10.1049/2024/6666549","url":null,"abstract":"<div>\u0000 <p>Compressive sensing (CS) is a technique that enables the recovery of sparse signals using fewer measurements than traditional sampling methods. To address the computational challenges of CS reconstruction, our objective is to develop an interpretable and concise neural network model for reconstructing natural images using CS. We achieve this by mapping one step of the iterative shrinkage thresholding algorithm (ISTA) to a deep network block, representing one iteration of ISTA. To enhance learning ability and incorporate structural diversity, we integrate aggregated residual transformations (ResNeXt) and squeeze-and-excitation mechanisms into the ISTA block. This block serves as a deep equilibrium layer connected to a semi-tensor product network for convenient sampling and providing an initial reconstruction. The resulting model, called MsDC-DEQ-Net, exhibits competitive performance compared to state-of-the-art network-based methods. It significantly reduces storage requirements compared to deep unrolling methods, using only one iteration block instead of multiple iterations. Unlike deep unrolling models, MsDC-DEQ-Net can be iteratively used, gradually improving reconstruction accuracy while considering computation tradeoffs. Additionally, the model benefits from multiscale dilated convolutions, further enhancing performance.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/6666549","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139630022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Anomaly Detection with Attention (DADA): A Novel Approach for Identifying Multipath Interference in Radar Signals","authors":"Kang Yan, Weidong Jin, Yingkun Huang, Zhenhua Li, Pucha Song, Ligang Huang","doi":"10.1049/2024/5026821","DOIUrl":"10.1049/2024/5026821","url":null,"abstract":"<div>\u0000 <p>Multipath interference in radar signals caused by sea, ground, and other environments poses significant challenges to the target detection, tracking, and classification capabilities of radar systems. Existing methods for radar signal identification require labeled samples and focus mainly on the classification of normal signals. However, in practice, anomalous samples (multipath interference signals) may be scarce and highly imbalanced (i.e., mostly normal samples). To address this problem, we propose a deep anomaly detection with attention (DADA) for semisupervised detection of multipath radar signals. The method transforms radar signals into time–frequency images and is trained exclusively on normal samples. The autoencoder architecture is extended with a feature extractor network to capture latent sample features. CBAM attention is introduced to improve feature extraction. By learning the distribution of normal samples in high-dimensional image space and low-dimensional feature space, a two-dimensional feature space representing normal samples is constructed. A one-class SVM then learns the boundary of normal samples for anomaly detection. Extensive experiments on radar signal datasets validate the effectiveness of the proposed approach.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/5026821","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139531325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"IRR-Net: A Joint Learning Framework for Image Reconstruction and Recognition of Photoacoustic Tomography","authors":"Zheng Sun, Bing Ai, Meichen Sun, Yingsa Hou","doi":"10.1049/2023/6615953","DOIUrl":"https://doi.org/10.1049/2023/6615953","url":null,"abstract":"In photoacoustic tomography (PAT), object identification and classification are usually performed as postprocessing processes after image reconstruction. Since useful information about the target implied in the raw signal can be lost during image reconstruction, this two-step scheme can reduce the accuracy of tissue characterization. For learning-based methods, it is time consuming to train the network of each subtask separately. In this paper, we report on an end-to-end joint learning framework for simultaneous image reconstruction and object recognition, named IRR-Net. It establishes direct mapping of raw photoacoustic signals to high-quality images with recognized targets. The network consists of an image reconstruction module, an optimization module, and a recognition module, which achieved signal-to-image, image-to-image, and image-to-class conversion, respectively. We built simulation, phantom and in vivo data sets to train and test IRR-Net. The results show that the proposed method successfully yields concurrent improvements in both the quality of the reconstructed images and the accuracy of target recognition at a lower time cost compared to the separately trained networks.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139164286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Low-Complexity BFGS-Based Soft-Output MMSE Detector for Massive MIMO Uplink","authors":"Lin Li, Jianhao Hu","doi":"10.1049/2023/8887060","DOIUrl":"https://doi.org/10.1049/2023/8887060","url":null,"abstract":"For the massive multiple-input multiple-output (MIMO) uplink, the linear minimum mean square error (MMSE) detector is near-optimal but involves undesirable matrix inversion. In this paper, we propose a low-complexity soft-output detector based on the simplified Broyden–Fletcher–Goldfarb–Shanno method to realize the matrix-inversion-free MMSE detection iteratively. To accelerate convergence with minimal computational overhead, an appropriate initial solution is presented leveraging the channel-hardening property of massive MIMO. Moreover, we employ a low-complexity approximated approach to calculating the log-likelihood ratios with negligible performance losses. Simulation results finally verify that the proposed detector can achieve the near-MMSE performance with a few iterations and outperforms the recently reported linear detectors in terms of lower complexity and faster convergence for the realistic massive MIMO systems.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134957184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}