Frontiers in signal processing: latest articles, page 3

RIS-aided integrated sensing and communication: a mini-review

Frontiers in signal processing Pub Date: 2023-05-05 DOI: 10.3389/frsip.2023.1197240

Mirza Asif Haider, Yimin D. Zhang

Citations: 0

Degradation learning and Skip-Transformer for blind face restoration

Frontiers in signal processing Pub Date: 2023-05-02 DOI: 10.3389/frsip.2023.1106465

Ahmed Cheikh Sidiya, Xuanang Xu, N. Xu, Xin Li

Abstract: Blind restoration of low-quality faces in the real world has advanced rapidly in recent years. The rich and diverse priors encapsulated by pre-trained face GANs have demonstrated their effectiveness in reconstructing high-quality faces from low-quality real-world observations. However, the modeling of degradation in real-world face images remains poorly understood, which limits the generalization of existing methods. Inspired by the success of pre-trained models and transformers in recent years, we propose to solve the problem of blind restoration by jointly exploiting their power for degradation and prior learning, respectively. On the one hand, we train a two-generator architecture for degradation learning to transfer the style of low-quality real-world faces to the high-resolution output of a pre-trained StyleGAN. On the other hand, we present a hybrid architecture, called Skip-Transformer (ST), which combines transformer encoder modules with a pre-trained StyleGAN-based decoder using skip layers. This hybrid design is innovative in that it represents the first attempt to jointly exploit the global attention mechanism of the transformer and pre-trained StyleGAN-based generative facial priors. We have compared our DL-ST model with three recent benchmarks for blind image restoration (DFDNet, PSFRGAN, and GFP-GAN). Our experimental results show that this work outperforms all competing methods, both subjectively and objectively (as measured by the Fréchet Inception Distance and NIQE metrics).

Citations: 1

Issues of ubiquitous music archaeology: Shared knowledge, simulation, terseness, and ambiguity in early computer music

Frontiers in signal processing Pub Date: 2023-04-05 DOI: 10.3389/frsip.2023.1132672

Victor Lazzarini, Damián Keller, Nemanja Radivojević

Citations: 1

ICA’s bug: How ghost ICs emerge from effective rank deficiency caused by EEG electrode interpolation and incorrect re-referencing

Frontiers in signal processing Pub Date: 2023-04-03 DOI: 10.3389/frsip.2023.1064138

Hyeonseok Kim, Justin Luo, Shannon Chu, C. Cannard, Sven Hoffmann, M. Miyakoshi

Abstract: Independent component analysis (ICA) has been widely used for electroencephalography (EEG) analyses. However, ICA performance relies on several crucial assumptions about the data. Here, we focus on the granularity of data rank, i.e., the number of linearly independent EEG channels. When the data are rank-full (i.e., all channels are independent), ICA produces as many independent components (ICs) as the number of input channels (rank-full decomposition). However, when the input data are rank-deficient, as is the case with bridged or interpolated electrodes, ICA produces the same number of ICs as the data rank (forced rank deficiency decomposition), introducing undesired ghost ICs and indicating a bug in ICA. We demonstrated that the ghost ICs have white noise properties, in both time and frequency domains, while maintaining surprisingly typical scalp topographies, and can therefore be easily missed by EEG researchers and affect findings in unknown ways. This problem occurs when the minimum eigenvalue λ_min of the input data is smaller than a certain threshold, leading to matrix inversion failure as if the rank-deficient inversion was forced, even if the data rank is cleanly deficient by one. We defined this problem as the effective rank deficiency. Using sound file mixing simulations, we first demonstrated the effective rank deficiency problem and determined that the critical threshold for λ_min is 10^-7 in the given situation. Second, we used empirical EEG data to show how two preprocessing stages, re-referencing to average without including the initial reference and non-linear electrode interpolation, caused this forced rank deficiency problem. Finally, we showed that the effective rank deficiency problem can be solved by using the identified threshold (λ_min = 10^-7) and the correct re-referencing procedure described herein. The former ensures the achievement of effective rank-full decomposition by properly reducing the input data rank, and the latter allows avoidance of a widely practiced incorrect re-referencing approach. Based on the current literature, we discuss the ambiguous status of the initial reference electrode when re-referencing. We have made our data and code available to facilitate the implementation of our recommendations by the EEG community.
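
The eigenvalue criterion in this abstract can be illustrated with a small sketch. This is not the authors' released code: the function name `effective_rank`, the synthetic Gaussian "channels", and the noise level are assumptions made for illustration, and the 10^-7 threshold is taken from the abstract, where it is stated only for the authors' given situation.

```python
import numpy as np

def effective_rank(data, threshold=1e-7):
    """Count covariance eigenvalues above `threshold`.

    data: array of shape (n_channels, n_samples).
    The default threshold follows the abstract; it is not a universal constant.
    """
    eigvals = np.linalg.eigvalsh(np.cov(data))
    return int(np.sum(eigvals > threshold))

rng = np.random.default_rng(0)
data = rng.standard_normal((8, 5000))  # 8 synthetic, independent "channels"

# Average re-reference over the recorded channels only, i.e. without
# adding the initial reference back as an extra channel.
reref = data - data.mean(axis=0, keepdims=True)

# Channel 0 replaced by an exact linear mix of two other channels
# (a stand-in for linear interpolation).
interp = data.copy()
interp[0] = 0.5 * (data[1] + data[2])

# Channel 0 almost, but not exactly, a linear mix of two other channels.
near = interp.copy()
near[0] += 1e-5 * rng.standard_normal(5000)

for name, x in [("raw", data), ("re-referenced", reref),
                ("interpolated", interp), ("nearly dependent", near)]:
    print(name, effective_rank(x), np.linalg.matrix_rank(np.cov(x)))
```

In this toy setting the nearly dependent case is the interesting one: its smallest eigenvalue is tiny but non-zero, so a default numerical rank check can still report full rank while the thresholded count reports a deficiency of one. That gap is one way to read the abstract's notion of effective rank deficiency.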

Citations: 2

MRET: Multi-resolution transformer for video quality assessment

Frontiers in signal processing Pub Date: 2023-03-13 DOI: 10.3389/frsip.2023.1137006

Junjie Ke, Tian Zhang, Yilin Wang, P. Milanfar, Feng Yang

Citations: 0

Recording and analysing physical control variables used in clarinet playing: A musical instrument performance capture and analysis toolbox (MIPCAT)

Frontiers in signal processing Pub Date: 2023-02-10 DOI: 10.3389/frsip.2023.1089366

A. Almeida, Weicong Li, Emery Schubert, John Smith, J. Wolfe

Abstract: Measuring fine-grained physical interaction between the human player and the musical instrument can significantly improve our understanding of music performance. This article presents a Musical Instrument Performance Capture and Analysis Toolbox (MIPCAT) that can be used to capture and process the physical control variables used by a musician while performing music. It includes both a measurement apparatus with sensors and a software toolbox for analysis. Several of the components used here can also be applied in other musical contexts. The system is applied here to the clarinet, where the instrument sensors record blowing pressure, reed position, tongue contact, and sound pressures in the mouth, mouthpiece, and barrel. Radiated sound and multiple videos are also recorded to allow details of the embouchure and the instrument’s motion to be determined. The software toolbox can synchronise measurements from different devices, including video sources, extract time-variable descriptors, segment by notes and excerpts, and summarise descriptors per note, phrase, or excerpt. An example of its application shows how to compare performances from different musicians.

Citations: 0

Simultaneous segmentation of multiple structures in fundal images using multi-tasking deep neural networks

Frontiers in signal processing Pub Date: 2023-01-09 DOI: 10.3389/frsip.2022.936875

Sunil Kumar Vengalil, Bharath K. Krishnamurthy, N. Sinha

Abstract: Introduction: Fundal imaging is the most commonly used non-invasive technique for early detection of many retinal diseases such as diabetic retinopathy (DR). An initial step in automatic processing of fundal images for detecting diseases is to identify and segment the normal landmarks: the optic disc, blood vessels, and macula. In addition to these structures, other features such as exudates that help in pathological evaluations are also visible in fundal images. Segmenting features like blood vessels poses multiple challenges because of their fine-grained structure, which must be captured at the original resolution, and because they are spread across the entire retina with varying patterns and densities. Exudates appear as white patches of irregular shape that occur at multiple locations, and they can be confused with the optic disc if features like brightness or color are used for segmentation. Methods: Segmentation algorithms based solely on image processing involve multiple parameters and thresholds that need to be tuned. Another approach is to use machine learning models with hand-crafted features as inputs to segment the image. The challenge in this approach is to identify the correct features and then devise algorithms to extract them. End-to-end deep neural networks take raw images with minimal preprocessing, such as resizing and normalization, as inputs, learn a set of images in the intermediate layers, and then perform the segmentation in the last layer. These networks tend to have longer training and prediction times because of their complex architecture, which can involve millions of parameters. This also necessitates large numbers of training images (2,000 to 10,000). For structures like blood vessels and exudates that are spread across the entire image, one approach used to increase the training data is to generate multiple patches from a single training image, thus increasing the total number of training samples. Patch-based training cannot be applied to structures like the optic disc and fovea that appear only once per image. The prediction time is also longer because segmenting a full image involves segmenting multiple patches in the image. Results and Discussion: Most of the existing research has focused on segmenting these structures independently to achieve high performance metrics. In this work, we propose a multi-tasking deep learning architecture for segmenting the optic disc, blood vessels, macula, and exudates simultaneously. Both training and prediction are performed using the whole image. The objective was to improve the prediction results on blood vessels and exudates, which are relatively more challenging, while utilizing segmentation of the optic disc and the macula as auxiliary tasks. Our experimental results on images from publicly available datasets show that simultaneous segmentation of all these structures results in a significant improvement in performance. The proposed approach makes predictions of all f…

Citations: 0

Decimation keystone algorithm for forward-looking monopulse imaging on platforms with uniformly accelerated motion

Frontiers in signal processing Pub Date: 2023-01-05 DOI: 10.3389/frsip.2022.1074053

Ze Li, Yue Li

Abstract: Forward-looking imaging for maneuvering platforms has garnered significant interest in many military and civilian fields. Because the maneuvering trajectory within a scanning period can be approximated as a constant-acceleration maneuver, monopulse imaging is applied to enhance the azimuthal resolution of the forward-looking image. However, the maneuver causes severe range migration and Doppler shift; this often results in range location error due to the space-varying Doppler shifts and in the failure of angle estimation. We propose a decimation keystone algorithm based on the chirp-Z transform (CZT). First, the pulse repetition frequency (PRF) is decimated by an integer factor; thus, the azimuthal sampling sequence is decimated into many sub-sequences. Then, linear range walk correction (LRWC) is performed on each sub-sequence using the keystone transform, significantly reducing the influence of changes in the Doppler ambiguity number on range location. Further, the sub-sequences are regrouped as one sequence, and the range curvature due to the acceleration is compensated in the frequency domain. Finally, the varying Doppler centroid in each coherent processing interval (CPI) is analyzed and compensated for the sum-difference angular measurements. Simulation results demonstrate the effectiveness of the proposed algorithm for forward-looking imaging under constant acceleration maneuvers and the feasibility of range location error correction.
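
The decimation and regrouping steps named in this abstract can be sketched in a few lines. The sketch covers only the bookkeeping of splitting the slow-time (azimuthal) sequence into interleaved sub-sequences and merging them again; the keystone transform, CZT, and compensation steps are not shown. The function names, the PRF value, and the decimation factor are hypothetical and do not come from the paper.

```python
def decimate(pulses, factor):
    """Split a slow-time pulse sequence into `factor` interleaved sub-sequences."""
    return [pulses[k::factor] for k in range(factor)]

def regroup(subsequences):
    """Interleave the sub-sequences back into the original pulse order."""
    factor = len(subsequences)
    out = [None] * sum(len(s) for s in subsequences)
    for k, sub in enumerate(subsequences):
        out[k::factor] = sub
    return out

prf_hz = 4000.0            # hypothetical pulse repetition frequency
factor = 4                 # hypothetical integer decimation factor
pulses = list(range(10))   # stand-in for ten slow-time samples

subs = decimate(pulses, factor)
sub_prf_hz = prf_hz / factor   # each sub-sequence is sampled `factor` times more slowly
restored = regroup(subs)
```

Each sub-sequence keeps every `factor`-th pulse, so its effective PRF is the original PRF divided by the factor. In the pipeline described by the abstract, any per-sub-sequence processing would sit between the two calls.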

Citations: 0

Subject-invariant feature learning for mTBI identification using LSTM-based variational autoencoder with adversarial regularization

Frontiers in signal processing Pub Date: 2022-11-30 DOI: 10.3389/frsip.2022.1019253

Shiva Salsabilian, L. Najafizadeh

Abstract: Developing models for identifying mild traumatic brain injury (mTBI) has often been challenging due to large variations in data across subjects, which make it difficult for mTBI-identification models to generalize to data from unseen subjects. To tackle this problem, we present a long short-term memory-based adversarial variational autoencoder (LSTM-AVAE) framework for subject-invariant mTBI feature extraction. In the proposed model, first, an LSTM variational autoencoder (LSTM-VAE) combines the representation learning ability of the variational autoencoder (VAE) with the temporal modeling characteristics of the LSTM to learn the latent space representations from neural activity. Then, to detach the subject’s individuality from neural feature representations and make the model suitable for cross-subject transfer learning, an adversary network is attached to the encoder in a discriminative setting. The model is trained using a one-subject-held-out approach. The trained encoder is then used to extract the representations from the held-out subject’s data. The extracted representations are then classified into normal and mTBI groups using different classifiers. The proposed model is evaluated on cortical recordings of Thy1-GCaMP6s transgenic mice obtained via widefield calcium imaging, prior to and after inducing injury. In cross-subject transfer learning experiments, the proposed LSTM-AVAE framework achieves classification accuracies of 95.8% and 97.79%, without and with utilizing conditional VAE (cVAE), respectively, demonstrating that the proposed model is capable of learning invariant representations from mTBI data.
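
The held-out evaluation described in this abstract can be sketched as a plain data split. Only the split is shown, with no LSTM-AVAE training and no classifier. The function name `leave_one_subject_out` and the subject labels are hypothetical; the abstract does not say how the authors organise their folds.

```python
def leave_one_subject_out(subject_ids):
    """Yield (held_out, train_indices, test_indices), one fold per subject.

    subject_ids: one subject label per recording, in dataset order.
    """
    for held_out in sorted(set(subject_ids)):
        train = [i for i, s in enumerate(subject_ids) if s != held_out]
        test = [i for i, s in enumerate(subject_ids) if s == held_out]
        yield held_out, train, test

# Hypothetical labels: five recordings from three animals.
subject_ids = ["m1", "m1", "m2", "m3", "m3"]
folds = list(leave_one_subject_out(subject_ids))
for held_out, train, test in folds:
    print(held_out, train, test)
```

In every fold, no recording from the held-out subject appears in the training indices, which is the property that makes the reported accuracy a cross-subject result.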

Citations: 0

Performance analysis of code division multiplexing communication under evaporation duct environment

Frontiers in signal processing Pub Date: 2022-11-22 DOI: 10.3389/frsip.2022.1067055

Wenjing Liu, Xiqing Liu, Shi Yan, Ling Zhao, M. Peng

Citations: 0