2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA): Latest Papers

Speech enhancement by iterating forward pass through U-net
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date: 2020-09-23 DOI: 10.23919/spa50552.2020.9241307
Tomasz Grzywalski, S. Drgas
Abstract: In recent years, speech enhancement has shown great progress, driven mostly by bigger and more sophisticated neural networks. In this work, we investigate the possibility of taking a state-of-the-art speech enhancement neural network and modifying it so that it can process the noisy signal multiple times. By doing so, we expect the enhancement to improve with each iteration. Experiments conducted using the WSJ0, Noisex-92 and DCASE datasets show that a U-net with gated dilated convolutions achieves better SI-SDR, STOI and PESQ after processing the noisy signal two times, with the improvement being consistent across all SNRs and tested noise types. This is achieved without any additional trainable parameters and with no additional memory requirements compared to the baseline model.
Citations: 1
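The idea in the abstract above, running one enhancer over its own output and checking a metric after each pass, can be sketched in a few lines. This is only an illustration under stated assumptions: `iterate_forward_pass`, `toy_enhancer`, `CLEAN` and `NOISY` are hypothetical names, and the toy enhancer is not a U-net (it cheats by knowing the clean signal so that the example runs without a trained model). The `si_sdr` function follows the commonly used definition of scale-invariant SDR; how the paper actually modifies the network is not described in the abstract.

```python
import math


def si_sdr(estimate, target):
    """Scale-invariant signal-to-distortion ratio in dB (commonly used definition)."""
    dot = sum(e * t for e, t in zip(estimate, target))
    energy = sum(t * t for t in target)
    scale = dot / energy
    projected = [scale * t for t in target]  # part of the estimate aligned with the target
    error = [e - p for e, p in zip(estimate, projected)]
    return 10 * math.log10(sum(p * p for p in projected) / sum(r * r for r in error))


def iterate_forward_pass(enhance, noisy, passes=2):
    """Apply the same enhancer `passes` times, feeding each output back in."""
    signal = list(noisy)
    for _ in range(passes):
        signal = enhance(signal)
    return signal


# Toy data (assumption): the noise is orthogonal to the clean signal.
CLEAN = [1.0, -1.0, 1.0, -1.0]
NOISY = [1.5, -0.5, 0.5, -1.5]


def toy_enhancer(signal):
    """Hypothetical stand-in for a trained network: it knows CLEAN and
    halves the remaining noise, purely so the example is runnable."""
    return [(s + c) / 2 for s, c in zip(signal, CLEAN)]


# SI-SDR after 0, 1 and 2 passes through the same enhancer.
scores = [si_sdr(iterate_forward_pass(toy_enhancer, NOISY, n), CLEAN) for n in range(3)]
```

Because the same `enhance` callable is reused on every pass, the loop adds no parameters of its own, which mirrors the "no additional trainable parameters" claim only in spirit.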
Active tone elimination algorithm using FFT with interpolation and zero-padding
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date: 2020-09-23 DOI: 10.23919/spa50552.2020.9241255
Michal Luczynski, A. Dobrucki, S. Brachmański
Abstract: This article presents a method for eliminating tonal components from acoustic signals. Tonal components are quasi-periodic signals whose amplitude, frequency and phase can change slowly, either with a certain tendency or randomly. Active elimination of a single tonal component consists in adding a synthesized component with opposite polarity. The main challenge is to find a compromise between the frequency resolution and the delay resulting from the sample acquisition and the operation of the reset signal generation algorithm. The elimination is realized in three stages: detection of the tonal components' parameters, synthesis of the cancelling signal, and addition of the cancelling signal to the input signal. Detection is carried out using the FFT, whose resolution is increased by means of time windows, spectrum interpolation and zero-padding. Various methods of synthesizing the cancelling signal were also checked. Detection errors were analyzed in comparison with the standard FFT. An elimination simulation was also carried out to analyze the effectiveness of the reduction. The result of the work is an evaluation of the method for application in the elimination of tonal components of acoustic signals as well as in systems for active reduction of narrowband noise.
Citations: 2
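The three stages named in the abstract above (detection, synthesis, addition) can be sketched offline as follows. This is a minimal sketch under several assumptions, not the authors' algorithm: a naive DFT stands in for the FFT to keep the example dependency-free, parabolic interpolation of the log-magnitude peak is one possible form of spectrum interpolation, amplitude and phase are obtained by a least-squares fit (a swapped-in technique), and the sample rate, tone frequency and all function names are hypothetical. It processes a whole frame at once, so it ignores the delay trade-off the abstract identifies as the main challenge.

```python
import cmath
import math


def estimate_tone_frequency(x, fs, pad_factor=4):
    """Stage 1: estimate the frequency (Hz) of the strongest tone in x."""
    n = len(x)
    m = n * pad_factor  # zero-padded transform length
    # Hann window limits leakage from the finite frame.
    windowed = [x[i] * (0.5 - 0.5 * math.cos(2 * math.pi * i / n)) for i in range(n)]
    log_mag = []
    for k in range(m // 2):
        # Naive DFT bin; the padded zeros contribute nothing to the sum.
        acc = sum(windowed[i] * cmath.exp(-2j * math.pi * k * i / m) for i in range(n))
        log_mag.append(math.log(abs(acc) + 1e-12))
    # Skip the first and last bin so both neighbours of the peak exist.
    peak = max(range(1, m // 2 - 1), key=lambda k: log_mag[k])
    left, mid, right = log_mag[peak - 1], log_mag[peak], log_mag[peak + 1]
    # Vertex of the parabola through the three points around the peak.
    offset = 0.5 * (left - right) / (left - 2 * mid + right)
    return (peak + offset) * fs / m


def fit_tone(x, fs, freq):
    """Least-squares fit of a*cos + b*sin at a given frequency."""
    w = 2 * math.pi * freq / fs
    c = [math.cos(w * i) for i in range(len(x))]
    s = [math.sin(w * i) for i in range(len(x))]
    cc = sum(v * v for v in c)
    ss = sum(v * v for v in s)
    cs = sum(u * v for u, v in zip(c, s))
    xc = sum(u * v for u, v in zip(x, c))
    xs = sum(u * v for u, v in zip(x, s))
    det = cc * ss - cs * cs
    a = (xc * ss - xs * cs) / det
    b = (xs * cc - xc * cs) / det
    return a, b, c, s


def cancel_tone(x, fs):
    """Stages 2 and 3: synthesize the opposite-polarity tone and add it."""
    freq = estimate_tone_frequency(x, fs)
    a, b, c, s = fit_tone(x, fs, freq)
    cancelling = [-(a * ci + b * si) for ci, si in zip(c, s)]
    residual = [xi + ki for xi, ki in zip(x, cancelling)]
    return freq, residual


def rms(values):
    return math.sqrt(sum(v * v for v in values) / len(values))


FS = 8000.0      # hypothetical sample rate in Hz
TONE_HZ = 440.3  # hypothetical tone, deliberately off the bin grid
signal = [0.8 * math.cos(2 * math.pi * TONE_HZ * i / FS + 0.7) for i in range(256)]
estimated_hz, residual = cancel_tone(signal, FS)
```

With a 256-sample frame the unpadded bin spacing is 31.25 Hz, so the windowing, zero-padding and interpolation steps are what let the estimate land between bins.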
CCTV based system for detection of anti-virus masks
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date: 2020-09-23 DOI: 10.23919/spa50552.2020.9241303
K. Podbucki, J. Suder, T. Marciniak, A. Dabrowski
Abstract: This paper presents the use of neural networks to detect people wearing anti-virus masks. The algorithm determines whether a person is wearing a mask correctly (nose and mouth covered). The neural-network approach has been compared with typical Haar cascade frontal face solutions available in the OpenCV library. The proposed solution has been checked for efficiency and precision, as well as for the minimum resolution requirements of the resulting facial image. The software works on still images as well as on video sequences from computer webcams and CCTV cameras.
Citations: 7
Algorithm for Human Fall Detection Based on Acceleration Measurement
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date: 2020-09-23 DOI: 10.23919/spa50552.2020.9241243
Barbara Wilk, M. Augustyn, G. Wilk
Abstract: According to the World Health Organization, a fall is defined as an unexpected event in which a participant comes to rest on the ground, floor, or lower level. Falls are among the most serious life-threatening events. Automatic detection of a fall can reduce the time until medical attention arrives and the consequences of prolonged lying after a fall. In this paper, a novel algorithm is presented for human fall detection based on acceleration measurement using a 3-axis sensor placed in the pocket. The algorithm was tested on two data sets with simulated falls and various daily activities. The results show that the proposed algorithm achieves a sensitivity of 93% and a specificity of 94.5% at the same time. These values are much higher than those currently reported in the literature.
Citations: 0
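The sensitivity and specificity figures quoted in the abstract above are computed from a confusion matrix over labelled recordings. The sketch below shows that calculation on toy data. The detector itself is a hypothetical stand-in (a single threshold on peak acceleration magnitude), since the abstract does not describe the proposed algorithm; the threshold value, the recordings and all names are assumptions made for illustration.

```python
import math


def magnitude(sample):
    """Euclidean norm of one 3-axis acceleration sample (in g)."""
    ax, ay, az = sample
    return math.sqrt(ax * ax + ay * ay + az * az)


def is_fall(recording, threshold_g=2.5):
    """Hypothetical stand-in detector: flag a fall when the peak
    acceleration magnitude exceeds an assumed threshold."""
    return max(magnitude(s) for s in recording) > threshold_g


def sensitivity_specificity(predictions, labels):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    tp = sum(p and l for p, l in zip(predictions, labels))
    tn = sum((not p) and (not l) for p, l in zip(predictions, labels))
    fp = sum(p and (not l) for p, l in zip(predictions, labels))
    fn = sum((not p) and l for p, l in zip(predictions, labels))
    return tp / (tp + fn), tn / (tn + fp)


# Toy recordings (assumed values): two simulated falls, two daily activities.
recordings = [
    [(0.0, 0.0, 1.0), (1.5, 2.0, 2.0), (0.0, 0.0, 1.0)],  # hard fall
    [(0.0, 0.0, 1.0), (0.5, 1.0, 1.5)],                   # soft fall, missed by the threshold
    [(0.1, 0.2, 1.1), (0.0, 0.1, 0.9)],                   # walking
    [(0.0, 0.0, 1.0), (0.3, 0.4, 1.6)],                   # sitting down
]
labels = [True, True, False, False]

predictions = [is_fall(r) for r in recordings]
sensitivity, specificity = sensitivity_specificity(predictions, labels)
```

On this toy set the threshold misses the soft fall, which lowers sensitivity while leaving specificity untouched; reporting both numbers together, as the abstract does, exposes that trade-off.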
Speaker verification with TIMIT corpus - some remarks on classical methods
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date: 2020-09-23 DOI: 10.23919/spa50552.2020.9241298
A. Dustor
Abstract: The aim of this paper is to present research on a speaker verification system based on the Gaussian Mixture Model-Universal Background Model (GMM-UBM) approach. All tests were done on the TIMIT corpus. Performance for the standard Mel-Frequency Cepstral Coefficients (MFCC) and dynamic delta features is shown. The influence of feature dimensionality and model complexity on the Equal Error Rate (EER) is presented. Additionally, the impact of Voice Activity Detection (VAD) and normalization techniques such as Cepstral Mean and Variance Normalization (CMVN) and RelAtive SpecTrA (RASTA) filtering is covered. Each combination of factors was examined. It is shown that careful selection of traditional techniques may lead to very satisfying EER values.
Citations: 0
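The Equal Error Rate used as the figure of merit in the abstract above is the operating point where the false acceptance rate equals the false rejection rate. A minimal sketch of how it can be approximated from verification scores follows. The scores are invented toy values, `equal_error_rate` is a hypothetical helper, and sweeping only the observed scores as thresholds is a simplification (real evaluations often interpolate between points).

```python
def equal_error_rate(genuine, impostor):
    """Approximate the EER by sweeping every observed score as a threshold.

    A trial is accepted when its score is >= threshold.
    Returns (eer, threshold) at the point where FAR and FRR are closest.
    """
    best = None
    for threshold in sorted(set(genuine) | set(impostor)):
        far = sum(s >= threshold for s in impostor) / len(impostor)  # false acceptance rate
        frr = sum(s < threshold for s in genuine) / len(genuine)     # false rejection rate
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, threshold)
    return best[1], best[2]


# Toy scores (assumed): higher means "more likely the claimed speaker".
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.6, 0.3, 0.2, 0.1]

eer, eer_threshold = equal_error_rate(genuine_scores, impostor_scores)
```

Here one impostor scores above one genuine trial, so no threshold separates the two sets perfectly and the closest balance point has both error rates at 25%.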
The SMART4ALL toolbox for boosting technology and business development in South, Eastern and Central Europe
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date: 2020-09-23 DOI: 10.23919/spa50552.2020.9241304
G. Keramidas
Abstract: SMART4ALL is a four-year Innovation Action project funded under the Horizon 2020 framework under call DT-ICT-01-2019: Smart Anything Everywhere – Area 2: Customized low energy computing powering CPS and the IoT. The target of the project is to establish a unique pan-European network of Digital Innovation Hubs that will not only support innovation and reveal business opportunities across South, Eastern and Central Europe, but will also build capacity via the development of self-sustained, cross-border pathfinder application experiments. The project will provide total funding of 2.2 million euros via 9 open calls and will support 88 cross-border pathfinder application experiments from European consortia. Each experiment will receive funding of up to €80,000 and will be supported with novel coaching services from world-leading experts in ethics, technology, funding and business development. Apart from this, SMART4ALL puts forward a unique concept called Marketplace-as-a-Service (MaaS). MaaS is a one-stop-smart-shop for startups, SMEs and slightly bigger companies. This presentation will introduce SMART4ALL and concentrate on the “Prepare for Growth” services offered by MaaS.
Citations: 0
Analytic Design of Uniform Circular Filter Banks
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date: 2020-09-23 DOI: 10.23919/spa50552.2020.9241281
R. Matei
Abstract: This work proposes an analytic design method for a particular class of 2D filter banks with a circular shape in the frequency plane and a maximally flat characteristic. The circular filters comprising the filter bank are derived from a zero-phase prototype by applying specific frequency mappings; they are efficient, of relatively low order, and have good selectivity. Their transfer functions can be expressed in closed form and depend explicitly on a parameter given by the specified number of filters, e.g. in the case of a uniform circular filter bank. A design example is provided for an imposed specification, and the designed filter bank is tested by filtering a given texture image.
Citations: 2
Artificial intelligence application to improve the performance of distance learning servers during the coronavirus pandemic threat period
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date: 2020-09-23 DOI: 10.23919/spa50552.2020.9241301
P. Kłosowski
Abstract: As of early 2020, almost all countries are fighting the coronavirus pandemic by implementing the rigorous restrictions recommended by the World Health Organisation to reduce the number of infections as much as possible. The restrictions also apply to members of the academic community. Universities have suspended all teaching activities except online ones, and most have introduced solutions for distance learning. However, organising distance education requires appropriate technological infrastructure. Providing the right IT infrastructure is not easy, because network devices and network servers are recording record-breaking peak loads during this time. There appear to be many possibilities for using artificial intelligence to improve the performance of distance learning platforms, information systems and network infrastructure in this difficult and demanding period. Examples of such applications are presented in this article. The paper shows that artificial intelligence can potentially improve the operation of distance learning servers in many areas. The application of artificial neural networks and LSTM neural networks for this purpose seems very promising. The sample experiments and results presented in this article seem to confirm this thesis.
Citations: 3
Robust Beamforming Method Based on Double-layer Reconstruction of Covariance Matrix
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date: 2020-09-23 DOI: 10.23919/spa50552.2020.9241257
Cao Silei, Li Tianyu, Wang Yao
Abstract: Focusing on the problem that the performance of a traditional adaptive beamformer declines sharply when the covariance matrix contains the target signal component and the target steering vector is mismatched, this paper proposes a robust beamforming algorithm based on double-layer reconstruction of the interference-plus-noise covariance matrix. First, a sparse reconstruction method is used to estimate the interference-plus-noise covariance matrix. The matrix is then optimized by estimating the interference steering vector and interference power. Second, based on subspace theory, an optimization model of the steering vector is established, and the convex optimization model is solved iteratively to obtain the optimal weight vector. The simulation results show that the proposed algorithm improves the robustness of the beamformer in the case of target vector constraint error and array error. The algorithm also performs well with a low number of snapshots, and its output performance is better than that of current methods.
Citations: 1
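For orientation, the quantities named in the abstract above fit the textbook minimum-variance (Capon) beamformer, written out below. This is the standard formulation only, not the paper's double-layer reconstruction, and the symbols are assumptions introduced here: $\mathbf{x}(k)$ is the array snapshot, $K$ the number of snapshots, $\mathbf{a}(\theta)$ the steering vector, $\theta_0$ the target direction, $\theta_l$ and $\sigma_l^2$ the direction and power of the $l$-th of $L$ interferers, and $\sigma_n^2$ the noise power.

```latex
% Sample covariance: contains the target component when the target is present
\hat{\mathbf{R}} = \frac{1}{K} \sum_{k=1}^{K} \mathbf{x}(k)\,\mathbf{x}^{H}(k)

% Interference-plus-noise covariance model that reconstruction methods aim to estimate
\mathbf{R}_{i+n} = \sum_{l=1}^{L} \sigma_l^{2}\,\mathbf{a}(\theta_l)\,\mathbf{a}^{H}(\theta_l) + \sigma_n^{2}\,\mathbf{I}

% Minimum-variance distortionless-response weight vector
\mathbf{w} = \frac{\mathbf{R}_{i+n}^{-1}\,\mathbf{a}(\theta_0)}{\mathbf{a}^{H}(\theta_0)\,\mathbf{R}_{i+n}^{-1}\,\mathbf{a}(\theta_0)}
```

Using $\hat{\mathbf{R}}$ in place of $\mathbf{R}_{i+n}$, or a mismatched $\mathbf{a}(\theta_0)$, lets the beamformer treat the target as interference and suppress it, which is the failure mode the abstract sets out to address.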
Determination of Low-Level Audio Descriptors of a Musical Instrument Sound Using Neural Network
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date: 2020-09-23 DOI: 10.23919/spa50552.2020.9241264
Maciej Blaszke, Damian Koszewski
Abstract: Audio files and the audio channel of video files can be described with temporal, spectral, cepstral, and perceptual audio descriptors. The so-called low-level descriptors are closely related to the signal characteristics. One can discern at least three levels of extraction granularity from the signal: at any point in the signal, in small arbitrary regions (i.e., frames), and in longer pre-segmented regions. Even though tools (e.g., MIRToolbox, Python/libROSA) are available for computing these descriptors, the resulting feature vector is always redundant, as it contains many highly correlated descriptors, and the performance of these tools has some limitations. That is why this study proposes a method for obtaining those descriptors using an Artificial Neural Network (ANN) with a deep structure (i.e., a DNN). In such a scheme, the raw audio signal representing a given musical instrument is fed to the DNN input. Such a network can be used as a standalone module or as a pre-trained part of a bigger architecture. The results of deep network performance in the context of MPEG-7 descriptor derivation are shown along with the loss function convergence and behavior.
Citations: 1