2022 IEEE International Conference on Signal Processing and Communications (SPCOM): Latest Papers

A Hierarchical Approach for Decoding Human Reach-and-Grasp Activities based on EEG Signals
2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840794
Bhagyasree Kanuparthi, A. Turlapaty
Abstract: Physically disabled patients, such as people with paralysis, amputees, and stroke patients, find it difficult to perform daily activities on their own. A Brain-Computer Interface (BCI) using Electroencephalography (EEG) signals is an option for the rehabilitation of these patients. BCI function can be enhanced by decoding limb movements for intuitive control of a prosthetic arm. However, decoding these movements with traditional classifiers is challenging. In this paper, a two-stage hierarchical framework is proposed for decoding reach-and-grasp actions. In stage 1, action signals are separated from rest segments using power spectral density features and a fine k-nearest neighbor (FKNN) classifier. In stage 2, the signals identified as action are further classified into palmar and lateral reach-and-grasp actions using mean absolute value features with the FKNN classifier. Compared with existing classifiers, the proposed method performs better, reaching 85.38% test accuracy.
Citations: 0
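The two-stage scheme described in the abstract above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: a 1-nearest-neighbor rule stands in for the FKNN classifier, average signal power stands in for the power spectral density features, and every function name, label, and training value is hypothetical.

```python
import math


def mean_absolute_value(window):
    """Mean absolute value (MAV) of one signal window."""
    return sum(abs(x) for x in window) / len(window)


def mean_power(window):
    """Average signal power; an assumed stand-in for PSD-based features."""
    return sum(x * x for x in window) / len(window)


def nearest_neighbor(train, query):
    """Return the label of the training point closest to `query`.

    `train` is a list of (feature_vector, label) pairs.
    """
    closest = min(train, key=lambda item: math.dist(item[0], query))
    return closest[1]


def decode(window, stage1_train, stage2_train):
    """Stage 1 separates rest from action; stage 2 picks the grasp type."""
    if nearest_neighbor(stage1_train, [mean_power(window)]) == "rest":
        return "rest"
    return nearest_neighbor(stage2_train, [mean_absolute_value(window)])


# Hypothetical one-dimensional training sets for each stage.
STAGE1_TRAIN = [([0.01], "rest"), ([0.5], "action")]
STAGE2_TRAIN = [([0.5], "palmar"), ([1.5], "lateral")]
```

Only windows that stage 1 labels as action ever reach stage 2, so the grasp-type classifier never has to model rest segments.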
Glottal instants extraction from speech signal using Deep Feature Loss
2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840808
Supritha M. Shetty, Suraj Durgesht, K. Deepak
Abstract: An electroglottograph (EGG) is a device used to measure the conductance between the vocal folds. Analysis of the EGG signal has many applications in the literature, such as speech-to-text synthesis, voice disorder analysis, emotion recognition, and speaker verification. An EGG device is therefore normally essential for recording vocal fold activity. As an alternative, this work proposes a new method to synthesize the EGG waveform from the speech signal using a context aggregation convolutional neural network. The synthesis network is trained with deep feature losses obtained by comparison with a second network, the EGG classification network. The synthesized EGG signal then needs to be characterized. During voiced speech production, the instants at which the vocal folds attain complete closure are called glottal closure instants (GCIs); likewise, the opening instants are called glottal opening instants (GOIs). Such instants are reliably measured using the EGG signal. The CMU-Arctic database, a parallel corpus of speech and EGG signals recorded simultaneously, is used for training the synthesis network and for comparison. The performance of the proposed method is compared with other state-of-the-art techniques, and extracting glottal instants from synthesized EGG signals is found to perform comparably to other methods.
Citations: 1
Computer-aided Cataract Grading Under Adversarial Environment
2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840821
T. Pratap, Priyanka Kokil
Abstract: Cataract is the most common cause of blindness in the world. Early detection and treatment can lower the risk of cataract progression. The diagnostic performance of existing computer-aided cataract grading (CACG) methods often deteriorates due to the sophisticated image capture technology. Common retinal fundus image aberrations such as noise and blur are unavoidable in practice. In this paper, a CACG method is proposed to achieve robust cataract grading under adversarial conditions such as noise and blur. The presented CACG method is designed using three deep neural network variants. Each variant is fine-tuned individually using good, noisy, and blurred retinal fundus images to achieve optimum performance. Further, an input image quality detection module is incorporated to detect input image distortion and route the input image to the appropriate deep neural network variant. Gaussian noise and blur models are used to evaluate the effectiveness of the method. The proposed CACG approach exhibits superior performance to existing methods under adversarial conditions.
Citations: 1
SPCOM 2022 Cover Page
2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI: 10.1109/spcom55316.2022.9840800
Citations: 0
Temporal Surgical Gesture Segmentation and Classification in Multi-gesture Robotic Surgery using Fine-tuned features and Calibrated MS-TCN
2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840779
Snigdha Agarwal, Chakka Sai Pradeep, N. Sinha
Abstract: Temporal gesture segmentation is an active research problem with applications such as surgical skill assessment, surgery training, and robotic training. In this paper, we propose a novel two-step method for gesture segmentation on untrimmed surgical videos of the challenging JIGSAWS dataset. We train and evaluate our method on 39 videos of the Suturing task, which has 10 gestures. Gesture lengths range from 1 second to 75 seconds, and full video lengths vary from 1 minute to 5 minutes. In step one, we extract encoded frame-wise spatio-temporal features at the full temporal resolution of the untrimmed videos. In step two, we use these extracted features to identify gesture segments for temporal segmentation and classification. To extract high-quality features from the surgical videos, we also pre-train gesture classification models using transfer learning on the JIGSAWS dataset with two state-of-the-art pretrained backbone architectures. For segmentation, we propose an improved calibrated MS-TCN (CMS-TCN) that introduces a smoothed focal loss as the loss function, which helps regularize our TCN to avoid over-confident decisions. We achieve a frame-wise accuracy of 89.8% and an Edit Distance score of 91.5%, an improvement of 2.2% over previous works. We also propose a novel evaluation metric that normalizes the effect of correctly classifying the frames of larger segments versus smaller segments in a single score.
Citations: 1
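The abstract above reports both a frame-wise accuracy and an Edit Distance score. The sketch below shows how two such metrics can disagree. It assumes the segmental edit score commonly used in action segmentation work (Levenshtein distance between the sequences of segment labels, normalized by the longer sequence); the abstract does not spell out the exact definition, and the function names and toy gesture labels are hypothetical.

```python
def segments(frame_labels):
    """Collapse per-frame labels into the ordered list of segment labels."""
    collapsed = []
    for label in frame_labels:
        if not collapsed or collapsed[-1] != label:
            collapsed.append(label)
    return collapsed


def levenshtein(a, b):
    """Edit distance between two label sequences (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        current = [i]
        for j, y in enumerate(b, 1):
            current.append(min(previous[j] + 1,              # delete x
                               current[j - 1] + 1,           # insert y
                               previous[j - 1] + (x != y)))  # substitute
        previous = current
    return previous[-1]


def edit_score(predicted, truth):
    """Segment-level score in percent; penalizes over-segmentation."""
    p, t = segments(predicted), segments(truth)
    return 100.0 * (1 - levenshtein(p, t) / max(len(p), len(t)))


def frame_accuracy(predicted, truth):
    """Percentage of frames whose predicted label matches the truth."""
    hits = sum(p == t for p, t in zip(predicted, truth))
    return 100.0 * hits / len(truth)


TRUTH = ["G1"] * 4 + ["G2"] * 4
SHIFTED = ["G1"] * 3 + ["G2"] * 5                              # boundary off by one frame
FLICKER = ["G1", "G1", "G2", "G1", "G2", "G2", "G2", "G2"]     # spurious extra segments
```

Both toy predictions get the same frame-wise accuracy, but only the flickering one loses edit score, which is why the two numbers are usually reported together.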
Morse Wavelet Features for Pop Noise Detection
2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840840
Priyanka Gupta, Piyushkumar K. Chodingala, H. Patil
Abstract: Spoofed Speech Detection (SSD) has been an important problem, especially for Automatic Speaker Verification (ASV) systems. However, the techniques used to design countermeasure systems for the SSD task are attack-specific, so the solutions are far from a generalized SSD system that can detect any type of spoofed speech. Voice Liveness Detection (VLD) systems, on the other hand, rely on the characteristics of live speech (i.e., pop noise) to detect whether an utterance is live or not. Given that the attacker has the freedom to mount any type of attack, VLD systems play a crucial role in defending against spoofing attacks, irrespective of the type of spoof used by the attacker. To that effect, we propose Generalized Morse Wavelet (GMW)-based features for VLD, with a Convolutional Neural Network (CNN) as the back-end classifier. In this context, we use pop noise as a discriminative acoustic cue to detect live speech. Pop noise is present in live speech signals at low frequencies (typically $\leq 40$ Hz) and is caused by human breath reaching a closely placed microphone. We show that for $\gamma = 3$, the Morse wavelet has the highest concentration of information, denoted by the least area of the Heisenberg box. Hence, we take $\gamma = 3$ for our experiments on Morse wavelets. We compare the performance of our system with the Short-Time Fourier Transform (STFT)-Support Vector Machine (SVM)-based original baseline and with other existing systems, such as Constant Q-Transform (CQT)-SVM, STFT-CNN, and bump wavelet-CNN. With an overall accuracy of 86.90% on the evaluation set, our proposed system significantly outperforms the STFT-SVM-based original baseline, CQT-SVM, STFT-CNN, and bump wavelet-CNN by absolute margins of 18.97%, 8.02%, 15.09%, and 12.21%, respectively. Finally, we also analyze the effect of various phoneme types on VLD system performance.
Citations: 3
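The abstract above gives the proposed system's accuracy and its absolute margins over four comparison systems, but not the comparison accuracies themselves. Assuming "absolute margin" means a difference in percentage points, they can be recovered by subtraction; the short sketch below does that arithmetic. The variable and function names are hypothetical, and the derived figures are implied values, not numbers quoted from the paper.

```python
PROPOSED_ACCURACY = 86.90  # percent, evaluation set, as stated in the abstract

# Absolute margins (percentage points) as stated in the abstract.
ABSOLUTE_MARGINS = {
    "STFT-SVM (original baseline)": 18.97,
    "CQT-SVM": 8.02,
    "STFT-CNN": 15.09,
    "bump wavelet-CNN": 12.21,
}


def implied_accuracies(proposed, margins):
    """Subtract each margin from the proposed accuracy (assumes percentage points)."""
    return {name: round(proposed - margin, 2) for name, margin in margins.items()}


IMPLIED = implied_accuracies(PROPOSED_ACCURACY, ABSOLUTE_MARGINS)
for system, accuracy in IMPLIED.items():
    print(f"{system}: {accuracy:.2f}%")
```

Under that assumption the comparison systems would sit at roughly 67.93%, 78.88%, 71.81%, and 74.69%.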
Integrated Hierarchical and Flat Classifiers for Food Image Classification using Epistemic Uncertainty
2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840761
Vishwesh Pillai, Pranav Mehar, M. Das, Deep Gupta, P. Radeva
Abstract: Food image recognition is an essential problem in today’s context because health conditions such as diabetes, obesity, and heart disease require constant monitoring of a person’s diet. To automate this process, several models are available to recognize food images. Because of the considerable number of unique food dishes and cuisines, a traditional flat classifier does not perform well. To address this issue, prediction schemes combine flat and hierarchical classifiers and use an analysis of epistemic uncertainty to switch between them. However, the accuracy of the predictions made using epistemic uncertainty data remains considerably low. Therefore, this paper presents a prediction scheme using three different threshold criteria that help to increase the accuracy of epistemic uncertainty predictions. The performance of the proposed method is demonstrated through several experiments on the MAFood-121 dataset. The experimental results validate the performance of the proposal and show that the proposed threshold criteria help to increase the overall accuracy of the predictions by correctly classifying the uncertainty distribution of the samples.
Citations: 1
C-Band Iris Coupled Cavity Bandpass Filter
2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840777
Shashank Soi, Sudheer Kumar Singh, Rajendra Singh, Ashok Kumar
Abstract: This paper presents the design of a compact, tunable, high-rejection 6th-order C-Band Iris Coupled Cavity Bandpass Filter. The design approach uses Chebyshev low-pass filter prototype elements to calculate the normalized capacitance per unit length between the resonators and ground and between adjacent resonators. With the help of coupling and tuning screws, the bandwidth and center frequency of the filter can be tuned for the desired performance. The coaxial capacitance formula is used to compute the diameter of the screws. The CST tool is used to simulate and optimize the theoretically calculated physical dimensions to further improve filter performance and obtain better tolerance sensitivity. The cavity design and resonator calculations have been carried out so that the same hardware can be tuned to both frequency bands, i.e., 4.4-4.6 GHz (Band I) and 4.8-5.0 GHz (Band II), to meet the desired specifications. Finally, a 6th-order prototype is fabricated and tuned to obtain the desired performance, and experimental validation is presented.
Citations: 0
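To put the two passbands from the abstract above in perspective, the sketch below computes each band's center frequency, bandwidth, and fractional bandwidth. It assumes the arithmetic mean of the band edges as the center frequency (filter design texts sometimes use the geometric mean instead); the names are hypothetical, and nothing here reproduces the authors' design calculations.

```python
# Passband edges in GHz, as stated in the abstract.
BANDS_GHZ = {"Band I": (4.4, 4.6), "Band II": (4.8, 5.0)}


def band_summary(f_low, f_high):
    """Return (center, bandwidth, fractional bandwidth) for one passband.

    The center is the arithmetic mean of the edges, which is an assumption.
    """
    center = (f_low + f_high) / 2
    bandwidth = f_high - f_low
    return center, bandwidth, bandwidth / center


for name, (f_low, f_high) in BANDS_GHZ.items():
    center, bandwidth, fractional = band_summary(f_low, f_high)
    print(f"{name}: center {center:.2f} GHz, "
          f"bandwidth {bandwidth * 1000:.0f} MHz, "
          f"fractional bandwidth {fractional * 100:.1f}%")
```

Both bands are 200 MHz wide, with fractional bandwidths of roughly 4.4% and 4.1%, and their centers are 400 MHz apart, which is the range the tuning screws would need to cover.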
Binary Intelligent Reflecting Surfaces Assisted OFDM Systems
2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840791
L. Yashvanth, C. Murthy, B. Deepak
Abstract: Intelligent reflecting surfaces (IRSs) enhance the performance of wireless systems by reflecting incoming signals towards a desired user, especially in the mmWave bands. However, this requires optimizing the discrete reflection coefficients of the IRS elements, which depends crucially on the availability of accurate channel state information (CSI) for all links in the system. Further, in wideband systems employing orthogonal frequency division multiplexing (OFDM), a given IRS configuration cannot be simultaneously optimal for all subcarriers, so the phase optimization is not straightforward. In this paper, we propose a novel IRS phase configuration scheme for OFDM systems. We first leverage the sparsity of the channel in the angular domain to estimate the CSI using the simultaneous orthogonal matching pursuit (SOMP) algorithm, and then devise a novel and computationally efficient binary IRS phase configuration algorithm using majorization-minimization (MM). Simulation results illustrate the efficacy of the approach in comparison with the state of the art.
Citations: 0
A unified neural MRA architecture combining wavelet CNN and wavelet pooling for texture classification
2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI: 10.1109/SPCOM55316.2022.9840760
K. K. Tarafdar, Q. Saifee, V. Gadre
Abstract: This paper introduces a novel unified neural Multi-Resolution Analysis (MRA) architecture that uses a Discrete Wavelet Transform (DWT)-integrated Convolutional Neural Network (CNN) along with DWT pooling. Because convolution followed by pooling in a CNN is equivalent to filtering followed by downsampling in a DWT filter bank, the two are unified to form an end-to-end deep learning wavelet CNN model. The DWT pooling mechanism is also used to further enhance the MRA capability of this wavelet CNN. Using the first two wavelets of the Daubechies family, we present a comprehensive set of improved texture classification results with several updates to the model architecture. These updates to the CNN model architecture apply to any node generally associated with the time-frequency analysis of the input signal.
Citations: 0
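The abstract above rests on the observation that filtering plus downsampling in a DWT filter bank mirrors convolution plus pooling in a CNN. A minimal sketch of that correspondence for the Haar wavelet (db1, the first member of the Daubechies family) follows. It is a one-dimensional illustration with a hypothetical function name, not the authors' architecture, and the sign convention of the detail filter varies between libraries.

```python
import math


def haar_dwt_level(signal):
    """One level of the Haar DWT on an even-length sequence.

    Each output sample is a 2-tap filter applied at stride 2, i.e. the
    filtering and the downsampling are fused exactly like a stride-2
    convolution with a fixed kernel.
    """
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    scale = 1 / math.sqrt(2)
    pairs = [(signal[i], signal[i + 1]) for i in range(0, len(signal), 2)]
    approximation = [scale * (a + b) for a, b in pairs]  # low-pass branch
    detail = [scale * (a - b) for a, b in pairs]         # high-pass branch
    return approximation, detail


APPROX, DETAIL = haar_dwt_level([1.0, 1.0, 2.0, 2.0])
```

The approximation branch halves the resolution the way a pooling layer does, while the detail branch keeps the information that ordinary average pooling would discard.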