{"title":"An Unsupervised Sequence-to-Sequence Autoencoder Based Human Action Scoring Model","authors":"Hiteshi Jain, Gaurav Harit","doi":"10.1109/GlobalSIP45357.2019.8969424","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969424","url":null,"abstract":"Developing a model for the task of assessing quality of human action is a key research area in computer vision. The quality assessment task has been posed as a supervised regression problem, where models have been trained to predict score, given action representation features. However, human proficiency levels can widely vary and so do their scores. Providing all such performance variations and their respective scores is an expensive solution as it requires a domain expert to annotate many videos. The question arises - Can we exploit the variations of the performances from that of expert and map the variations to their respective scores? To this end, we introduce a novel sequence-to-sequence autoencoder-based scoring model which learns the representation from only expert performances and judges an unknown performance based on how well it can be regenerated from the learned model. We evaluated our model in predicting scores of a complex Sun- Salutation action sequence, and demonstrate that our model gives remarkable prediction accuracy compared to the baselines.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130925203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cramér-Rao Bound for Joint Angle and Delay Estimators by Partial Relaxation","authors":"Ahmad Bazzi","doi":"10.1109/GlobalSIP45357.2019.8969286","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969286","url":null,"abstract":"Novel Fisher-Information Matrix (FIM) and Cramér-Rao Bound (CRB) expressions for the problem of the \"partially relaxed\" Joint Angle and Delay Estimation (JADE) are derived and analyzed in this paper. In particular, exact closed form expressions of the CRB on the Angles and Times of Arrival of multiple sources are presented. Furthermore, interesting asymptotic and desirable properties are demonstrated, such as high SNR behaviour and lower bound expressions on the CRBs of Angles and Times of Arrival of multiple sources. Computer simulations are also given to visualize CRB behaviour in regimes of interest.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134397869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bring Light to the Night: Classifying Thermal Image Via Convolutional Neural Network Based on Visible Domain Transformation","authors":"G. Lu","doi":"10.1109/GlobalSIP45357.2019.8969076","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969076","url":null,"abstract":"Most existing vision systems target at processing images captured during the day time. However, it is also essential to enable cameras to see the scenes during the night, such as in outdoor places where no light exists and power outage in indoor environments. We capture thermal images to observe objects in the dark environment. Based on the captured thermal images, we develop a convolutional neural network to classify the images. As thermal images require to invest a substantial amount of time to create clear images, we also rely on color images to enrich the training samples and apply transfer learning to refine the CNN classification models. The visible source domain network is learned together with a decoding network to enforce the source domain learning outcome resembling the target thermal domain properties.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"117 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132034257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Ising Dropout with Node Grouping for Training and Compression of Deep Neural Networks","authors":"H. Salehinejad, Zijian Wang, S. Valaee","doi":"10.1109/GlobalSIP45357.2019.8969121","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969121","url":null,"abstract":"Dropout is a popular regularization method to reduce over-fitting while training deep neural networks and compress the inference model. In this paper, we propose Ising dropout with node grouping, which represents a deep multilayer perceptron (MLP) neural network as a graph with fixed grouped nodes and uses the Ising energy to drop group of nodes. This method is an extension to our proposed Ising dropout method, which had the limit of solving the Ising energy model for MLPs with limited graph order. The proposed fixed grouping method enables applying drop-out to deep MLPs with any order. Performance of this method is evaluated on handwritten digits (MNIST), Fashion-MNIST, Free Spoken Digit Dataset (FSDD), and Street View House Numbers (SVHN) datasets and compared with the standard dropout and standout methods. Preliminary results show that the proposed approach can keep the classification performance competitive to the original network while eliminating optimization of unnecessary network parameters in each training cycle. This method can compress the inference model significantly while maintaining the classification performance.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"292 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132553080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Recovery of Event Related Potential Signals using Compressive Sensing and Kronecker Technique","authors":"S. A. Khoshnevis, S. Ghorshi","doi":"10.1109/GlobalSIP45357.2019.8969504","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969504","url":null,"abstract":"Brain-computer interfaces (BCIs) are devices that are developed to enable the brain to communicate with a machine directly. These devices usually make use of event-related potential (ERP) component of electroencephalography (EEG) signals. BCIs have several applications, but perhaps the most important one is to communicate with the advance neuromuscular patients. P300 Speller is a method that was developed by making use of BCIs and ERPs to make it possible to communicate with a computer through EEG recordings. The sensitive nature of these signals makes it essential to make sure they have a high recovery rate once they have been compressed. Compressive sensing (CS) is a compression method which takes advantage of the potential sparsity of the signals and aims to reconstruct a signal from a smaller number of measurements that is specified by the Nyquist theorem. CS has been studied in various signal processing areas. Because of the low power consumption and the elapsed time for generating CS measurements, CS became as one of the most efficient compression methods. In this work, we study the applicability of CS and its recovery quality for ERP signals. We run the experiments based on random and deterministic sensing matrices and two different sparsifying bases. The simulation results show that the ERP signal is very suitable for CS compression up to 75% compression ratio (CR). For the recovery phase, we investigate the effects of the recently developed preprocessing approach called Kronecker- based technique. By using Kronecker-based technique in recovery, we could recover the original signal with high accuracy up to 30 dB.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114151066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Admm for Gridless Dod and Doa Estimation in Bistatic Mimo Radar Based on Decoupled Atomic Norm Minimization with One Snapshot","authors":"Wen-Gen Tang, Hong Jiang, Qi Zhang","doi":"10.1109/GlobalSIP45357.2019.8969436","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969436","url":null,"abstract":"In this paper, the issue of gridless compressed sensing (CS)-based direction-of-departure (DOD) and direction-of- arrival (DOA) estimation for bistatic multiple-input multiple- output (MIMO) radar is investigated with one snapshot, and an alternating direction method of multipliers (ADMM) for 2D parameter estimation using decoupled atomic norm minimization (DANM) is proposed. In the proposed algorithm, the decoupled atom set and atomic norm for DOD and DOA estimation are defined, then the atomic norm is transformed into a semi-definite programming (SDP) minimization problem. To decrease the computational complexity of solving SDP using interior point method based SDPT3 solver in CVX toolbox, the DANM with ADMM is deduced, which can greatly decrease the running time, especially in the presence of large scale arrays. Finally, the DOD and DOA are obtained via shift-invariance parameter estimation. It overcomes the grid-mismatch effect of grid-partition of the conventional CS methods, and outperforms the traditional subspace-based methods. Numerical simulations are presented to demonstrate the estimation performance of the proposed algorithm.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"378 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131871962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-Mode Generalized Space-Time Index Modulation: A High-Rate Index Modulation Scheme for MIMO-ISI Channels","authors":"L. N. Theagarajan","doi":"10.1109/GlobalSIP45357.2019.8969398","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969398","url":null,"abstract":"In this paper, a new index modulation technique, referred to as the multi-mode generalized space-time index modulation (MM-GSTIM) is presented. MM-GSTIM is a high-rate index modulation scheme for multiantenna inter-symbol interference (MIMO-ISI) channels. The proposed MM-GSTIM is a single carrier scheme with zero padding (ZP). MM-GSTIM performs ‘mode-indexing’ to efficiently increase the spectral efficiency. In a given MM-GSTIM frame, disjoint subsets of space-time slots are used to transmit modulation symbols from disjoint modulation alphabets (referred to as the modes). The choice of which space-time slots are used for transmission of a particular mode in an MM-GSTIM matrix conveys information bits. Thus, MM-GSTIM is a generalization of the well-known index modulation schemes such as spatial modulation (SM), generalized spatial modulation (GSM), and space-time index modulation. Here, an analytical characterization of the diversity order of the proposed MM-GSTIM scheme and the conditions required to maximize the transmission rate of multi-mode index modulation schemes are derived. Through simulations, it is shown that the proposed MM-GSTIM scheme can achieve better performance compared to other modulation schemes in MIMO-ISI channels.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131955163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Ensemble Learning: A Communications Receiver Over Wireless Fading Channels","authors":"Amer Al-Baidhani, H. Fan","doi":"10.1109/GlobalSIP45357.2019.8969302","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969302","url":null,"abstract":"Deep learning algorithms have proven themselves powerful in different applications because of their ability of generalization. In this paper, we introduce a deep learning wireless communications receiver network that enables reliable data transmission over wireless multipath Rayleigh fading channels. Significant improvements in terms of Bit Error Rate (BER) are achieved in simulation for various channel models compared to the traditional receiver. We also present a training procedure that leads to better generalization of the network by dividing it into jointly trained subnetworks as a Deep Ensemble learner while leveraging the regularization by combining information from different subnetworks.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134407437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Distributed Sparse Activity Detection in Cell-Free Massive MIMO Systems","authors":"Mangqing Guo, M. C. Gursoy","doi":"10.1109/GlobalSIP45357.2019.8969500","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969500","url":null,"abstract":"Distributed sparse activity detection in cell-free massive multiple-input multiple-output (MIMO) systems is considered in this paper. At the beginning of each channel coherence interval, all the active users send pilots to the access points (APs). Then, each AP makes its own decision on the activity of all the users based on the approximate message passing (AMP) iterative procedure. Following this, the optimal fusion rule is used at the fusion center to make the final decisions on the activity of all the users based on the individual decisions and the corresponding reliability obtained at all the APs. The performance levels of this distributed sparse activity detection method are analyzed with Monte Carlo simulations.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126962893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anne-Carole Honfoga, T. Nguyen, M. Dossou, V. Moeyaert
{"title":"Application of FBMC to DVB-T2: a Comparison vs Classical OFDM Transmissions","authors":"Anne-Carole Honfoga, T. Nguyen, M. Dossou, V. Moeyaert","doi":"10.1109/GlobalSIP45357.2019.8969550","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969550","url":null,"abstract":"In this paper, filter bank based multicarrier/Offset Quadrature Amplitude Modulation (FBMC/OQAM) is compared with a Cyclic Prefix based OFDM (CP-OFDM). Indeed, due to its high level of compatibility with OFDM systems, FBMC system is proposed as a new technique for European Digital Video Broadcasting-Terrestrial Second generation (DVB-T2) systems. To meet the performance requirement of the DVB-T2, the approach is based on a technique named nearly perfect reconstruction (NPR) filter bank system. The main characteristic of this approach is the use of Frequency Sampling (FS) Sub-Channel Equalizer, based on Complex Finite Impulse Response (CFIR) filter, to deal with the high frequency selective impairments. Simulation results indicate that, compared to FBMC, CP-OFDM requires about 1dB extra Signal to Noise Ratio (SNR) value at a bit error rate of 10-3 for high order modulation formats and at large number of sub-carriers. As FBMC system discards the use of the cyclic prefix, commonly used in OFDM, the data rate can be increased of up to 14.84% or even 25% maintaining the same bandwidth utilization.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124814326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}