Jinfeng Kang, B. Gao, Peng Huang, Lifeng Liu, Xiaoyan Liu, H. Y. Yu, Shimeng Yu, H. Wong
{"title":"RRAM based synaptic devices for neuromorphic visual systems","authors":"Jinfeng Kang, B. Gao, Peng Huang, Lifeng Liu, Xiaoyan Liu, H. Y. Yu, Shimeng Yu, H. Wong","doi":"10.1109/ICDSP.2015.7252074","DOIUrl":"https://doi.org/10.1109/ICDSP.2015.7252074","url":null,"abstract":"Neuromorphic computing is an attractive computation paradigm with the features of massive parallelism, adaptivity to the complex input information, and tolerance to errors. As one of the most crucial components in a neuromorphic system, the electronic synapse requires high device integration density and low-energy consumption. Oxide-based resistive switching devices (RRAM) have emerged as the leading candidate to realize the synapse functions due to the extra-low energy loss per spike. This work will address the design and optimization of oxide-based RRAM synaptic devices and the impacts of the synaptic devices parameters on the performance of neuromorphic visual system. Possible solutions are also provided to suppress the intrinsic variation of the oxide-RRAM based synaptic devices to achieve high recognition accuracy and efficiency of neuromorphic visual systems.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129093991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Wideband DOA estimation by joint sparse representation under Bayesian learning framework","authors":"Lu Wang, Lifan Zhao, G. Bi, C. Wan","doi":"10.1109/ICDSP.2015.7251894","DOIUrl":"https://doi.org/10.1109/ICDSP.2015.7251894","url":null,"abstract":"Wideband direction of arrival (DOA) estimation is a practical problem frequently occurring in sonar application. Compared to the entire angular domain, targets only occupy a few directions and the received signals are considered to be sparse in the angular domain. It is further noted that signals in different spectrum bands show a strong joint sparsity due to the fact that targets from different directions share the spectrum. This paper exploits the joint sparsity of the signals and reformulates the DOA estimation problem under the Bayesian learning framework. The resulted method is a data-driven learning process and does not need the tedious parameter tuning. Comparing to the conventional delay-sum beamformer, the proposed method has the advantages of reduced number of sensors, reduced spatial aliasing and increased resolution. The improved performance is validated by real sonar data experiments.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"56 83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123505360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chaogeng Huang, Hong Xu, Yanhong Chen, Hongbo Song, Jing Zhang
{"title":"A new delta operator based IIR lattice filter structure","authors":"Chaogeng Huang, Hong Xu, Yanhong Chen, Hongbo Song, Jing Zhang","doi":"10.1109/ICDSP.2015.7251327","DOIUrl":"https://doi.org/10.1109/ICDSP.2015.7251327","url":null,"abstract":"Delta operator-based implementation over the conventional shift operator approach has been applied to the direct form based structure, and the satisfactory performance including low roundoff noise gain and low coefficient sensitivity, has been achieved. However, the approach for delta operator has not been applied into the lattice filter structure. In this paper, a novel lattice filter structure is derived based on the delta operator. The expression of roundoff noise gain for the proposed structure is derived. For an Nth order digital filter, the proposed structure requires 5N + 1 multipliers, which yields the same implementation complexity as the normalized lattice structure. A numerical example is presented to demonstrate the performance of the proposed structure.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121717461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A reconfigurable parallel FPGA accelerator for the kernel affine projection algorithm","authors":"X. Ren, Qihang Yu, Badong Chen, Nanning Zheng, Pengju Ren","doi":"10.1109/ICDSP.2015.7252008","DOIUrl":"https://doi.org/10.1109/ICDSP.2015.7252008","url":null,"abstract":"Kernel affine projection algorithm (KAPA) is an efficient online kernel learning method, because it not only inherits the advantages of other kernel adaptive filtering (KAF) algorithms, but also reduces the gradient noise significantly. More importantly, it provides a unifying framework for many KAF algorithms. However, suffering from huge computational load, especially when network size is large, it is not suitable for real-time applications. In order to extend its availability, we design a reconfigurable parallel FPGA accelerator for it. The generally used Gaussian kernel is chosen. Moreover, a novel quantization method is adopted to constrain the network size, so as to further reduce computational load and storage overhead. The proposed accelerator allows multiple input data to be processed simultaneously, accelerating the execution rate. Shift registers are used to record the results of different input data. The codebook and coefficients are updated for each input in sequential order along with the shifting of registers constantly. Finally, the FPGA accelerator with eight datapaths, which works at 100MHz, achieves an average speedup of 404.47 versus C code running on a 3GHz Intel(R) Core(TM) i5-2320 CPU.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124132038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Are subtle expressions too sparse to recognize?","authors":"A. Ngo, Sze‐Teng Liong, John See, R. Phan","doi":"10.1109/ICDSP.2015.7252080","DOIUrl":"https://doi.org/10.1109/ICDSP.2015.7252080","url":null,"abstract":"As subtle emotions are slightly and involuntarily expressed, they need to be recorded by high-speed camera. Though this high frame-per-second rate allows better capture of subtle expressions, it typically generates a lot of redundant frames with rapid varying illumination and noise but without significant motions. The redundancy is analyzed and eliminated by Sparsity-Promoting Dynamic Mode Decomposition (DMDSP), which helps synthesize dynamically condensed sequences. Moreover, DMDSP can also visualize dynamics of subtle expressions in both temporal and spectral domains. As meaningful subtle expressions are temporarily sparse, DMDSP would be able to extract these meaningful dynamics and improve recognition rates of subtle expressions. The hypothesis is evaluated on CASME II, a database of spontaneous subtle facial expressions. Recognition performance measured by F1-score, recall and precision metrics showed a significant leap of improvement when DMDSP is used to preserve a small percentage of meaningful frames in sequences with temporally high sparsity levels.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126552711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"WLS design of centro-symmetric 2-D FIR filters using matrix iterative algorithm","authors":"Ruijie Zhao","doi":"10.1109/ICDSP.2015.7251325","DOIUrl":"https://doi.org/10.1109/ICDSP.2015.7251325","url":null,"abstract":"The weighted least squares (WLS) design problem of two-dimensional (2-D) FIR filters with centro-symmetric response is considered in this paper. Its optimality condition is given and expressed as a pair of matrix equations, which involves two matrix variables, i.e., the coefficients of filters to be determined. Then, based on the optimality condition, a matrix iterative algorithm is developed to solve the design problem and its convergence is established using linear operator theory. Because the coefficients of filters are in their natural matrix forms in the algorithm, great savings in computations and memory space required are achieved. Finally, a design example and comparisons with existing methods are provided to illustrate the effectiveness and efficiency of the proposed algorithm.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125455842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
P. Chan, M. Dong, Yi Qian Lim, A. Toh, E. Chong, Mantita Yeo, Megan Chua, Haizhou Li
{"title":"Formant excursion in singing synthesis","authors":"P. Chan, M. Dong, Yi Qian Lim, A. Toh, E. Chong, Mantita Yeo, Megan Chua, Haizhou Li","doi":"10.1109/ICDSP.2015.7251852","DOIUrl":"https://doi.org/10.1109/ICDSP.2015.7251852","url":null,"abstract":"This paper presents our work in formant excursion in the human singing voice. In singing voice synthesis, numerous methods have been proposed to modify pitch and energy over time in order to achieve better expressiveness and naturalness [1]-[3]. Methods to modify the spectral envelop, however, remain conservative [4], [5]. An expressive singer, nevertheless, employs different techniques to modify his vocal spectra extensively throughout a song [6]. This motivates our study on formant excursion. We hypothesize that the level of semantic reliance on vowels limits the range of formant excursion and develop a method to find |Ξ|, a measure of isolated spectral distortion attributed to singing expressiveness, independent of the spectral differences inherent between speech and singing. With this, we are able to better parameterize spectral modifications in the singing voice towards a dynamic spectral model for singing synthesis.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125666981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Runtime techniques for efficient Ray-Tracing on heterogeneous systems","authors":"Chih-Chen Kao, W. Hsu","doi":"10.1109/ICDSP.2015.7251838","DOIUrl":"https://doi.org/10.1109/ICDSP.2015.7251838","url":null,"abstract":"The prevalence of real time multimedia delivery appliances has led to the developments of a wide variety of efficient architectures and supporting software techniques. Specifically, Ray-Tracing, a well-known physically-based rendering algorithm, has been receiving great attentions in research and development with the evolution of multi-core architecture since massive parallelism is inherent in that application. Unfortunately, the type of computation in Ray-tracing is known as an instance of irregular application which possesses attributes that may vary during execution and are often unpredictable, making it difficult to run efficiently on SIMD/SIMT based GPGPU architectures. For example, the irregularity in such applications may cause control flow divergence, load imbalance and low efficiency in the memory hierarchy of heterogeneous computing systems. To address these issues, researchers have been trying different approaches such as MIMD based homogeneous platform or specific hardware solutions. While these approaches tend to emphasize on dedicated special-purpose hardware configurations, our work illustrates that with appropriate analysis and tuning for irregularity within Ray-Tracing, it is possible to achieve high performance and high efficiency on current heterogeneous systems by applying software-based runtime approach. We studied and proposed phase guided dynamic work partitioning, a light-weight and fast analysis technique, to collect information during program phases at runtime in order to guide work partitioning in subsequent phases for more efficient work dispatching on heterogeneous systems. The experiments have shown that the performance gain of this approach can be as high as 5 times faster than the original system.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115779713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
C. Shahnaz, S. Sultana, S. Fattah, R. H. M. Rafi, I. Ahmmed, Weiping Zhu, M. Ahmad
{"title":"Emotion recognition based on EMD-Wavelet analysis of speech signals","authors":"C. Shahnaz, S. Sultana, S. Fattah, R. H. M. Rafi, I. Ahmmed, Weiping Zhu, M. Ahmad","doi":"10.1109/ICDSP.2015.7251881","DOIUrl":"https://doi.org/10.1109/ICDSP.2015.7251881","url":null,"abstract":"In this paper, a speech emotion recognition method is proposed based on wavelet analysis on decomposed speech data obtained via empirical mode decomposition (EMD). Instead of analyzing the given speech signal directly, first the intrinsic mode functions (IMFs) are extracted by using the EMD and then the discrete wavelet transform (DWT) is performed only on the selected dominant IMFs. Both approximate and detail DWT coefficients of the dominant IMF are taken into consideration. It is found that some higher order statistics of these EMD-DWT coefficients corresponding to different emotions exhibit distinguishing characteristics and these statistical parameters are chosen as the desired features. For the purpose of classification, K nearest neighbor (KNN) classifier is employed along with the hierarchical clustering. Extensive simulations are carried out on widely used EMO-DB speech emotion database containing four class emotions, namely angry, happy, sad and neutral. Simulation results show that the proposed EMD-Wavelet based feature can provide quite satisfactory recognition performance with reduced feature dimension.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132243221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Novel feature for identification of focal EEG signals with k-Means and fuzzy c-means algorithms","authors":"K. Rai, V. Bajaj, Anil Kumar","doi":"10.1109/ICDSP.2015.7251904","DOIUrl":"https://doi.org/10.1109/ICDSP.2015.7251904","url":null,"abstract":"In this paper, a new method for automatic identification of focal electroencephalogram (EEG) signals is proposed. Detection of focal EEG signals locates the epileptogenic area which is an important task for successful surgery. The proposed method is based on empirical mode decomposition (EMD) that uses the ratio of amplitude modulation bandwidth (BAM) and frequency modulation bandwidth (BFM), as a feature for identification of focal EEG signals. The feature average bandwidths ratio (AvgBratio) extracted from analytic intrinsic mode functions (IMFs) is set to input in k-Means and fuzzy c-mean (FCM) unsupervised learning. Statistical test Kruskal-Wallis shows the effective discrimination ability of the feature. The experimental results shows that proposed method is precisely proficient to classify focal and non-focal EEG signals using single narrow frequency band. A comparative analysis of both unsupervised learning techniques is performed by elapsed time, time complexity, and accuracy.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134197065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}