Hongxia Wang, Shengfeng Xu, Yuzheng Zhang, Qicheng Liu, Ji Deng, Yicheng Mou, Chengkun Shi
{"title":"Driving and localization design of a multi-sensor transport robot","authors":"Hongxia Wang, Shengfeng Xu, Yuzheng Zhang, Qicheng Liu, Ji Deng, Yicheng Mou, Chengkun Shi","doi":"10.1109/ICSP48669.2020.9320913","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320913","url":null,"abstract":"Based on the actual demand of a transport robot, in order to ensure the speediness and reliability, and reduce the requirement for power, we proposed the concept of reducing the degree of freedom of the robot arm to only one, which is in the vertical direction, to speed up the process of grabbing and placing. To make up for the loss of flexibility, we propose a design scheme of an omni-directional mobile robot which requires a more complex design of driving and global localization based on multi-sensor fusion.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116960986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Daniel Melesse, Mahmoud Khalil, Elias Kagabo, T. Ning, Kevin Huang
{"title":"Appearance-Based Gaze Tracking Through Supervised Machine Learning","authors":"Daniel Melesse, Mahmoud Khalil, Elias Kagabo, T. Ning, Kevin Huang","doi":"10.1109/ICSP48669.2020.9321075","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9321075","url":null,"abstract":"Applications that use human gaze have become increasingly more popular in the domain of human-computer interfaces, and advances in eye gaze tracking technology over the past few decades have led to the development of promising gaze estimation techniques. In this paper, a low-cost, in-house video camera-based gaze tracking system was developed, trained and evaluated. Seminal gaze detection methods constrained the application space to indoor conditions, and in most cases techniques required intrusive hardware. More modern gaze detection techniques try to eliminate the use of any additional hardware to reduce monetary cost as well as undue burden to the user, all the while maintaining accuracy of detection. In this work, image acquisition was achieved using a low-cost USB web camera mounted at a fixed position on the viewing screen or laptop. In order to determine the point of gaze, the Viola Jones face detection algorithm is used to extract facial features from the image frame. The gaze is then calculated using image processing techniques to extract gaze features, namely related to the image position of the pupil. Thousands of images are classified and labeled to form an in-house database. A multi-class Support Vector Machine (SVM) was trained and tested on this data set to distinguish point of gaze from input face image. Cross validation was used to train the model. Confusion matrices, accuracy, precision, and recall are used to evaluate the performance of the classification model. Evaluation of the proposed appearance-based technique using two different kernel functions is also assessed in detail.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121028675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Localization of Closely-Spaced Speech Sources Based on Small Microphone Arrays","authors":"Siyu Sun, Qinmengying Yan, Wusheng Zhang, Haijian Zhang","doi":"10.1109/ICSP48669.2020.9320993","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320993","url":null,"abstract":"Indoor source localization based on a microphone array is still a difficult problem especially for closely-spaced sources in strong reverberation environments, in such cases a large number of microphones are often utilized. Considering the industrial design and cost, source localization using a small microphone array in complex conditions is worthy of investigation. In this paper, we propose a small-size array based localization method under the condition of close proximity and strong reverberation. The essence of the proposed method lies in the detection of single source time-frequency (TF) points (SSPs) of each source, which simplifies the problem of multisource localization in close proximity into several single-source localization ones. The detected SSPs are then used for precise source localization in a sparse Bayesian framework, wherein the reflection coefficient can be simultaneously estimated by implementing a Bayesian dictionary learning. Experimental results confirm the effectiveness of our method in locating spatially-close sources compared with some state-of-the-art methods.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123714617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Efficient Lightweight Object Detector for Railway Tunnel Safety Monitoring","authors":"Enze Yang, Yuxin Liu, Shuoyan Liu","doi":"10.1109/ICSP48669.2020.9320914","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320914","url":null,"abstract":"Tunnel disaster usually poses a huge threat to trains and passengers, hence the monitoring of the tunnel environment becomes particularly important. In this paper, we aim to detect the potential tunnel disasters from the perspective of computer vision. An efficient lightweight network is proposed to mainly detect pedestrians and trains in tunnel, a lightweight backbone is leveraged to reduce the volume of network parameters and computational costs, while multi-scale feature fusion enhances the spatial and semantic features of various layers. As a foreground mask to the Gaussian Mixture Model, our detector aims to improve the performance of obstacle detection by reducing the false alarm rate. A number of experiments are carried out in the tunnel laboratory. According to experimental results, the detection framework proposed in this paper beats the state-of-the-art detectors in tunnel dataset. Further, the mask generated by the detector significantly decreases the false alarm rate of the Gaussian Mixture Model of obstacle detection, which proves that the framework proposed in this paper is applicable to practical tunnel safety monitoring.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"16 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122584037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bernoulli Track-before-detect Algorithm for Distributed Target with Unknown Amplitude Information","authors":"Ruofeng Yu, Wei Yang, Yaowen Fu, Wenpeng Zhang","doi":"10.1109/ICSP48669.2020.9321019","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9321019","url":null,"abstract":"Track-before-detect algorithm is an effective solution of detecting and tracking a low signal-to-noise ratio (SNR) target whose amplitude distribution characteristic is needed as the prior information. Under the point target hypothesis in most previous works, the amplitude of target is usually assumed to be known or modeled as a state variable. However, these approaches cannot simply be migrated to the extended target tracking problem because of the unknown target extended length. This paper considers the issue of extended target joint detection and tracking with unknown amplitude distribution information through the use of Bernoulli particle filter based track-before-detect (BPF-TBD) methods. The proposed heuristic algorithm accumulates the multi-frame measurement data along the potential track of the target by a sliding window and then extracts the amplitude distribution information by means of principal component analysis (PCA) method. Simulation results show that the property of the proposed method asymptotically converges to the exact filter with prior correct expected amplitude distribution information, which indicates a superior performance in terms of feasibility and effectiveness.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121907929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A New Weighted Loss for Single Channel Speech Enhancement under Low Signal-to-Noise Ratio Environment","authors":"Jian Xiao, Hongqing Liu, Yi Zhou, Zhen Luo","doi":"10.1109/ICSP48669.2020.9320989","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320989","url":null,"abstract":"This work studies the single channel speech enhancement problem in the case of low signal-to-noise ratio (SNR). To that aim, the supervised learning technique is utilized, where a new loss is developed to trade-off the speech distortion and residual noise. By a use of weighted combination of distortion and residual noise, the noise suppression and speech quality are considered simultaneously. In doing so, it also is easy to verify that the commonly used mean square error (MSE) loss is a special case of the proposed loss. Experimental results show, with the convolutional encoder-decoder-long short-term memory (CED-LSTM) network, the proposed loss outperforms the MSE and the recently proposed scale-invariant signal-to-distortion ratio (SI-SDR) loss.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128515720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Design of Integrated Waveform Based on MSK-LFM Signal","authors":"Wenxu Zhang, Heng Zhang","doi":"10.1109/ICSP48669.2020.9320941","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320941","url":null,"abstract":"At present, radar communication integration has become a research hotspot and has been widely concerned. Minimum Shift Keying-Linear Frequency Modulation, or MSK-LFM is a novel integrated waveform signal in radar communication. Its carrier is LFM signal, and the carrier is modulated by MSK to realize integrated function. For multipath interference in signal transmission. The spread spectrum technology based on MSK-LFM signal will be introduced in this paper. Simulation helped validate that the integrated waveform after spread spectrum is more stable. The integrated waveform has a thumbtack type ambiguity figure and shows the good ability to do multitarget detection.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129336738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhuorui Liang, Yong Wang, Xiaofeng Zhang, M. Xie, Xiongjun Fu
{"title":"Identification of Ship and Corner Reflector In sea clutter environment","authors":"Zhuorui Liang, Yong Wang, Xiaofeng Zhang, M. Xie, Xiongjun Fu","doi":"10.1109/ICSP48669.2020.9321063","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9321063","url":null,"abstract":"The corner reflector is an effective passive jamming method against radar detection. In the complex sea clutter environment, the interference effect of corner reflector is more obvious. In the sea clutter environment, a kind of environment in which ship and corner reflectors exist simultaneously is simulated, and obtain the radar echo of them. Then, the corresponding polarization scattering matrix is calculated by the model of ship and triangle plate corner reflector, and get the polarization invariant data from the matrix. SVM is used as a classifier to identify the ship and corner reflector with polarization invariant features. Finally, the corner reflector echo is filtered out, and achieve the purpose of identifying ship target and corner reflector. The results show the effectiveness of the algorithm.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121389618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ESinGAN: Enhanced Single-Image GAN Using Pixel Attention Mechanism for Image Super-Resolution","authors":"Wenyu Sun, Baodi Liu","doi":"10.1109/ICSP48669.2020.9320934","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320934","url":null,"abstract":"Recently, the SinGAN model emerged, and it was famous for generating from a single image. The SinGAN model achieves superior performance to other advanced models for image super-resolution task (trained on a dataset or a single image). However, SinGAN does not consider the importance of feature pixels on the feature map. In this paper, we propose an Enhanced SinGAN model (ESinGAN), an unconditional generative model that can improve the defects of SinGAN using the Pixel Attention mechanism. To evaluate the proposed ESinGAN model’s performance, we carry out the experiments on a benchmark dataset. Through the experiments, we have achieved better performance than SinGAN and proved the effectiveness of the proposed method. Not only is the image significantly improved visually, but the PSNR and SSIM values of the model are also considerably increased. Besides, ESinGAN runs as fast as SinGAN under the same experimental environment.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127785217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Bandpass Filter With Multi Deep Denoising Autoencoder for Hearing Applications","authors":"Raghad Yaseen Lazim, Xiaojun Wu","doi":"10.1109/ICSP48669.2020.9320899","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320899","url":null,"abstract":"Speech enhancement techniques in hearing applications aimed to improve the quality of speech in a noisy environment. Deep denoising autoencoder suppresses noise from noise corrupted speech efficiently. Unfortunately, previous applications provide only limited benefits for the enhancement of speech in noisy environments. This paper presents a new approach for the hearing application, which indicates two stages of the bandpass filter and a model composed of three levels of deep denoising autoencoders. In the first stage, the bandpass filter designed to allow signals based on the human cochlea, which then followed by a model of three levels of multilayers deep denoising autoencoder, each which specialized for specific enhancement task of a complete set of tasks. The approach performance measured using the perceptual evaluation of speech quality, hearing aid sound quality index, and segmental signal-to-noise ratio. The simulation results prove that the proposed method yielded higher intelligibility and quality in comparison with single-multilayers neural networks.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126325254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}