Yawen Lu, Sophia Kourian, C. Salvaggio, Chenliang Xu, G. Lu
{"title":"Single Image 3D Vehicle Pose Estimation for Augmented Reality","authors":"Yawen Lu, Sophia Kourian, C. Salvaggio, Chenliang Xu, G. Lu","doi":"10.1109/GlobalSIP45357.2019.8969201","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969201","url":null,"abstract":"The intent of this paper is to introduce a novel method for 3D vehicle pose estimation, a critical component of augmented reality. The proposed method is able to recover the location of a specific object from a single image by combining pretrained reliable semantic segmentation and improved single image depth estimation. Our method exploits a novel pose estimation technique by generating new 2D images created from the projections of rotated point clouds. The rotation of the specific object is able to be predicted. Augmented objects can be shifted, rotated and scaled correctly based on the recovered orientation and localization. Through accurate vehicle pose estimation, virtual vehicles are able to be augmented accurately in place of real vehicles. The effectiveness of our method is verified by comparison with other recent pose estimation methods on the challenging KITTI 3D benchmark. Further experiments on the Cityscapes dataset also demonstrates good robustness in the method. Without requiring ground truth 3D vehicle pose labels for training, our model is able to produce competitive and robust performance in 3D vehicle pose estimation.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126714131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Radar as a Security Measure - Real Time Neural Model based Human Detection and Behaviour Classification","authors":"Prakhar Kaushik","doi":"10.1109/GlobalSIP45357.2019.8969161","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969161","url":null,"abstract":"Radar technology has great potential for use as security systems in college campuses with privacy advantages over its visual counterparts. We experimented with the Texas Instruments IWR1642 radar sensor to evaluate the feasibility of using radar systems for security monitoring. We introduce an end-to-end neural architecture which is capable of taking radar data inputs in real time and identify human versus nonhuman targets, and classify various human behavioral motions. The model is real time, robust to noise and displays state-of-the-art results for the problem.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126380238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"3-D MIMO-SAR Imaging Using Multi-Chip Cascaded Millimeter-Wave Sensors","authors":"Muhammet Emin Yanik, Dan Wang, M. Torlak","doi":"10.1109/GlobalSIP45357.2019.8969133","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969133","url":null,"abstract":"Integration of multi-chip cascaded multiple-input multiple-output (MIMO) millimeter-wave (mmWave) sensors with synthetic aperture radar (SAR) imaging will enable cost-effective and scalable solutions for a variety of applications including security, automotive, and surveillance. In this paper, the first three-dimensional (3-D) holographic MIMO-SAR imaging system using cascaded mmWave sensors is designed and implemented. The challenges imposed by the use of cascaded mmWave sensors in high-resolution MIMO-SAR imaging systems are discussed. Especially, important signal processing functions such as near-field multistatic image reconstruction suitable for large MIMO apertures, multi-channel array calibration, spatial sampling, and image resolution are presented in the context of 3-D MIMO-SAR imaging. The prototyped 3-D MIMO-SAR imaging system is described in detail, along with various real imaging results.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115860489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Divide-and-Conquer Framework for Attention-based Combination of Multiple Investment Strategies","authors":"Xiao Yang, Weiqing Liu, Lewen Wang, Cheng Qu, Jiang Bian","doi":"10.1109/GlobalSIP45357.2019.8969091","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969091","url":null,"abstract":"In order to maximize the profit, investors usually examine diverse investment strategies based on various information when constructing their portfolios. However, they can hardly always construct a profitable portfolio due to the dynamic performance of the strategies and the dynamics of the market state along with the time. To address this challenge, we propose a 2D-attention framework to capture the dynamics of the above two factors in this paper. To capture the dynamic of the first factor, we design a strategy-wise attention model to dynamically combine multiple strategies according to their respective effectiveness. To deal with the second factor, we design a divide-and-conquer framework to learn multiple strategy-wise attention models, which categorizes the whole market periods into a few stable states and jointly learn respective models for each state and then build a state-wise attention model to combine them dynamically for the final task. Extensive experiments on real-world data demonstrate that our 2D-attention framework can significantly outperform several widely-used baselines.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121669811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On Theoretical Optimization of the Sensing Matrix for Sparse-Dictionary Signal Recovery","authors":"Jianchen Zhu, Shengjie Zhao, Xu Ma, G. Arce","doi":"10.1109/GlobalSIP45357.2019.8969228","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969228","url":null,"abstract":"Compressive Sensing (CS) is a new paradigm for the efficient acquisition of signals that have sparse representation in a certain domain. Traditionally, CS has provided numerous methods for signal recovery over an orthonormal basis. However, modern applications have sparked the emergence of related methods for signals not sparse in an orthonormal basis but in some arbitrary, perhaps highly overcomplete, dictionary, particularly due to their potential to generate different kinds of sparse representation of signals. Here, we first propose the Signal space Subspace Pursuit (SSSP) algorithm, and then we derive a low bound on the number of measurements required. The algorithm has low computational complexity and provides high recovery accuracy.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122916235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ibrahim Al-Omari, A. Rahimnejad, S. Gadsden, M. Moussa, H. Karimipour
{"title":"Power System Dynamic State Estimation Using Smooth Variable Structure Filter","authors":"Ibrahim Al-Omari, A. Rahimnejad, S. Gadsden, M. Moussa, H. Karimipour","doi":"10.1109/GlobalSIP45357.2019.8969306","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969306","url":null,"abstract":"With the integration of distributed energy resources (DER) traditional power systems evolved toward modernized smart grids. Although smart grids open up the possibility for more reliable and secure energy management, they impose new challenges on real-time monitoring and control of the power grid. State estimation is a key function which plays a vital role in reliable system control. In this paper, the smooth variable structure filter (SVSF) is used for power system dynamic state estimation (DSE). SVSF is a predictor-corrector based approach which can be applied to both linear and nonlinear system with the ability to deal with the system uncertainties. The simulation results on a single machine with infinite bus power network shows the superiority of the proposed SVSF compared to extended Kalman filter (EKF) and unscented Kalman filter (UKF). The results of the proposed method show a significant smoothness and accuracy in its performance compared to those obtained from EKF and UKF approaches; in particular, in the presence of measurement outliers.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"134 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131124223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Saurabhchand Bhati, Chunxi Liu, J. Villalba, J. Trmal, S. Khudanpur, N. Dehak
{"title":"Bottom-Up Unsupervised Word Discovery via Acoustic Units","authors":"Saurabhchand Bhati, Chunxi Liu, J. Villalba, J. Trmal, S. Khudanpur, N. Dehak","doi":"10.1109/GlobalSIP45357.2019.8969225","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969225","url":null,"abstract":"Unsupervised term discovery is the task of identifying and grouping reoccurring word-like patterns from the untranscribed audio data. It facilitates unsupervised acoustic model training in zero resource setting where no or minimal transcribed speech is available. In this paper, we investigate two-step bottom-up approaches for unsupervised discovery of word-like units. The first step discovers phone-like acoustic units from data and the second step combines the basic acoustic blocks to identify word-like units. We investigated Embedded Segmental K-means and Nested Hierarchical Pitman-Yor (PYR) model as bottom-up strategies. ESK-Means iteratively selects boundaries from an initial set to arrive at the word boundaries. The final performance critically depends on the quality of the initial boundaries. We used a segmentation method that discovers boundaries much closer to actual boundaries. PYR model has been used for word segmentation from space removed text data, and here we use it for word discovery from unsupervised acoustic units. The term discovery performance is evaluated on the Zero Resource 2017 challenge dataset, which consists of around 70 hours of unlabelled data. Our systems outperformed the baseline systems on all the languages without language-specific parameter tuning. We performed comprehensive experiments of the system parameters on the system performance.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131283155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yuheng Wang, Jiayue Cai, Daniel C. Louie, H. Lui, Tim K. Lee, Z. J. Wang
{"title":"Classifying Melanoma and Seborrheic Keratosis Automatically with Polarization Speckle Imaging","authors":"Yuheng Wang, Jiayue Cai, Daniel C. Louie, H. Lui, Tim K. Lee, Z. J. Wang","doi":"10.1109/GlobalSIP45357.2019.8969331","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969331","url":null,"abstract":"Skin cancer is the most common cancer in western countries with a high incidence rate. Among all different types of skin cancers, malignant melanoma is the most fatal but has a promising prognosis if it is detected and treated at the early stages. However, melanoma often resemble to seborrheic keratosis (SK), a benign skin condition, and cause mis-diagnosis. Therefore, it is important to develop a framework with computer aided system and non-invasive techniques to assist in the clinical diagnosis of melanoma. In this study, we extend a recent polarization speckle imaging method based on depolarization rate and achieved automatic detection of melanoma by leveraging the power of machine learning strategies. We collected 143 malignant melanoma and seborrheic keratosis lesions. Different machine learning methods, including support vector machine, random forest and k-nearest neighbor, were employed for the classification between melanoma and seborrheic keratosis. In order to explore the impact of different light sources, we further compared the classification performance of depolarization rate with blue and red light sources using different classifiers. The results suggested that the most reliable classification performance was achieved by support vector machine, yielding a high accuracy of 86.31% and the most balanced performance between sensitivity and specificity. In addition, the depolarization rate with the blue light source demonstrated a consistently better performance than that with the red light source across different methods. Our promising classification performance shows evidence for the potentials of computer aided diagnosis of melanoma with polarization speckle imaging, providing an additional non-invasive in vivo tool for skin cancer detection which could benefit future clinical dermatology research.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127681804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Learning Methods for Image Segmentation Containing Translucent Overlapped Objects","authors":"Tayebeh Lotfi Mahyari, R. Dansereau","doi":"10.1109/GlobalSIP45357.2019.8969558","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969558","url":null,"abstract":"Convolutional neural networks(CNN) are a subset of deep learning methods recently used widely for image segmentation. SegNet network [4] has shown interesting results for semantic segmentation, but it is designed to segment images with non-overlapped objects. However in some data translucent regions partially overlap. Having overlapped regions will cause methods not designed for overlapped objects to perform poorly or not work at all. To our knowledge no CNN has been designed yet to segment partially overlapped translucent objects.In this paper, we have designed a CNN to segment partially overlapped translucent regions. We used SegNet [4] as transfer learning for our overlapped image segmentation method. We also designed a new CNN with a simpler network for our data. Results on synthetic images give more than 95% segmentation accuracy for both methods.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132783157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust Minimum Variance Distortionless Response Beamformer based on Target Activity Detection in Binaural Hearing Aid Applications","authors":"Hala As’ad, M. Bouchard, Homayoun Kamkar-Parsi","doi":"10.1109/GlobalSIP45357.2019.8969387","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969387","url":null,"abstract":"In this work, a new beamforming algorithm for binaural hearing aids is introduced, with a good robustness to errors in the estimated target direction of arrival. The proposed design is based on a selection between a monaural MVDR beamformer and a binaural MVDR beamformer at each time-frequency bin. The decision of using either the monaural MVDR or the binaural MVDR at each time-frequency bin depends on a proposed target activity detection system, over an a priori determined target zone. The proposed target activity detection system uses a binaural beamforming method in target-cancelling mode, in order to get an estimate of the dominant speaker direction at each time-frequency bin. Simulations are performed using signals and propagation models obtained from real-life multichannel binaural hearing aids, to evaluate the resulting noise reduction and target distortion. Simulation results show the robustness of the proposed design to errors in the estimated target direction of arrival, in comparison with both the monaural MVDR and binaural MVDR algorithms.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133353327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}