2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)最新文献

筛选
英文 中文
Where Am I Looking: Localizing Gaze In Reconstructed 3D Space 我在看哪里:在重建的3D空间中定位凝视
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI: 10.1109/GlobalSIP45357.2019.8969158
Devarth Parikh, Yawen Lu, Yuan Xin, Di Wu, J. Pelz, G. Lu
{"title":"Where Am I Looking: Localizing Gaze In Reconstructed 3D Space","authors":"Devarth Parikh, Yawen Lu, Yuan Xin, Di Wu, J. Pelz, G. Lu","doi":"10.1109/GlobalSIP45357.2019.8969158","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969158","url":null,"abstract":"We propose a method to estimate the 3D gaze of the observer onto the scene using a portable eye tracker with a monocular camera. We reconstruct the 3D scene using Structure from Motion (SfM) and use camera pose and 3D reconstruction information of the scene to localize the position and pose of the observer in the 3D reconstructed space. Along with the position, we can obtain the 2D gaze of the observer using an eye-tracker. Each person may have a different perspective of the same 3D object in the scene, observing it from different positions. We use this information to fuse these multiple perspectives in 3D space to get a better understanding of how differently each observer perceives the same scene, compared to others. In the entire system, we developed a convo-lutional neural network to detect and track eye movement and a camera re-localization method to localize the camera in 3D environment. Based on our novel eye tracking and camera re-localization methods, we can accurately localize the gaze in the 3D reconstructed environment.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127795328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Geometric Convolutional Neural Network for 3D Object Detection 三维物体检测的几何卷积神经网络
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI: 10.1109/GlobalSIP45357.2019.8969077
Yawen Lu, Qianyu Guo, G. Lu
{"title":"A Geometric Convolutional Neural Network for 3D Object Detection","authors":"Yawen Lu, Qianyu Guo, G. Lu","doi":"10.1109/GlobalSIP45357.2019.8969077","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969077","url":null,"abstract":"We propose a method for accurate 3D vehicle detection based on geometric deep neural networks. From only a single RGB image, the framework is able to recover the 3D positions and predict 3D bounding boxes. In particular, the algorithm leverages single image depth estimation and semantic segmentation to produce 3D point cloud for specific objects. By geometrically constraining the object dimensions, an accurate and stable 3D bounding box which tightly fits into the real object can be estimated. We verify the effectiveness and robustness of our method by comparing with other recent state-of-art methods on the challenging KITTI 3D benchmark dataset as well as synthetic Virtual KITTI dataset. Without requiring ground truth 3D labels, our method is able to produce competitive and robust performance in 3D scene understanding and detection.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129112689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Low-Complexity Adaptive Switched Prediction-Based Lossless Compression of Time-Lapse Hyperspectral Image Data 基于低复杂度自适应切换预测的时移高光谱图像数据无损压缩
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI: 10.1109/GlobalSIP45357.2019.8969499
T. Shinde, A. Tiwari, Weiyao Lin
{"title":"Low-Complexity Adaptive Switched Prediction-Based Lossless Compression of Time-Lapse Hyperspectral Image Data","authors":"T. Shinde, A. Tiwari, Weiyao Lin","doi":"10.1109/GlobalSIP45357.2019.8969499","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969499","url":null,"abstract":"Time-lapse hyperspectral image (HSI) data has an enormous size and demands lossless compression for most of the high fidelity applications. In literature, spatial and spectral correlations in HSI are widely studied and used for compression. We propose a novel adaptive switched prediction-based scheme, which efficiently exploits temporal correlations in addition to spatial, and spectral correlations. The predictor switching uses a threshold, which is chosen based on the residual error distribution of already encoded band. Hence, our method does not need any overhead to be transmitted. The proposed scheme outperforms other state-of-the-art methods in bit-rate, and the method is computationally efficient too.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"145 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133720894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GSP Analysis of Brain Imaging Data from Athletes with History of Multiple Concussions 多发脑震荡运动员脑成像数据的GSP分析
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI: 10.1109/GlobalSIP45357.2019.8969487
Saurabh Sihag, S. Naze, F. Taghdiri, M. Tartaglia, J. Kozloski
{"title":"GSP Analysis of Brain Imaging Data from Athletes with History of Multiple Concussions","authors":"Saurabh Sihag, S. Naze, F. Taghdiri, M. Tartaglia, J. Kozloski","doi":"10.1109/GlobalSIP45357.2019.8969487","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969487","url":null,"abstract":"Study of neurological disorders affecting the structure-function relationships in the brain has been an ongoing challenge in neuroscience. Joint analysis of structure and function of the brain may disentangle a number of mechanisms and operations that can help interpret the interdependence between white matter degeneration and degradation of cognitive abilities. In this scenario, graph signal processing analysis of different signals generated within the physical structure of the brain may provide new insights and corroborate existing clinical findings. This paper illustrates the utility of graph signal processing tools in the joint analysis of diffusion and functional magnetic resonance imaging (i.e. dMRI and fMRI) data collected from a population of former athletes with a history of multiple concussions, and healthy subjects. Specifically, the distributions of the energy of low-graph-frequency components of the functional networks (derived from fMRI) are observed to be significantly different for fronto-temporal regions of the brain in athletes and healthy subjects. Furthermore, for the two groups of subjects, we observe significantly different associations between the ages of subjects and the energies of high graph frequency components in lingual region. While the effect on fronto-temporal regions for former athletes is in line with the existing clinical studies on concussion, significantly different associations between age and features extracted using GSP for the two groups of subjects could inform future clinical applications and medical diagnosis.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133568369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Modeling and Recovery of Graph Signals and Difference-Based Signals 图信号和差分信号的建模与恢复
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI: 10.1109/GlobalSIP45357.2019.8969536
Ariel Kroizer, Yonina C. Eldar, T. Routtenberg
{"title":"Modeling and Recovery of Graph Signals and Difference-Based Signals","authors":"Ariel Kroizer, Yonina C. Eldar, T. Routtenberg","doi":"10.1109/GlobalSIP45357.2019.8969536","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969536","url":null,"abstract":"In this paper, we consider the problem of representing and recovering graph signals with a nonlinear measurement model. We propose a two-stage graph signal processing (GSP) framework. First, a GSP representation is obtained by finding the graph filter that best approximates the known measurement function. The new GSP representation enables performing tractable operations over graphs, as well as gaining insights into the signal graph-frequency contents. Then, we formulate the signal recovery problem under the smoothness constraint and derive a regularized least-squares (LS) estimator, which is obtained by applying the inverse of the approximated graph filter on the nonlinear measurements. In the second part of this paper, we investigate the proposed recovery and representation approach for the special case of graph signals that are influenced by the differences between vertex values only. Difference-based graph signals arise, for example, when modeling power signals as a function of the voltages in electrical networks. We show that any difference-based graph signal corresponds to a filter that lacks the zero-order filter coefficient, and thus, these signals can be recovered up to a constant by the regularized LS estimator. In our simulations, we show that for the special case of state estimation in power systems the proposed GSP approach outperforms the state-of-the-art estimator in terms of total variation.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122236590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
An Accurate Evaluation of MSD Log-likelihood and its Application in Human Action Recognition MSD对数似然的准确评价及其在人体动作识别中的应用
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI: 10.1109/GlobalSIP45357.2019.8969324
Nuha Zamzami, N. Bouguila
{"title":"An Accurate Evaluation of MSD Log-likelihood and its Application in Human Action Recognition","authors":"Nuha Zamzami, N. Bouguila","doi":"10.1109/GlobalSIP45357.2019.8969324","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969324","url":null,"abstract":"In this paper, we examine the problem of modeling overdispersed frequency vectors that are naturally generated by several machine learning and computer vision applications. We consider a statistical framework based on a mixture of Multinomial Scaled Dirichlet (MSD) distributions that we have previously proposed in [1]. Given that the likelihood function plays a key role in statistical inference, e.g. in maximum likelihood estimation and Fisher information matrix investigation, we propose to improve the efficiency of computing the MSD log-likelihood by approximating its function based on Bernoulli polynomials. As compared to [1], the log-likelihood function is computed using the proposed mesh algorithm and a model selection approach is seamlessly integrated with the parameters estimation. The improved clustering framework offers a good compromise between other techniques and improves the approach used before for the same model. The merits of the proposed approach are validated via a challenging application that involves human action recognition.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127008395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Continuous Parkinsonian Tremor Estimation Using Motion Data 用运动数据估计连续帕金森震颤
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI: 10.1109/GlobalSIP45357.2019.8969093
Murtadha D. Hssayeni, J. Jimenez-shahed, M. Burack, B. Ghoraani
{"title":"Continuous Parkinsonian Tremor Estimation Using Motion Data","authors":"Murtadha D. Hssayeni, J. Jimenez-shahed, M. Burack, B. Ghoraani","doi":"10.1109/GlobalSIP45357.2019.8969093","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969093","url":null,"abstract":"Tremor is one of the main symptoms of Parkinson’s Disease (PD) that reduces the quality of life of affected patients. Tremor is measured as part of the Unified Parkinson Disease Rating Scale (UPDRS) part III. However, the assessment is based on onsite physical examinations and do not necessarily represent the patients’ tremor experience in their day-to-day life. In this work, we developed two methods based on deep long short-term memory (LSTM) networks and gradient tree boosting to estimate Parkinsonian tremor using gyroscope sensor signals collected as the patients performed a variety of free body movements. The developed methods were assessed on data from 24 PD subjects. Subject-based, leave-one-out cross-validation demonstrated that the method based on gradient tree boosting provided a high correlation (r=0.93 (p<0.0001)) between the estimated and clinically-assessed tremor subscores in comparison to the LSTM-based method with (r=0.77 (p<0.0001)).","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132483553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A QoE-Based Alarm Model for Terminal Video Quality 基于qos的终端视频质量报警模型
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI: 10.1109/GlobalSIP45357.2019.8969366
X. Peng, Yiping Duan, Bingrui Geng, Xiwen Liu, Xiaoming Tao, N. Ge
{"title":"A QoE-Based Alarm Model for Terminal Video Quality","authors":"X. Peng, Yiping Duan, Bingrui Geng, Xiwen Liu, Xiaoming Tao, N. Ge","doi":"10.1109/GlobalSIP45357.2019.8969366","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969366","url":null,"abstract":"With the advent of the 5G era, quality of experience (QoE) has become one of the most important indicators to measure mobile video quality. If QoE degradation events can be accurately predicted, it will be beneficial for operators to provide better services. However, many factors could affect QoE, including users’ subjective factors, network and users’ terminal parameters. This paper proposes a model for QoE degradation alarm in the mobile video stream. Specifically, a large-scale dataset is collected in the practical environment by a self-developed application. The dataset is cleaned to reduce the influence of redundancy and outliers firstly. Then, a QoE degradation alarm model is proposed based on data-driven approaches. The random forest is chosen as the core of this model which integrates the advantages of several classification tree and can deal with the high-dimensional data. In order to have a better prediction performance, Synthetic Minority Oversampling Technique (SMOTE) is used for solving the imbalance problem of the train set. As is shown by the experimental results, this alarm model can strike a balance between precision and false alarm rate and performance outperforms the state-of-the-art methods. This result means the proposed model can save the operational cost while increasing users’ satisfaction with the mobile video quality.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133197205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Deep Neural Hybrid Beamforming for Multi-User mmWave Massive MIMO System 多用户毫米波大规模MIMO系统的深度神经混合波束形成
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI: 10.1109/GlobalSIP45357.2019.8969154
Jiyun Tao, Jing Xing, Jienan Chen, Chuan Zhang, Shengli Fu
{"title":"Deep Neural Hybrid Beamforming for Multi-User mmWave Massive MIMO System","authors":"Jiyun Tao, Jing Xing, Jienan Chen, Chuan Zhang, Shengli Fu","doi":"10.1109/GlobalSIP45357.2019.8969154","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969154","url":null,"abstract":"Hybrid beamforming (HB) has emerged as a promising technology to support ultra high transmission capacity and with low complexity for Millimeter Wave (mmWave) multiple-input and multiple-output (MIMO) system. However, the design of digital and analog beamformer is a challenge task with non-convex optimization, especially for the multi-user scenario. Recently, the blooming of deep learning research provides a new vision for the signal processing of communication system. In this work, we propose a deep neural network based HB for the multi-User mmWave massive MIMO system, referred as DNHB. The HB system is formulated as an autoencoder neural network, which is trained in a style of end-to-end self-supervised learning. With the strong representation capability of deep neural network, the proposed DNHB exhibits superior performance than the traditional linear processing methods. According to the simulation results, DNHB outperforms about 2 dB in terms of bit error rate (BER) performance compared with existing methods.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"291 1-2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133037374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Pain Detection from Facial Videos Using Two-Stage Deep Learning 基于两阶段深度学习的面部视频疼痛检测
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Pub Date : 2019-11-01 DOI: 10.1109/GlobalSIP45357.2019.8969274
Guglielmo Menchetti, Zhanli Chen, Diana J. Wilkie, R. Ansari, Y. Yardimci, A. Enis Cetin
{"title":"Pain Detection from Facial Videos Using Two-Stage Deep Learning","authors":"Guglielmo Menchetti, Zhanli Chen, Diana J. Wilkie, R. Ansari, Y. Yardimci, A. Enis Cetin","doi":"10.1109/GlobalSIP45357.2019.8969274","DOIUrl":"https://doi.org/10.1109/GlobalSIP45357.2019.8969274","url":null,"abstract":"A new method to objectively measure pain using computer vision and machine learning technologies is presented. Our method seeks to capture facial expressions of pain to detect pain, especially when a patients cannot communicate pain verbally. This approach relies on using Facial muscle-based Action Units (AUs), defined by the Facial Action Coding System (FACS), that are associated with pain. It is impractical to use human FACS coding experts in clinical settings to perform this task as it is too labor-intensive and recent research has sought computer-based solutions to the problem. An effective automated system for performing the task is proposed here in which we develop an end-to-end deep learning-based Automated Facial Expression Recognition (AFER) that jointly detects the complete set of pain-related AUs. The facial video clip is processed frame by frame to estimate a vector of AU likelihood values for each frame using a deep convolutional neural network. The AU vectors are concatenated to form a table of AU values for a given video clip. Our results show significantly improved performance compared with those obtained with other known methods.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133805224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信