{"title":"EEG-based affect states classification using Deep Belief Networks","authors":"Haiyan Xu, K. Plataniotis","doi":"10.1109/DMIAF.2016.7574921","DOIUrl":"https://doi.org/10.1109/DMIAF.2016.7574921","url":null,"abstract":"Affective states classification has become an important part of the Brain-Computer Interface (HCI) study. In recent years, affective computing systems using physiological signals, such as ECG, GSR and EEG has shown very promising results. However, like many other machine learning studies involving physiological signals, the bottle neck is always around the database acquisition and the annotation process. To investigate potential ways to address this small sample problem, this paper introduces a Deep Belief Networks (DBN) based learning system for the EEG-based affective processing system. Through the greedy-layer pretraining using unlabeled data as well as a supervised fine-tuning process, the DBN-based approaches significantly reduced the number of labeled samples required. The DBN methods also acted as an application specific feature selector, by examining the weight vector between the input feature vector and the first invisible layer, we can gain much needed insights on the spatial or spectral locations of the most discriminating features. In this study, DBNs are trained on the narrow-band spectral features extracted from multichannel EEG recordings. To evaluate the efficacy of the proposed DBN-based learning system, we carried out an subject-independent affective states classification experiments on the DEAP database to classify 2-dimensional affect states. As a baseline to the proposed DBN approach, the same classification problem was also carried out using support vector machines (SVMs) and one-way ANOVA based feature selection process. The classification results shown that the proposed framework using Deep Belief Networks not only provided better classification performance, but also significantly lower the number of labeled data required to train such machine learning systems.","PeriodicalId":404025,"journal":{"name":"2016 Digital Media Industry & Academic Forum (DMIAF)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128439921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Vision-based engagement detection in Virtual Reality","authors":"Ghassem Tofighi, Haisong Gu, K. Raahemifar","doi":"10.1109/DMIAF.2016.7574933","DOIUrl":"https://doi.org/10.1109/DMIAF.2016.7574933","url":null,"abstract":"User engagement modeling for manipulating actions in vision-based interfaces is one of the most important case studies of user mental state detection. In a Virtual Reality environment that employs camera sensors to recognize human activities, we have to know were user intend to perform an action and when he/she is disengaged. Without a proper algorithm for recognizing engagement status, any kind of activities could be interpreted as manipulating actions, called “Midas Touch” problem. Baseline approach for solving this problem is activating gesture recognition system using some focus gestures such as waiving or raising hand. However, a desirable natural user interface should be able to understand user's mental status automatically. In this paper, a novel multi-modal model for engagement detection, DAIA 1, is presented. using DAIA, the spectrum of mental status for performing an action is quantized in a finite number of engagement states. For this purpose, a Finite State Transducer (FST) is designed. This engagement framework shows how to integrate multi-modal information from user biometric data streams such as 2D and 3D imaging. FST is employed to make the state transition smoothly using combination of several boolean expressions. Our FST true detection rate is 92.3% in total for four different states. Results also show FST can segment user hand gestures more robustly.","PeriodicalId":404025,"journal":{"name":"2016 Digital Media Industry & Academic Forum (DMIAF)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130047595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Assessing unreliability in OTT video QoE subjective evaluations using clustering with idealized data","authors":"Jie Jiang, P. Spachos, M. Chignell, L. Zucherman","doi":"10.1109/DMIAF.2016.7574940","DOIUrl":"https://doi.org/10.1109/DMIAF.2016.7574940","url":null,"abstract":"In this paper, we describe an Over-The-Top (OTT) video Quality of Experience (QoE) subjective evaluation experiment that was carried out to examine variations in the way subjects assess viewing experiences. The experiment focuses on different level of impairment and failure types, using 5-point measurement scales. Clustering is used to differentiate between unreliable and reliable participants, where reliability is defined in terms of criteria such as consistency of rating and ability to distinguish between qualitative differences in level of impairments. The results show that clustering a data set that is augmented with unreliable pseudo-participants can provide a new and improved perspective on individual differences in video QoE assessment.","PeriodicalId":404025,"journal":{"name":"2016 Digital Media Industry & Academic Forum (DMIAF)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134270013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enabling enterprise-scale systems using cloud-based personal media","authors":"S. Fels, J. C. A. Silva","doi":"10.1109/DMIAF.2016.7574938","DOIUrl":"https://doi.org/10.1109/DMIAF.2016.7574938","url":null,"abstract":"Personal cloud services are emerging as a disruptive technology for tools and services that use digital media assets. As the shift to a bring-your-own-device and app workplace becomes commonplace, the opportunity exists to leverage these personal services to create enterprise scale approaches. However, a key limiting feature is the ability to provide private and secure assets, whether they are photos, videos, audio and text in a unified way layered on top of personal cloud services. We discuss an approach to support this shift and illustrate its feasibility using a prototype secure email service that is layered on top of a popular unsecured cloud service. One of the primary benefits of our approach is that companies can create secure enterprise services on top of a third party cloud used by individuals alleviating the need for a complex IT infrastructure for digital media assets. These solutions are particularly relevant for emerging markets.","PeriodicalId":404025,"journal":{"name":"2016 Digital Media Industry & Academic Forum (DMIAF)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115352458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"HDR Video Coding based on a temporally constrained Tone Mapping Operator","authors":"C. Ozcinar, Paul Lauga, G. Valenzise, F. Dufaux","doi":"10.1109/DMIAF.2016.7574900","DOIUrl":"https://doi.org/10.1109/DMIAF.2016.7574900","url":null,"abstract":"Given its potential for more realistic rendering and enhanced user experience, High Dynamic Range (HDR) imaging is raising a lot of interest both in industry and academia. In this context, efficient representation and coding techniques are needed, as HDR video entails significantly higher raw data rate. In this paper, we present a temporally constrained content-adaptive Tone Mapping Operator (TMO) in order to convert the input HDR video into a reduced bit depth video sequence which is then encoded using High Efficiency Video Coding (HEVC). As the proposed TMO simultaneously takes into account the statistical characteristics of the input frame while better preserving temporal coherence of the tone mapped video sequence, it leads to improved coding efficiency. Experimental results show that the proposed technique compares favorably with existing methods in terms of rate-distortion when using the HDR-VDP-2.2.1 quality metric.","PeriodicalId":404025,"journal":{"name":"2016 Digital Media Industry & Academic Forum (DMIAF)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133953369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"High Dynamic Range versus Standard Dynamic Range compression efficiency","authors":"Ronan Boitard, M. Pourazad, P. Nasiopoulos","doi":"10.1109/DMIAF.2016.7574890","DOIUrl":"https://doi.org/10.1109/DMIAF.2016.7574890","url":null,"abstract":"High Dynamic Range (HDR) image and video technology aims at conveying the full range of perceptible shadow and highlight details with sufficient precision. HDR is regarded by many experts as the next evolution in digital media. However, industrial broadcasters have concerns regarding the bandwidth overhead that this new technology entails. While many consider that broadcasting HDR content would increase bandwidth requirements by around 20%, this number is based on studies where, in addition to the SDR main stream, HDR-related side information is conveyed. A recent subjective evaluation reported that encoding HDR video content in a single layer might require less bandwidth than its associated SDR version. Similar results were discussed in the MPEG ad-hoc group on High Dynamic Range and Wide Color Gamut. In this article, we explain how having more information can result in lower bandwidth requirements. To this end, we describe several limitations of the human vision system that, when exploited, optimize the HDR distribution pipeline for a human observer. Our theoretical assumption about the higher efficiency of HDR is backed up by a statistical analysis of pixel distribution in real images. The Spatial Index objective metric also reconfirms our assumption.","PeriodicalId":404025,"journal":{"name":"2016 Digital Media Industry & Academic Forum (DMIAF)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133431676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"New just noticeable coding distortion model for perceptual coding","authors":"Shengyang Xu, Mei Yu, G. Jiang, Shuqing Fang","doi":"10.1109/DMIAF.2016.7574928","DOIUrl":"https://doi.org/10.1109/DMIAF.2016.7574928","url":null,"abstract":"With the aim of improving the efficiency and perceptual quality in video coding, this paper proposes a novel just-noticeable coding distortion (JNCD) model that considers human visual perception redundancy and unreasonable factors of existing just-noticeable distortion (JND) models in the coding process. First, we design a psycho-physical experiment to analyze the just-noticeable gradient difference (JNGD) and build a JNGD model to filter the gradient components that are imperceptible to human eyes. We use total variation (TV) to decompose an image into a structural image and a textural image, and calculate their gradients. Then, we use JNGD to filter out imperceptible gradient components in each gradient image. Second, human visual sensitivity to different gradient magnitudes is analyzed to model the relationship between the human visual perceptible gradient magnitude and JNCD. Finally, considering the perceived difference of human eye perception in edge, flat, and textural regions of an image, we adjust the JNCD value in each region and establish a JNCD model of the whole image. To verify the efficiency of the proposed JNCD model, we compare it with the classic JND model and test it on the high-efficiency video coding (HEVC) platform. The proposed model has advantages in subjective visual effects, meaning that it is helpful in analysis of human visual perception redundancy and the relevant perceptual video coding.","PeriodicalId":404025,"journal":{"name":"2016 Digital Media Industry & Academic Forum (DMIAF)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124138185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"VisQuery: Visual querying of streaming data via pattern matching","authors":"Chenhui Li, G. Baciu, Yunzhe Wang","doi":"10.1109/DMIAF.2016.7574924","DOIUrl":"https://doi.org/10.1109/DMIAF.2016.7574924","url":null,"abstract":"Querying streaming data is becoming a dominant problem in big data analytics. A practical approach to querying streaming data is through traditional databases that have been modified to support streams, such as MySQL. However, conditional selection for querying data streams is currently an open challenge. We present a new visual framework that provides a more intuitive querying interaction for streaming data by combining visual selections on patterns with image processing techniques in order to better identify regions of interest. The main contribution of this paper is a novel method for matching patterns among normalized frames via feature vector clustering.","PeriodicalId":404025,"journal":{"name":"2016 Digital Media Industry & Academic Forum (DMIAF)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129369733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Perception-based Histogram Equalization for tone mapping applications","authors":"Stelios E. Ploumis, Ronan Boitard, M. Pourazad, P. Nasiopoulos","doi":"10.1109/DMIAF.2016.7574892","DOIUrl":"https://doi.org/10.1109/DMIAF.2016.7574892","url":null,"abstract":"Due to the ever increasing commercial availability of High Dynamic Range (HDR) content and displays, backward compatibility of HDR content with Standard Dynamic Range displays is currently a topic of high importance. Over the years, a significant amount of Tone Mapping Operators (TMOs) have been proposed to adapt HDR content to the restricted capabilities of SDR displays. Among them, the Histogram Equalization (HE) is considered to provide good results for a wide set of images. However, the naïve application of HE results either in banding artifacts or noise amplification when the HDR image has large unified areas (i.e. sky). In order to differentiate relevant information from noise in a uniform background, or in dark areas, the authors proposed a ceiling function. Their method results in noise-free but dim images. In this paper we propose a novel ceiling function which is based on the Perceptual Quantizer (PQ) function. Our method uses as threshold the number of code-words that PQ assigns on a luminance range in the original HDR image and the corresponding number of code-words in the resulting SDR image. We limit the number of code-words on SDR to be equal or less than the HDR. The saved code-words during the ceiling operation are redistributed to increase the contrast as well as the brightness of the final image. Results shows that provided SDR images are noise-free and brighter than the one obtained with prior HE operators. Finally since the proposed method is a Global TMO, it is thereby of low complexity and suitable for real time applications.","PeriodicalId":404025,"journal":{"name":"2016 Digital Media Industry & Academic Forum (DMIAF)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125615833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Joint antenna allocation and rate adaption for video transmission in massive MIMO systems","authors":"Bowen Liu, Heli Zhang, Hong Ji, Xi Li, Ke Wang","doi":"10.1109/DMIAF.2016.7574906","DOIUrl":"https://doi.org/10.1109/DMIAF.2016.7574906","url":null,"abstract":"Massive multi-input-multi-output (MIMO) networks could achieve higher data transmission rate benefited from the advantages of space diversity and multiplexing. In recent years, large amounts of research about different service adopted in massive MIMO network have been proposed. In this paper, we investigate instant video communication services requested by users in massive MIMO networks. After defining a detailed system model for video streaming in massive MIMO networks, we jointly consider the problem of antenna allocation and time-average video streaming scheduling. Since the problem is NP-hard, we reformulate it by decomposing the problem into two sub-problems that are antennas allocation and video packets queuing so that some fast common algorithms can be employed. To solve the two sub-problems, Enhanced Hungarian algorithm (EHA) and Enhanced Kuhn-Munkras algorithm (EKM) are designed for antenna allocation, and High Quality Fair Queuing (HQFQ) algorithm is proposed for video streaming scheduling. Consequently, numerical solution can be calculated in the time scale of real-life video streaming sessions. Various results demonstrate that our approach performs well in balance of quality of service and fairness to video streaming users.","PeriodicalId":404025,"journal":{"name":"2016 Digital Media Industry & Academic Forum (DMIAF)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128848604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}